Instella ✨ Collection Announcing Instella, a series of 3-billion-parameter language models developed by AMD, trained from scratch on 128 Instinct MI300X GPUs. • 5 items • Updated about 13 hours ago • 3
Hallucination detection Collection Trained ModernBERT (base and large) for detecting hallucinations in LLM responses. The models are trained as token classifiers. • 4 items • Updated about 22 hours ago • 14
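A minimal sketch of how a token-classification hallucination detector like the ones in this collection could be queried with the transformers library. The checkpoint name and label convention below are assumptions for illustration, not details taken from the collection card.

```python
# Sketch: flagging hallucinated tokens in an LLM answer with a
# ModernBERT-based token classifier via transformers.
# The model ID and label mapping are hypothetical placeholders.
from transformers import AutoTokenizer, AutoModelForTokenClassification
import torch

model_id = "your-org/modernbert-hallucination-detector"  # hypothetical checkpoint
tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForTokenClassification.from_pretrained(model_id)

context = "The Eiffel Tower is 330 metres tall."
answer = "The Eiffel Tower is 500 metres tall."

# Encode context and answer as a pair so the classifier sees both.
inputs = tokenizer(context, answer, return_tensors="pt", truncation=True)
with torch.no_grad():
    logits = model(**inputs).logits  # shape: (1, seq_len, num_labels)

# Assumed label convention: index 1 = "hallucinated" token.
pred = logits.argmax(dim=-1)[0]
tokens = tokenizer.convert_ids_to_tokens(inputs["input_ids"][0])
flagged = [t for t, p in zip(tokens, pred.tolist()) if p == 1]
print("Tokens flagged as hallucinated:", flagged)
```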
LettuceDetect: A Hallucination Detection Framework for RAG Applications Paper • 2502.17125 • Published 10 days ago • 7
Rank1: Test-Time Compute for Reranking in Information Retrieval Paper • 2502.18418 • Published 9 days ago • 24
DRAMA: Diverse Augmentation from Large Language Models to Smaller Dense Retrievers Paper • 2502.18460 • Published 9 days ago • 1
rank1 Collection rank1 is the first test-time compute reasoning model for information retrieval • 15 items • Updated 7 days ago • 3
The Ultimate Collection of Code Classifiers Collection 🔥 15 classifiers, 124M parameters, one per programming language, for assessing the educational value of GitHub code • 15 items • Updated 14 days ago • 10
Article Introducing Three New Serverless Inference Providers: Hyperbolic, Nebius AI Studio, and Novita 🔥 17 days ago • 93
Article From Chunks to Blocks: Accelerating Uploads and Downloads on the Hub 23 days ago • 49
Article From Llasa to Llasagna 🍕: Finetuning LLaSA to generate Italian speech and other languages By Steveeeeeeen and 1 other • 23 days ago • 26
SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model Paper • 2502.02737 • Published 30 days ago • 198