Open to Collab

Solomatin Roman

Samoed

AI & ML interests

None yet

Recent Activity

updated a dataset 3 days ago

mteb/SoundDescsA2TRetrieval

published a dataset 3 days ago

mteb/SoundDescsA2TRetrieval

updated a dataset 3 days ago

mteb/SoundDescsT2ARetrieval

View all activity

Organizations

upvoted a paper 15 days ago

GradMem: Learning to Write Context into Memory with Test-Time Gradient Descent

Paper • 2603.13875 • Published Mar 14 • 36

upvoted a paper 19 days ago

Prism-Reranker: Beyond Relevance Scoring -- Jointly Producing Contributions and Evidence for Agentic Retrieval

Paper • 2604.23734 • Published 22 days ago • 3

upvoted an article 25 days ago

Article

DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models

lightonai

•

26 days ago

• 38

upvoted 2 articles about 1 month ago

Article

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

tomaarsen

•

Apr 16

• 71

Article

Multimodal Embedding & Reranker Models with Sentence Transformers

tomaarsen

•

Apr 9

• 59

upvoted a paper about 1 month ago

Beyond Hard Negatives: The Importance of Score Distribution in Knowledge Distillation for Dense Retrieval

Paper • 2604.04734 • Published Apr 6 • 12

upvoted an article about 1 month ago

Article

BidirLM: Turning Generative LLMs into the Best Open-Source Omnimodal Encoders

Nicolas-BZRD

•

Apr 7

• 27

upvoted a paper about 1 month ago

BidirLM: From Text to Omnimodal Bidirectional Encoders by Adapting and Composing Causal LLMs

Paper • 2604.02045 • Published Apr 2 • 37

upvoted a paper about 2 months ago

LMEB: Long-horizon Memory Embedding Benchmark

Paper • 2603.12572 • Published Mar 13 • 73

upvoted an article 3 months ago

Article

ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models?

lightonai

•

Feb 19

• 21

upvoted 2 papers 3 months ago

ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models

Paper • 2602.16609 • Published Feb 18 • 7

MAEB: Massive Audio Embedding Benchmark

Paper • 2602.16008 • Published Feb 17 • 25

upvoted a collection 3 months ago

LateOn-Code 💻

Collection

State-of-the-art late interaction code retrieval models • 6 items • Updated Apr 7 • 20

upvoted 3 articles 3 months ago

Article

LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling

lightonai

•

Feb 12

• 56

Article

Community Evals: Because we're done trusting black-box leaderboards over the community

burtenshaw, SaylorTwift, kramp, merve, davanstrien, nielsr, julien-c

•

Feb 4

• 89

Article

Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s Top Model

nvidia

•

Feb 4

• 28

upvoted an article 4 months ago

Article

🥃 Distilling Tiny Embeddings

NeuML

•

Jan 10

• 23

upvoted a collection 4 months ago

Qwen3-VL-Embedding

Collection

2 items • Updated Jan 8 • 68

upvoted a collection 5 months ago

SauerkrautLM-Vision-Document-Retrieval

Collection

7 items • Updated Dec 15, 2025 • 9

upvoted a paper 5 months ago

T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground

Paper • 2512.10430 • Published Dec 11, 2025 • 119

Solomatin Roman

AI & ML interests

Recent Activity

Organizations

Samoed's activity

DenseOn with the LateOn: Open State-of-the-Art Single and Multi-Vector Models

Training and Finetuning Multimodal Embedding & Reranker Models with Sentence Transformers

Multimodal Embedding & Reranker Models with Sentence Transformers

BidirLM: Turning Generative LLMs into the Best Open-Source Omnimodal Encoders

**ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models?**

LateOn-Code & ColGrep: LightOn unveils state-of-the-art code retrieval models and code search tooling

Community Evals: Because we're done trusting black-box leaderboards over the community

Nemotron ColEmbed V2: Raising the Bar for Multimodal Retrieval with ViDoRe V3’s Top Model

🥃 Distilling Tiny Embeddings

ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models?