ColBERT-Zero: To Pre-train Or Not To Pre-train ColBERT models Paper • 2602.16609 • Published 14 days ago • 6
artificial-hivemind Collection This collection contains datasets for the Artificial Hiveminds paper. • 4 items • Updated May 16, 2025 • 13
LightOnOCR-2 🦉 Collection LightOnOCR-2-1B: a lightweight high-performance end-to-end OCR model family • 12 items • Updated about 22 hours ago • 22
Sarashina2.2 Collection Large Language Models developed by SB Intuitions. Pretrained and instruction-tuned models are available in three sizes: 0.5B, 1B, and 3B. • 6 items • Updated Mar 5, 2025 • 8
Article Welcome EmbeddingGemma, Google's new efficient embedding model Sep 4, 2025 • 273
Article Training and Finetuning Sparse Embedding Models with Sentence Transformers v5 Jul 1, 2025 • 133
FreshStack: Building Realistic Benchmarks for Evaluating Retrieval on Technical Documents Paper • 2504.13128 • Published Apr 17, 2025 • 7
AceMath Collection We are releasing math instruction models, math reward models, general instruction models, all training datasets, and a math reward benchmark. • 11 items • Updated about 12 hours ago • 16
CG-RAG: Research Question Answering by Citation Graph Retrieval-Augmented LLMs Paper • 2501.15067 • Published Jan 25, 2025 • 1
Qwen2.5-1M Collection The long-context version of Qwen2.5, supporting 1M-token context lengths • 3 items • Updated Dec 31, 2025 • 126
🏟️ Long Code Arena Collection All the resources for our Long Code Arena benchmark! • 12 items • Updated 2 days ago • 6
OLMoE (November 2024) Collection Artifacts for open mixture-of-experts language models. • 13 items • Updated Dec 23, 2025 • 31