DataGemma Release Collection A series of pioneering open models that help ground LLMs in real-world data through Data Commons. • 2 items • Updated 19 days ago • 85
SmolLM2 Collection State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated Feb 20 • 253
Searching for Best Practices in Retrieval-Augmented Generation Paper • 2407.01219 • Published Jul 1, 2024 • 11
RAFT: Adapting Language Model to Domain Specific RAG Paper • 2403.10131 • Published Mar 15, 2024 • 73
Load 4bit models 4x faster Collection Native bitsandbytes 4-bit pre-quantized models • 25 items • Updated 1 day ago • 56
Sora Reference Papers Collection A collection of all papers referenced in OpenAI's "Video generation models as world simulators" technical report • openai.com/sora • 30 items • Updated Oct 3, 2024 • 52
Matryoshka Embedding Models Collection https://huggingface.co/blog/matryoshka • 14 items • Updated 28 days ago • 16