LLMs Can Easily Learn to Reason from Demonstrations Structure, not content, is what matters! Paper β’ 2502.07374 β’ Published 14 days ago β’ 33
Hibiki fr-en Collection Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. β’ 5 items β’ Updated 18 days ago β’ 50
SFT Memorizes, RL Generalizes: A Comparative Study of Foundation Model Post-training Paper β’ 2501.17161 β’ Published 27 days ago β’ 107
Reasoning Datasets Collection Distilled synthetic Reasoning datasets β’ 7 items β’ Updated 22 days ago β’ 55
Eagle 2 Collection Eagle 2 is a family of frontier vision-language models with vision-centric design. The model supports 4K HD input, long-context video, and grounding. β’ 9 items β’ Updated Jan 23 β’ 31
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 β’ 148
Deepthink and Reasoning Collection Best for Deepthink and Reasoning β’ 14 items β’ Updated Jan 24 β’ 19
ModernBERT Collection Bringing BERT into modernity via both architecture changes and scaling β’ 3 items β’ Updated Dec 19, 2024 β’ 138
Bamba Collection Collection of Bamba - hybrid Mamba2 model architecture based models trained on open data β’ 8 items β’ Updated Dec 18, 2024 β’ 18
Falcon3 Collection Falcon3 family of Open Foundation Models is a set of pretrained and instruct LLMs ranging from 1B to 10B parameters. β’ 40 items β’ Updated 11 days ago β’ 81