SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 14 days ago • 164
📚 LLM pretraining datasets Collection A collection of datasets for LLM pretraining • 9 items • Updated Mar 7 • 6
future-technologies/Universal-Transformers-Dataset Viewer • Updated 6 days ago • 70.1M • 3.79k • 64
Common Corpus Collection Largest multilingual pretraining data. • 1 item • Updated Nov 13, 2024 • 10
DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models Paper • 2402.03300 • Published Feb 5, 2024 • 116
Article You could have designed state of the art positional encoding Nov 25, 2024 • 231
Article π0 and π0-FAST: Vision-Language-Action Models for General Robot Control Feb 4 • 144