view article Article LeRobot goes to driving school: World’s largest open-source self-driving dataset 26 days ago • 73
SurveyX: Academic Survey Automation via Large Language Models Paper • 2502.14776 • Published Feb 20 • 96
view article Article Introducing smolagents: simple agents that write actions in code. Dec 31, 2024 • 956
view article Article Introducing the Synthetic Data Generator - Build Datasets with Natural Language Dec 16, 2024 • 121
view article Article Train 400x faster Static Embedding Models with Sentence Transformers Jan 15 • 170
view article Article Training and Finetuning Embedding Models with Sentence Transformers v3 May 28, 2024 • 205
OpenCoder Collection OpenCoder is an open and reproducible code LLM family which matches the performance of top-tier code LLMs. • 8 items • Updated Nov 23, 2024 • 81
Llama 3.2 Collection This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated Dec 6, 2024 • 585
Gemma-APS Release Collection Gemma models for text-to-propositions segmentation. The models are distilled from fine-tuned Gemini Pro model applied to multi-domain synthetic data. • 3 items • Updated 3 days ago • 21
Llama 3.2 Collection Meta's new Llama 3.2 vision and text models including 1B, 3B, 11B and 90B. Includes GGUF, 4-bit bnb and original versions. • 27 items • Updated about 12 hours ago • 60
view article Article Llama 3.1 - 405B, 70B & 8B with multilinguality and long context Jul 23, 2024 • 232