Nemotron Agentic & Tool-Use Collection Datasets for building models capable of function calling, multi-step agentic tasks, terminal use, and SWE workflows. • 9 items • Updated 5 days ago • 7
view article Article Keep the Tokens Flowing: Lessons from 16 Open-Source RL Libraries +7 aminediroHF, qgallouedec, kashif, lewtun, edbeeching, albertvillanova, nouamanetazi, lvwerra, sergiopaniego • Mar 10 • 154
spec-diffusion/deepseek-ai-DeepSeek-R1-Distill-Qwen-7B-lr9e5-ep20-bs8-w16-pc005-GSM8K Updated Mar 19, 2025
view article Article Ultra-Long Sequence Parallelism: Ulysses + Ring-Attention Technical Principles and Implementation exploding-gradients • Sep 16, 2025 • 20
👩💻 OlympicCoder Collection Reasoning datasets and models for competitive coding • 6 items • Updated Dec 7, 2025 • 20
view article Article Continuous batching from first principles +1 ror, ArthurZ, mcpotato • Nov 25, 2025 • 395
view post Post 2236 Mistral's new SOTA coding models Devstral 2 can now be Run locally! (25GB RAM) 🐱We fixed the chat template, so performance should be much better now!24B: unsloth/Devstral-Small-2-24B-Instruct-2512-GGUF123B: unsloth/Devstral-2-123B-Instruct-2512-GGUF🧡Step-by-step Guide: https://docs.unsloth.ai/models/devstral-2 See translation 🔥 8 8 🚀 5 5 ❤️ 3 3 🤗 2 2 + Reply
T-pro 2.0: An Efficient Russian Hybrid-Reasoning Model and Playground Paper • 2512.10430 • Published Dec 11, 2025 • 119
view article Article Transformers v5: Simple model definitions powering the AI ecosystem +2 lysandre, ArthurZ, cyrilvallez, reach-vb • Dec 1, 2025 • 311