The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters Space • 2.43k
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published Jan 22 • 375
Introducing multi-backends (TRT-LLM, vLLM) support for Text Generation Inference Article • Jan 16 • 71
Open LLM Leaderboard 🏆 Track, rank and evaluate open LLMs and chatbots Space • 12.9k