Exploring the Limit of Outcome Reward for Learning Mathematical Reasoning Paper • 2502.06781 • Published Feb 10 • 60
Running 2.24k 2.24k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
view article Article Accelerate Large Model Training using PyTorch Fully Sharded Data Parallel May 2, 2022 • 3
Mediator: Memory-efficient LLM Merging with Less Parameter Conflicts and Uncertainty Based Routing Paper • 2502.04411 • Published Feb 6 • 4
neuralmagic/Meta-Llama-3.1-70B-Instruct-quantized.w8a8 Text Generation • Updated about 1 month ago • 4.03k • 19
view article Article Token Merging for fast LLM inference : Background and first trials with Mistral By samchain • Apr 30, 2024 • 4