Running 1.89k 1.89k The Ultra-Scale Playbook ๐ The ultimate guide to training LLM on large GPU Clusters
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation โข Updated 7 days ago โข 1.38M โข โข 1.21k