Running 2.42k 2.42k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
michaelbenayoun/llama-2-tiny-4kv-heads-4layers-random Text Generation • Updated Oct 14, 2024 • 6.91k
Running on Zero 643 643 Whisper Large V3 🤫 Transcribe audio from microphone, files, or YouTube videos
Distributed Training Collection Papers and resources related to distributed training. • 5 items • Updated Jun 3, 2024