view article Article LLM Inference on Edge: A Fun and Easy Guide to run LLMs via React Native on your Phone! 29 days ago • 49
Running 2.4k 2.4k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Running 2.4k 2.4k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Domino: Eliminating Communication in LLM Training via Generic Tensor Slicing and Overlapping Paper • 2409.15241 • Published Sep 23, 2024 • 1
Phi-3 Technical Report: A Highly Capable Language Model Locally on Your Phone Paper • 2404.14219 • Published Apr 22, 2024 • 258