Running 2.23k 2.23k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Running on Zero 427 427 Chat with DeepSeek-VL2-small 🌍 Generate responses using images and text input
Running 867 867 FineWeb: decanting the web for the finest text data at scale 🍷 Generate high-quality web text data for LLM training
Foundation AI Papers Collection Curated List of Must-Reads on LLM reasoning at Temus AI team • 135 items • Updated Jun 15, 2024 • 32