Gemma 3 Collection All versions of Google's new multimodal models in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 29 items • Updated about 15 hours ago • 48
Running 2.41k The Ultra-Scale Playbook: The ultimate guide to training LLM on large GPU Clusters
Article Fine-tuning LLMs to 1.58bit: extreme quantization made easy Sep 18, 2024 • 228
Running 116 Open-LLM performances are plateauing, let's make the leaderboard steep again: Update leaderboard for fair model evaluation
Article A failed experiment: Infini-Attention, and why we should keep trying? Aug 14, 2024 • 61
Running on Zero 29 Gpt2 Multiplication Predictor: Multiply large numbers using different reasoning methods