Gemma 3 Collection All versions of Google's new multimodal models in 1B, 4B, 12B, and 27B sizes. In GGUF, dynamic 4-bit and 16-bit formats. • 29 items • Updated about 7 hours ago • 32
Running on CPU Upgrade 671 671 Open ASR Leaderboard 🏆 Request evaluation of a speech recognition model
SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features Paper • 2502.14786 • Published 22 days ago • 129
Running 2.25k 2.25k The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters
Hibiki fr-en Collection Hibiki is a model for streaming speech translation , which can run on device! See https://github.com/kyutai-labs/hibiki. • 5 items • Updated Feb 6 • 50