Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 30 days ago • 382
GemmaX2 Collection GemmaX2 language models, including pretrained and instruction-tuned models of 2 sizes, including 2B, 9B. • 7 items • Updated Feb 7 • 21
Running 2.44k The Ultra-Scale Playbook The ultimate guide to training LLM on large GPU Clusters
OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models Paper • 2502.01061 • Published Feb 3 • 211
Qwen2.5-VL Collection Vision-language model series based on Qwen2.5 • 11 items • Updated 10 days ago • 439
REINFORCE++: A Simple and Efficient Approach for Aligning Large Language Models Paper • 2501.03262 • Published Jan 4 • 99
VITA-1.5: Towards GPT-4o Level Real-Time Vision and Speech Interaction Paper • 2501.01957 • Published Jan 3 • 46
Running 535 Open Source Ai Year In Review 2024 What happened in open-source AI this year, and what's next?