Model Merging Collection Model merging is a very popular technique for LLMs nowadays. Here is a chronological list of papers in this space that will help you get started with it! • 30 items • Updated Jun 12, 2024 • 235
Crowdsource, Crawl, or Generate? Creating SEA-VL, a Multicultural Vision-Language Dataset for Southeast Asia Paper • 2503.07920 • Published 6 days ago • 91
Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 5 days ago • 270
Article HuggingFace, IISc partner to supercharge model building on India's diverse languages 18 days ago • 14
Article A Deepdive into Aya Vision: Advancing the Frontier of Multilingual Multimodality 13 days ago • 66
Token-Efficient Long Video Understanding for Multimodal LLMs Paper • 2503.04130 • Published 10 days ago • 79
EuroBERT: Scaling Multilingual Encoders for European Languages Paper • 2503.05500 • Published 9 days ago • 72
Article DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge By NormalUhr • Feb 7 • 74
Article Illustrating Reinforcement Learning from Human Feedback (RLHF) Dec 9, 2022 • 199
The Ultra-Scale Playbook 🌌 The ultimate guide to training LLM on large GPU Clusters • 2.26k