MobileLLM Collection • Optimizing Sub-billion Parameter Language Models for On-Device Use Cases (ICML 2024) https://arxiv.org/abs/2402.14905 • 9 items • Updated Nov 27, 2024 • 113
SmolVLM: Redefining small and efficient multimodal models Paper • 2504.05299 • Published 10 days ago • 160
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published 9 days ago • 141
Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems Paper • 2504.01990 • Published 17 days ago • 241
Wan: Open and Advanced Large-Scale Video Generative Models Paper • 2503.20314 • Published 22 days ago • 48
A Comprehensive Survey on Long Context Language Modeling Paper • 2503.17407 • Published 28 days ago • 49
RWKV-7 "Goose" with Expressive Dynamic State Evolution Paper • 2503.14456 • Published 30 days ago • 137
YuE: Scaling Open Foundation Models for Long-Form Music Generation Paper • 2503.08638 • Published Mar 11 • 62
Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids Paper • 2502.20396 • Published Feb 27 • 15
RAD: Training an End-to-End Driving Policy via Large-Scale 3DGS-based Reinforcement Learning Paper • 2502.13144 • Published Feb 18 • 38
Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention Paper • 2502.11089 • Published Feb 16 • 153
YuLan-Mini: An Open Data-efficient Language Model Paper • 2412.17743 • Published Dec 23, 2024 • 67