meta-llama/Llama-4-Scout-17B-16E-Instruct Image-Text-to-Text • Updated 2 days ago • 101k • 630
Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy Paper • 2503.19757 • Published 14 days ago • 48
Embodied-Reasoner: Synergizing Visual Search, Reasoning, and Action for Embodied Interactive Tasks Paper • 2503.21696 • Published 12 days ago • 21
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published 21 days ago • 115
mistralai/Mistral-Small-3.1-24B-Instruct-2503 Image-Text-to-Text • Updated about 10 hours ago • 121k • 1.08k
VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search Paper • 2503.10582 • Published 26 days ago • 21
KodCode: A Diverse, Challenging, and Verifiable Synthetic Dataset for Coding Paper • 2503.02951 • Published Mar 4 • 29