An Empirical Study of GPT-4o Image Generation Capabilities Paper • 2504.05979 • Published 2 days ago • 56
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought Paper • 2504.05599 • Published 3 days ago • 68
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published 2 days ago • 112
One-Minute Video Generation with Test-Time Training Paper • 2504.05298 • Published 3 days ago • 80
Inference-Time Scaling for Generalist Reward Modeling Paper • 2504.02495 • Published 7 days ago • 48
Scaling Language-Free Visual Representation Learning Paper • 2504.01017 • Published 9 days ago • 25
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM 30 days ago • 382
When Less is Enough: Adaptive Token Reduction for Efficient Image Representation Paper • 2503.16660 • Published 21 days ago • 71
Running 545 545 Scaling test-time compute 📈 Enhance math problem solving by scaling test-time compute
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published 23 days ago • 116