A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce Paper • 2504.11343 • Published 7 days ago • 13
InternVL3: Exploring Advanced Training and Test-Time Recipes for Open-Source Multimodal Models Paper • 2504.10479 • Published 7 days ago • 235
Pangu Ultra: Pushing the Limits of Dense Large Language Models on Ascend NPUs Paper • 2504.07866 • Published 12 days ago • 8
VisualPRM: An Effective Process Reward Model for Multimodal Reasoning Paper • 2503.10291 • Published Mar 13 • 34
An Empirical Study of GPT-4o Image Generation Capabilities Paper • 2504.05979 • Published 14 days ago • 59
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought Paper • 2504.05599 • Published 14 days ago • 80
OmniSVG: A Unified Scalable Vector Graphics Generation Model Paper • 2504.06263 • Published 13 days ago • 147
Inference-Time Scaling for Generalist Reward Modeling Paper • 2504.02495 • Published 19 days ago • 52
view article Article Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM Mar 12 • 394
When Less is Enough: Adaptive Token Reduction for Efficient Image Representation Paper • 2503.16660 • Published Mar 20 • 73