GEN3C: 3D-Informed World-Consistent Video Generation with Precise Camera Control Paper • 2503.03751 • Published about 11 hours ago • 3 • 2
Sim-to-Real Reinforcement Learning for Vision-Based Dexterous Manipulation on Humanoids Paper • 2502.20396 • Published 6 days ago • 11 • 2
HAIC: Improving Human Action Understanding and Generation with Better Captions for Multi-modal Large Language Models Paper • 2502.20811 • Published 6 days ago • 1 • 2
SoS1: O1 and R1-Like Reasoning LLMs are Sum-of-Square Solvers Paper • 2502.20545 • Published 6 days ago • 18 • 2
Mobius: Text to Seamless Looping Video Generation via Latent Shift Paper • 2502.20307 • Published 7 days ago • 16 • 2
FlexiDiT: Your Diffusion Transformer Can Easily Generate High-Quality Samples with Less Compute Paper • 2502.20126 • Published 7 days ago • 19 • 2
R1-T1: Fully Incentivizing Translation Capability in LLMs via Reasoning Learning Paper • 2502.19735 • Published 7 days ago • 7 • 2
TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding Paper • 2502.19400 • Published 7 days ago • 41 • 2
An Overview of Large Language Models for Statisticians Paper • 2502.17814 • Published 9 days ago • 4 • 2
WebGames: Challenging General-Purpose Web-Browsing AI Agents Paper • 2502.18356 • Published 9 days ago • 10 • 2
SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution Paper • 2502.18449 • Published 8 days ago • 62 • 5
X-Dancer: Expressive Music to Human Dance Video Generation Paper • 2502.17414 • Published 9 days ago • 11 • 3
VideoGrain: Modulating Space-Time Attention for Multi-grained Video Editing Paper • 2502.17258 • Published 10 days ago • 71 • 4
RIFLEx: A Free Lunch for Length Extrapolation in Video Diffusion Transformers Paper • 2502.15894 • Published 12 days ago • 19 • 3
One-step Diffusion Models with $f$-Divergence Distribution Matching Paper • 2502.15681 • Published 12 days ago • 6 • 2
SIFT: Grounding LLM Reasoning in Contexts via Stickers Paper • 2502.14922 • Published 15 days ago • 29 • 3
Think Inside the JSON: Reinforcement Strategy for Strict LLM Schema Adherence Paper • 2502.14905 • Published 16 days ago • 9 • 2
InterFeedback: Unveiling Interactive Intelligence of Large Multimodal Models via Human Feedback Paper • 2502.15027 • Published 13 days ago • 7 • 2