Personalized Text-to-Image Generation with Auto-Regressive Models Paper • 2504.13162 • Published 3 days ago • 11
GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation Paper • 2504.08736 • Published 9 days ago • 46
Any2Caption: Interpreting Any Condition to Caption for Controllable Video Generation Paper • 2503.24379 • Published 20 days ago • 74
Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1 Paper • 2503.24376 • Published 20 days ago • 37
Position: Interactive Generative Video as Next-Generation Game Engine Paper • 2503.17359 • Published about 1 month ago • 62
Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation Paper • 2503.16430 • Published Mar 20 • 35
RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints Paper • 2503.16408 • Published Mar 20 • 40
PUMA: Empowering Unified MLLM with Multi-granular Visual Generation Paper • 2410.13861 • Published Oct 17, 2024 • 57
GameFactory: Creating New Games with Generative Interactive Videos Paper • 2501.08325 • Published Jan 14 • 66 • 3
FreeDoM: Training-Free Energy-Guided Conditional Diffusion Model Paper • 2303.09833 • Published Mar 17, 2023
Zero-Shot Image Restoration Using Denoising Diffusion Null-Space Model Paper • 2212.00490 • Published Dec 1, 2022
SkillMimic: Learning Reusable Basketball Skills from Demonstrations Paper • 2408.15270 • Published Aug 12, 2024 • 1
WorldSimBench: Towards Video Generation Models as World Simulators Paper • 2410.18072 • Published Oct 23, 2024 • 20