Personalized Text-to-Image Generation with Auto-Regressive Models Paper โข 2504.13162 โข Published 5 days ago โข 11 โข 1
Personalized Text-to-Image Generation with Auto-Regressive Models Paper โข 2504.13162 โข Published 5 days ago โข 11
GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation Paper โข 2504.08736 โข Published 11 days ago โข 47
GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration Paper โข 2412.04440 โข Published Dec 5, 2024 โข 21
HoloPart: Generative 3D Part Amodal Segmentation Paper โข 2504.07943 โข Published 12 days ago โข 28
Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1 Paper โข 2503.24376 โข Published 22 days ago โข 38
GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing Paper โข 2503.10639 โข Published Mar 13 โข 49
Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation Paper โข 2503.16430 โข Published Mar 20 โข 35
PUMA: Empowering Unified MLLM with Multi-granular Visual Generation Paper โข 2410.13861 โข Published Oct 17, 2024 โข 57
GameFactory: Creating New Games with Generative Interactive Videos Paper โข 2501.08325 โข Published Jan 14 โข 66