2 14 2

Kaiyue Sun

Kaiyue

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Personalized Text-to-Image Generation with Auto-Regressive Models

upvoted a paper 8 days ago

GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation

liked a Space 11 days ago

reasoning-datasets-competition/README

View all activity

Organizations

Kaiyue's activity

upvoted a paper 5 days ago

Personalized Text-to-Image Generation with Auto-Regressive Models

Paper • 2504.13162 • Published 5 days ago • 11

upvoted a paper 8 days ago

GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation

Paper • 2504.08736 • Published 11 days ago • 47

upvoted 3 papers 11 days ago

GenMAC: Compositional Text-to-Video Generation with Multi-Agent Collaboration

Paper • 2412.04440 • Published Dec 5, 2024 • 21

HoloPart: Generative 3D Part Amodal Segmentation

Paper • 2504.07943 • Published 12 days ago • 28

Exploring the Effect of Reinforcement Learning on Video Understanding: Insights from SEED-Bench-R1

Paper • 2503.24376 • Published 22 days ago • 38

upvoted a paper 28 days ago

GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing

Paper • 2503.10639 • Published Mar 13 • 49

upvoted a paper 29 days ago

Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation

Paper • 2503.16430 • Published Mar 20 • 35

upvoted a paper about 1 month ago

PUMA: Empowering Unified MLLM with Multi-granular Visual Generation

Paper • 2410.13861 • Published Oct 17, 2024 • 57

upvoted a paper 3 months ago

GameFactory: Creating New Games with Generative Interactive Videos

Paper • 2501.08325 • Published Jan 14 • 66

upvoted a paper 4 months ago

GS-DiT: Advancing Video Generation with Pseudo 4D Gaussian Fields through Efficient Dense 3D Point Tracking

Paper • 2501.02690 • Published Jan 5 • 17

upvoted a paper 5 months ago

T2V-CompBench: A Comprehensive Benchmark for Compositional Text-to-video Generation

Paper • 2407.14505 • Published Jul 19, 2024 • 27

upvoted 2 papers 7 months ago

Loong: Generating Minute-level Long Videos with Autoregressive Language Models

Paper • 2410.02757 • Published Oct 3, 2024 • 37

Accelerating Auto-regressive Text-to-Image Generation with Training-free Speculative Jacobi Decoding

Paper • 2410.01699 • Published Oct 2, 2024 • 18

upvoted a paper about 1 year ago

Motion-I2V: Consistent and Controllable Image-to-Video Generation with Explicit Motion Modeling

Paper • 2401.15977 • Published Jan 29, 2024 • 40