SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding Paper • 2412.09604 • Published 13 days ago • 35
NDC-Scene: Boost Monocular 3D Semantic Scene Completion in Normalized Device Coordinates Space Paper • 2309.14616 • Published Sep 26, 2023
ADDP: Learning General Representations for Image Recognition and Generation with Alternating Denoising Diffusion Process Paper • 2306.05423 • Published Jun 8, 2023
JourneyDB: A Benchmark for Generative Image Understanding Paper • 2307.00716 • Published Jul 3, 2023 • 19
The All-Seeing Project: Towards Panoptic Visual Recognition and Understanding of the Open World Paper • 2308.01907 • Published Aug 3, 2023 • 11
Uni-Perceiver: Pre-training Unified Architecture for Generic Perception for Zero-shot and Few-shot Tasks Paper • 2112.01522 • Published Dec 2, 2021
Uni-Perceiver v2: A Generalist Model for Large-Scale Vision and Vision-Language Tasks Paper • 2211.09808 • Published Nov 17, 2022
PUMA: Empowering Unified MLLM with Multi-granular Visual Generation Paper • 2410.13861 • Published Oct 17 • 52
SynerGen-VL: Towards Synergistic Image Understanding and Generation with Vision Experts and Token Folding Paper • 2412.09604 • Published 13 days ago • 35
PUMA: Empowering Unified MLLM with Multi-granular Visual Generation Paper • 2410.13861 • Published Oct 17 • 52