26 22 13

qinqi

Dakerqi

AI & ML interests

None yet

Recent Activity

authored a paper 1 day ago

UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture

authored a paper 1 day ago

Accelerating Masked Image Generation by Learning Latent Controlled Dynamics

authored a paper 1 day ago

InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

View all activity

Organizations

authored 8 papers 1 day ago

UniPercept: Towards Unified Perceptual-Level Image Understanding across Aesthetics, Quality, Structure, and Texture

Paper • 2512.21675 • Published Dec 25, 2025 • 25

Accelerating Masked Image Generation by Learning Latent Controlled Dynamics

Paper • 2602.23996 • Published Feb 27 • 8

InternVL-U: Democratizing Unified Multimodal Models for Understanding, Reasoning, Generation and Editing

Paper • 2603.09877 • Published Mar 10 • 48

Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development

Paper • 2603.27460 • Published 27 days ago • 68

Training-Free Acceleration for Document Parsing Vision-Language Model with Hierarchical Speculative Decoding

Paper • 2602.12957 • Published Feb 13

Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development

Paper • 2603.27460 • Published 27 days ago • 68

Project Imaging-X: A Survey of 1000+ Open-Access Medical Imaging Datasets for Foundation Model Development

Paper • 2603.27460 • Published 27 days ago • 68

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published 3 days ago • 219

upvoted a paper 1 day ago

LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model

Paper • 2604.20796 • Published 3 days ago • 219

liked a model 1 day ago

inclusionAI/LLaDA2.0-Uni

Any-to-Any • 16B • Updated about 13 hours ago • 103 • 161

upvoted a paper 24 days ago

On Token's Dilemma: Dynamic MoE with Drift-Aware Token Assignment for Continual Learning of Large Vision Language Models

Paper • 2603.27481 • Published 27 days ago • 35

New activity in Alpha-VLLM/Lumina-Image-2.0 about 2 months ago

I would like to obtain your contact information and customize a model

#19 opened about 2 months ago by

Huahua789

upvoted a paper 2 months ago

Modality Gap-Driven Subspace Alignment Training Paradigm For Multimodal Large Language Models

Paper • 2602.07026 • Published Feb 2 • 140

liked a model 2 months ago

neta-art/Neta-Lumina

Text-to-Image • Updated Aug 5, 2025 • 4.81k • 320

upvoted a paper 4 months ago

Act2Goal: From World Model To General Goal-conditioned Policy

Paper • 2512.23541 • Published Dec 29, 2025 • 23

authored 3 papers 4 months ago

Unimedvl: Unifying Medical Multimodal Understanding And Generation Through Observation-Knowledge-Analysis

Paper • 2510.15710 • Published Oct 17, 2025 • 8

Parameter-Efficient Fine-Tuning for Pre-Trained Vision Models: A Survey and Benchmark

Paper • 2402.02242 • Published Feb 3, 2024

dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models

Paper • 2512.19433 • Published Dec 22, 2025 • 3

upvoted a paper 4 months ago

dMLLM-TTS: Self-Verified and Efficient Test-Time Scaling for Diffusion Multi-Modal Large Language Models

Paper • 2512.19433 • Published Dec 22, 2025 • 3

updated a collection 4 months ago

Lumina-Image 2.0

Collection

3 items • Updated Dec 19, 2025

qinqi

AI & ML interests

Recent Activity

Organizations

Dakerqi's activity

I would like to obtain your contact information and customize a model