arxiv:2605.22012
Yuran Wang
Ryann829
AI & ML interests
Multimodal Large Language Model
Recent Activity
authored a paper about 1 hour ago
LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning
upvoted a paper about 12 hours ago
LatentOmni: Rethinking Omni-Modal Understanding via Unified Audio-Visual Latent Reasoning
authored a paper 2 days ago
Artifact-Bench: Evaluating MLLMs on Detecting and Assessing the Artifacts of AI-Generated Videos