Submitted by liujch1998 39 OLMoTrace: Tracing Language Model Outputs Back to Trillions of Training Tokens · 31 authors 1
Submitted by imryanxu 23 A Unified Agentic Framework for Evaluating Conditional Image Generation · 10 authors 1
Submitted by zhoutianyi 21 Missing Premise exacerbates Overthinking: Are Reasoning Models losing Critical Thinking Skill? · 4 authors 2
Submitted by Dubhe-zmc 18 GenDoP: Auto-regressive Camera Trajectory Generation as a Director of Photography · 6 authors 1
Submitted by wangqiang9 15 FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis · 8 authors 2
Submitted by AmeyaPrabhu 10 A Sober Look at Progress in Language Model Reasoning: Pitfalls and Paths to Reproducibility · 6 authors 2
Submitted by yunlong10 9 Caption Anything in Video: Fine-grained Object-centric Captioning via Spatiotemporal Multimodal Prompting · 19 authors
Submitted by Boese0601 7 DiTaiListener: Controllable High Fidelity Listener Video Generation with Diffusion · 6 authors 1
Submitted by akhaliq 6 VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning · 10 authors 1
Submitted by phermosilla 5 Masked Scene Modeling: Narrowing the Gap Between Supervised and Self-Supervised Learning in 3D Scene Understanding · 3 authors 1
Submitted by borgr 4 Pretraining Language Models for Diachronic Linguistic Change Discovery · 5 authors 1
Submitted by nicolay-r 3 RuOpinionNE-2024: Extraction of Opinion Tuples from Russian News Texts · 5 authors 1
Submitted by akhaliq 3 WildGS-SLAM: Monocular Gaussian Splatting SLAM in Dynamic Environments · 6 authors 2
Submitted by benlipkin - Fast Controlled Generation from Language Models with Adaptive Weighted Rejection Sampling · 12 authors 1
Submitted by ethHuiZhang - RobustDexGrasp: Robust Dexterous Grasping of General Objects from Single-view Perception · 5 authors 1