Submitted by Hush-cd 62 xVerify: Efficient Answer Verifier for Reasoning Model Evaluations · 9 authors 1
Submitted by xufangzhi 41 Genius: A Generalizable and Purely Unsupervised Self-Training Framework For Advanced Reasoning · 9 authors 1
Submitted by zhoutianyi 29 How Instruction and Reasoning Data shape Post-Training: Data Quality through the Lens of Layer-wise Gradients · 4 authors 1
Submitted by LXT 12 The Scalability of Simplicity: Empirical Analysis of Vision-Language Learning with a Single Transformer · 7 authors 2
Submitted by wbhu-tc 9 NormalCrafter: Learning Temporally Consistent Normals from Video Diffusion Priors · 5 authors 1
Submitted by SempraETY 8 Efficient Generative Model Training via Embedded Representation Warmup · 4 authors 1
Submitted by IanMagnusson 7 DataDecide: How to Predict Best Pretraining Data with Small Experiments · 13 authors 1
Submitted by yueqis 7 VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge · 6 authors 1
Submitted by pierlj 7 RealHarm: A Collection of Real-World Language Model Application Failures · 4 authors 2
Submitted by davanstrien 6 DeepMath-103K: A Large-Scale, Challenging, Decontaminated, and Verifiable Mathematical Dataset for Advancing Reasoning · 15 authors 1
Submitted by weqweasdas 6 A Minimalist Approach to LLM Reasoning: from Rejection Sampling to Reinforce · 11 authors 2
Submitted by Daniel0724 4 SimpleAR: Pushing the Frontier of Autoregressive Visual Generation through Pretraining, SFT, and RL · 7 authors
Submitted by SYZhang0805 4 Diffusion Distillation With Direct Preference Optimization For Efficient 3D LiDAR Scene Completion · 8 authors 1
Submitted by jrd971000 4 Efficient Hybrid Language Model Compression through Group-Aware SSM Pruning · 18 authors 1
Submitted by HenghuiDing 4 PVUW 2025 Challenge Report: Advances in Pixel-level Understanding of Complex Videos in the Wild · 36 authors 1
Submitted by simocimolato 4 AI-University: An LLM-based platform for instructional alignment to scientific classrooms · 8 authors 1
Submitted by CoreloneH 3 D^2iT: Dynamic Diffusion Transformer for Accurate Image Generation · 5 authors 1
Submitted by sukannya 2 LazyReview A Dataset for Uncovering Lazy Thinking in NLP Peer Reviews · 5 authors 1
Submitted by Hoar012 2 Multimodal Long Video Modeling Based on Temporal Dynamic Context · 4 authors 1
Submitted by gigant 2 Summarization of Multimodal Presentations with Vision-Language Models: Study of the Effect of Modalities and Structure · 3 authors 1
Submitted by ziqipang - Aligning Generative Denoising with Discriminative Objectives Unleashes Diffusion for Visual Perception · 3 authors 1
Submitted by ElmanGhazaei - Change State Space Models for Remote Sensing Change Detection · 2 authors 1