DelTA: Discriminative Token Credit Assignment for Reinforcement Learning from Verifiable Rewards Paper • 2605.21467 • Published 7 days ago • 201
Mean Mode Screaming: Mean--Variance Split Residuals for 1000-Layer Diffusion Transformers Paper • 2605.06169 • Published 20 days ago • 229
CiteVQA: Benchmarking Evidence Attribution for Trustworthy Document Intelligence Paper • 2605.12882 • Published 14 days ago • 268
Learning, Fast and Slow: Towards LLMs That Adapt Continually Paper • 2605.12484 • Published 15 days ago • 17
SkillOS: Learning Skill Curation for Self-Evolving Agents Paper • 2605.06614 • Published 20 days ago • 45
SVGS: Enhancing Gaussian Splatting Using Primitives with Spatially Varying Colors Paper • 2411.18966 • Published 23 days ago • 9
Cross-Tokenizer LLM Distillation through a Byte-Level Interface Paper • 2604.07466 • Published Apr 13 • 6
LLaDA2.0-Uni: Unifying Multimodal Understanding and Generation with Diffusion Large Language Model Paper • 2604.20796 • Published Apr 22 • 242
RationalRewards: Reasoning Rewards Scale Visual Generation Both Training and Test Time Paper • 2604.11626 • Published Apr 13 • 102
OmniJigsaw: Enhancing Omni-Modal Reasoning via Modality-Orchestrated Reordering Paper • 2604.08209 • Published Apr 9 • 26