SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution Paper • 2502.18449 • Published 1 day ago • 37
Slamming: Training a Speech Language Model on One GPU in a Day Paper • 2502.15814 • Published 7 days ago • 50
You Do Not Fully Utilize Transformer's Representation Capacity Paper • 2502.09245 • Published 14 days ago • 33
Cramming 1568 Tokens into a Single Vector and Back Again: Exploring the Limits of Embedding Space Capacity Paper • 2502.13063 • Published 8 days ago • 62
ActionPiece: Contextually Tokenizing Action Sequences for Generative Recommendation Paper • 2502.13581 • Published 8 days ago • 5
Autellix: An Efficient Serving Engine for LLM Agents as General Programs Paper • 2502.13965 • Published 7 days ago • 17
Over-Tokenized Transformer: Vocabulary is Generally Worth Scaling Paper • 2501.16975 • Published 29 days ago • 26
CoS: Chain-of-Shot Prompting for Long Video Understanding Paper • 2502.06428 • Published 17 days ago • 10
CodeI/O: Condensing Reasoning Patterns via Code Input-Output Prediction Paper • 2502.07316 • Published 16 days ago • 45
VideoJAM: Joint Appearance-Motion Representations for Enhanced Motion Generation in Video Models Paper • 2502.02492 • Published 22 days ago • 57
Learn-by-interact: A Data-Centric Framework for Self-Adaptive Agents in Realistic Environments Paper • 2501.10893 • Published Jan 18 • 24
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published Jan 8 • 258
Training Software Engineering Agents and Verifiers with SWE-Gym Paper • 2412.21139 • Published Dec 30, 2024 • 22
Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation Paper • 2412.04432 • Published Dec 5, 2024 • 16
Feather the Throttle: Revisiting Visual Token Pruning for Vision-Language Model Acceleration Paper • 2412.13180 • Published Dec 17, 2024 • 13