Agent Laboratory: Using LLM Agents as Research Assistants Paper • 2501.04227 • Published 5 days ago • 66
rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking Paper • 2501.04519 • Published 4 days ago • 190
Training Software Engineering Agents and Verifiers with SWE-Gym Paper • 2412.21139 • Published 13 days ago • 20
Divot: Diffusion Powers Video Tokenizer for Comprehension and Generation Paper • 2412.04432 • Published Dec 5, 2024 • 14
Feather the Throttle: Revisiting Visual Token Pruning for Vision-Language Model Acceleration Paper • 2412.13180 • Published 26 days ago • 13
Emergence of Abstractions: Concept Encoding and Decoding Mechanism for In-Context Learning in Transformers Paper • 2412.12276 • Published 27 days ago • 15
AmoebaLLM: Constructing Any-Shape Large Language Models for Efficient and Instant Deployment Paper • 2411.10606 • Published Nov 15, 2024 • 1
MaestroMotif: Skill Design from Artificial Intelligence Feedback Paper • 2412.08542 • Published Dec 11, 2024 • 1
CMT: A Memory Compression Method for Continual Knowledge Learning of Large Language Models Paper • 2412.07393 • Published Dec 10, 2024 • 2
Video Token Merging for Long-form Video Understanding Paper • 2410.23782 • Published Oct 31, 2024 • 2
Aguvis: Unified Pure Vision Agents for Autonomous GUI Interaction Paper • 2412.04454 • Published Dec 5, 2024 • 59
Moto: Latent Motion Token as the Bridging Language for Robot Manipulation Paper • 2412.04445 • Published Dec 5, 2024 • 21
Training Large Language Models to Reason in a Continuous Latent Space Paper • 2412.06769 • Published Dec 9, 2024 • 71
Efficient Long Video Tokenization via Coordinated-based Patch Reconstruction Paper • 2411.14762 • Published Nov 22, 2024 • 11
Trace is the New AutoDiff -- Unlocking Efficient Optimization of Computational Workflows Paper • 2406.16218 • Published Jun 23, 2024 • 2
Combining Induction and Transduction for Abstract Reasoning Paper • 2411.02272 • Published Nov 4, 2024 • 1