MiniMax-01: Scaling Foundation Models with Lightning Attention Paper • 2501.08313 • Published 22 days ago • 272
Towards Best Practices for Open Datasets for LLM Training Paper • 2501.08365 • Published 22 days ago • 53
Kimi k1.5: Scaling Reinforcement Learning with LLMs Paper • 2501.12599 • Published 14 days ago • 86
DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning Paper • 2501.12948 • Published 14 days ago • 295
Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch Paper • 2501.18512 • Published 6 days ago • 25
Hymba: A Hybrid-head Architecture for Small Language Models Paper • 2411.13676 • Published Nov 20, 2024 • 41
SOLAMI: Social Vision-Language-Action Modeling for Immersive Interaction with 3D Autonomous Characters Paper • 2412.00174 • Published Nov 29, 2024 • 23
Puzzle: Distillation-Based NAS for Inference-Optimized LLMs Paper • 2411.19146 • Published Nov 28, 2024 • 17
GRAPE: Generalizing Robot Policy via Preference Alignment Paper • 2411.19309 • Published Nov 28, 2024 • 44
ResearchTown: Simulator of Human Research Community Paper • 2412.17767 • Published Dec 23, 2024 • 14
Large Action Models: From Inception to Implementation Paper • 2412.10047 • Published Dec 13, 2024 • 33