Open Deep Search: Democratizing Search with Open-source Reasoning Agents Paper • 2503.20201 • Published 18 days ago • 43
DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning Paper • 2504.07128 • Published 11 days ago • 54
Quantization Hurts Reasoning? An Empirical Study on Quantized Reasoning Models Paper • 2504.04823 • Published 6 days ago • 27
SkyReels-A2: Compose Anything in Video Diffusion Transformers Paper • 2504.02436 • Published 10 days ago • 35
Skywork R1V: Pioneering Multimodal Reasoning with Chain-of-Thought Paper • 2504.05599 • Published 5 days ago • 73
Hogwild! Inference: Parallel LLM Generation via Concurrent Attention Paper • 2504.06261 • Published 4 days ago • 86
A Comprehensive Survey on Long Context Language Modeling Paper • 2503.17407 • Published 23 days ago • 49
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published 24 days ago • 46
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning Paper • 2503.15558 • Published 25 days ago • 45
Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models Paper • 2503.16257 • Published 24 days ago • 23
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models Paper • 2503.16419 • Published 23 days ago • 67
XAttention: Block Sparse Attention with Antidiagonal Scoring Paper • 2503.16428 • Published 23 days ago • 13
Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts Paper • 2503.16057 • Published 24 days ago • 14
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published 25 days ago • 116
RWKV-7 "Goose" with Expressive Dynamic State Evolution Paper • 2503.14456 • Published 25 days ago • 137
EAGLE-3: Scaling up Inference Acceleration of Large Language Models via Training-Time Test Paper • 2503.01840 • Published Mar 3 • 5