A Comprehensive Survey on Long Context Language Modeling Paper • 2503.17407 • Published 16 days ago • 48
Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't Paper • 2503.16219 • Published 16 days ago • 46
Cosmos-Reason1: From Physical Common Sense To Embodied Reasoning Paper • 2503.15558 • Published 18 days ago • 44
Plug-and-Play 1.x-Bit KV Cache Quantization for Video Large Language Models Paper • 2503.16257 • Published 16 days ago • 23
Stop Overthinking: A Survey on Efficient Reasoning for Large Language Models Paper • 2503.16419 • Published 16 days ago • 65
XAttention: Block Sparse Attention with Antidiagonal Scoring Paper • 2503.16428 • Published 16 days ago • 12
Expert Race: A Flexible Routing Strategy for Scaling Diffusion Transformer with Mixture of Experts Paper • 2503.16057 • Published 16 days ago • 14
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published 18 days ago • 112
RWKV-7 "Goose" with Expressive Dynamic State Evolution Paper • 2503.14456 • Published 18 days ago • 134
EAGLE-3: Scaling up Inference Acceleration of Large Language Models via Training-Time Test Paper • 2503.01840 • Published Mar 3 • 4
Seedream 2.0: A Native Chinese-English Bilingual Image Generation Foundation Model Paper • 2503.07703 • Published 26 days ago • 34
Identifying Sensitive Weights via Post-quantization Integral Paper • 2503.01901 • Published Feb 28 • 7
DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting Paper • 2503.00784 • Published Mar 2 • 11
Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs Paper • 2503.01743 • Published Mar 3 • 81
FR-Spec: Accelerating Large-Vocabulary Language Models via Frequency-Ranked Speculative Sampling Paper • 2502.14856 • Published Feb 20 • 7