AdaptiVocab: Enhancing LLM Efficiency in Focused Domains through Lightweight Vocabulary Adaptation Paper • 2503.19693 • Published 14 days ago • 75
LogQuant: Log-Distributed 2-Bit Quantization of KV Cache with Superior Accuracy Preservation Paper • 2503.19950 • Published 14 days ago • 10
RWKV-7 "Goose" with Expressive Dynamic State Evolution Paper • 2503.14456 • Published 21 days ago • 136
DAPO: An Open-Source LLM Reinforcement Learning System at Scale Paper • 2503.14476 • Published 21 days ago • 115
φ-Decoding: Adaptive Foresight Sampling for Balanced Inference-Time Exploration and Exploitation Paper • 2503.13288 • Published 22 days ago • 49
SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion Paper • 2503.11576 • Published 25 days ago • 89
Block Diffusion: Interpolating Between Autoregressive and Diffusion Language Models Paper • 2503.09573 • Published 27 days ago • 68