R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning Paper • 2503.05592 • Published 6 days ago • 24
Q-Filters: Leveraging QK Geometry for Efficient KV Cache Compression Paper • 2503.02812 • Published 9 days ago • 9
Q-Filters Collection Pre-computed Q-Filters for efficient KV cache compression. • 15 items • Updated 10 days ago • 6