VideoRoPE: What Makes for Good Video Rotary Position Embeddi Collection • A storage repo for VideoRoPE. • 1 item • Updated Mar 7 • 2
Farewell to Length Extrapolation, a Training-Free Infinite Context with Finite Attention Scope Paper • 2407.15176 • Published Jul 21, 2024 • 3
A Comprehensive Survey on Long Context Language Modeling Paper • 2503.17407 • Published 25 days ago • 49
DuoDecoding: Hardware-aware Heterogeneous Speculative Decoding with Dynamic Multi-Sequence Drafting Paper • 2503.00784 • Published Mar 2 • 11
RWKV-7 "Goose" with Expressive Dynamic State Evolution Paper • 2503.14456 • Published 27 days ago • 137
Towards Economical Inference: Enabling DeepSeek's Multi-Head Latent Attention in Any Transformer-based LLMs Paper • 2502.14837 • Published Feb 20 • 4
World Modeling Makes a Better Planner: Dual Preference Optimization for Embodied Task Planning Paper • 2503.10480 • Published Mar 13 • 49
LongWanjuan: Towards Systematic Measurement for Long Text Quality Paper • 2402.13583 • Published Feb 21, 2024