CLEX: Continuous Length Extrapolation for Large Language Models Paper • 2310.16450 • Published Oct 25, 2023 • 9
E^2-LLM: Efficient and Extreme Length Extension of Large Language Models Paper • 2401.06951 • Published Jan 13 • 25
Data Engineering for Scaling Language Models to 128K Context Paper • 2402.10171 • Published Feb 15 • 23