Attention Heads of Large Language Models: A Survey Paper โข 2409.03752 โข Published Sep 5, 2024 โข 90
Running on Zero 18 18 Chat with Gemma-2-9B-Chinese-Chat ๐ฌ Chat with a helpful AI assistant in Chinese
view article Article Introducing the LiveCodeBench Leaderboard - Holistic and Contamination-Free Evaluation of Code LLMs Apr 16, 2024 โข 15
Griffin: Mixing Gated Linear Recurrences with Local Attention for Efficient Language Models Paper โข 2402.19427 โข Published Feb 29, 2024 โข 57