inference Star Attention: Efficient LLM Inference over Long Sequences Paper • 2411.17116 • Published Nov 26, 2024 • 53
Star Attention: Efficient LLM Inference over Long Sequences Paper • 2411.17116 • Published Nov 26, 2024 • 53
llm_model PygmalionAI/mythalion-13b Text Generation • 13B • Updated Sep 15, 2023 • 1.15k • • 162 Nitral-AI/Poppy_Porpoise-0.72-L3-8B Text Generation • 8B • Updated Jul 4, 2024 • 17 • • 36 Lewdiculous/KukulStanta-7B-GGUF-IQ-Imatrix 7B • Updated Apr 7, 2024 • 89 • 9 Lewdiculous/L3-8B-Stheno-v3.2-GGUF-IQ-Imatrix 8B • Updated Feb 2, 2025 • 16.5k • 237
inference Star Attention: Efficient LLM Inference over Long Sequences Paper • 2411.17116 • Published Nov 26, 2024 • 53
Star Attention: Efficient LLM Inference over Long Sequences Paper • 2411.17116 • Published Nov 26, 2024 • 53
llm_model PygmalionAI/mythalion-13b Text Generation • 13B • Updated Sep 15, 2023 • 1.15k • • 162 Nitral-AI/Poppy_Porpoise-0.72-L3-8B Text Generation • 8B • Updated Jul 4, 2024 • 17 • • 36 Lewdiculous/KukulStanta-7B-GGUF-IQ-Imatrix 7B • Updated Apr 7, 2024 • 89 • 9 Lewdiculous/L3-8B-Stheno-v3.2-GGUF-IQ-Imatrix 8B • Updated Feb 2, 2025 • 16.5k • 237