MaziyarPanahi
/

Mixtral-8x22B-v0.1-GGUF

Text Generation

4-bit precision

8-bit precision

Mixture of Experts

Model card Files Files and versions Community

MaziyarPanahi commited on Apr 11

Commit

e57b55a

•

1 Parent(s): c21d268

Correct number of parameters

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -32,7 +32,7 @@ language:
 On April 10th, [@MistralAI](https://huggingface.co/mistralai) released a model named "Mixtral 8x22B," an 176B MoE via magnet link (torrent):
-- 176B MoE with ~40B active
 - Context length of 65k tokens
 - The base model can be fine-tuned
 - Requires ~260GB VRAM in fp16, 73GB in int4

 On April 10th, [@MistralAI](https://huggingface.co/mistralai) released a model named "Mixtral 8x22B," an 176B MoE via magnet link (torrent):
+- 141B MoE with ~35B active
 - Context length of 65k tokens
 - The base model can be fine-tuned
 - Requires ~260GB VRAM in fp16, 73GB in int4