ybelkada/Mixtral-8x7B-Instruct-v0.1-bnb-4bit

Text Generation

Mixture of Experts

text-generation-inference

4-bit precision

ybelkada committed on Dec 25, 2023

Commit 0c02dd3 · 1 Parent(s): 5d08940

Create README.md

Files changed (1)

README.md +28 -0

README.md ADDED

	@@ -0,0 +1,28 @@

+---
+inference: false
+language:
+- en
+library_name: transformers
+license: apache-2.0
+model_name: Mixtral 8X7B - bnb 4-bit
+model_type: mixtral
+pipeline_tag: text-generation
+  '
+quantized_by: ybelkada
+tags:
+- mistral
+- mixtral
+---
+# Mixtral 8x7B Instruct-v0.1 - `bitsandbytes` 4-bit
+This repository contains the `bitsandbytes` 4-bit quantized version of [`mistralai/Mixtral-8x7B-Instruct-v0.1`](https://huggingface.co/mistralai/Mixtral-8x7B-Instruct-v0.1). To use it, make sure you have the latest versions of `bitsandbytes` and `transformers` installed from source.
+Loading the model as shown below loads the quantized weights directly in 4-bit precision:
+```python
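+from transformers import AutoModelForCausalLM, AutoTokenizer
+model_id = "ybelkada/Mixtral-8x7B-Instruct-v0.1-bnb-4bit"
+tokenizer = AutoTokenizer.from_pretrained(model_id)
+# No quantization arguments are passed: the checkpoint is already stored in 4-bit.
+model = AutoModelForCausalLM.from_pretrained(model_id)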
+```
+Note that you need a CUDA-compatible GPU to run low-bit precision models with `bitsandbytes`.
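
The requirements stated in the README (recent `bitsandbytes` and `transformers`, plus a CUDA-compatible GPU) can be checked before attempting to download the weights. The following is a minimal sketch and not part of the committed README: the helper names `missing_packages` and `cuda_available` are hypothetical, and the inclusion of `torch` in the package list assumes the PyTorch backend of `transformers`.

```python
import importlib.util


def missing_packages(required=("torch", "transformers", "bitsandbytes")):
    """Return the names in `required` that cannot be found in this environment.

    The default list is an assumption: `torch` is not named in the README,
    but it is the backend this sketch expects `transformers` to use.
    """
    return [name for name in required if importlib.util.find_spec(name) is None]


def cuda_available():
    """Return True only if torch is installed and reports a usable CUDA device."""
    if importlib.util.find_spec("torch") is None:
        return False
    import torch  # imported lazily so the check also runs without torch

    return torch.cuda.is_available()


if __name__ == "__main__":
    missing = missing_packages()
    if missing:
        print("Missing packages:", ", ".join(missing))
    elif not cuda_available():
        print("No CUDA device detected; 4-bit loading with bitsandbytes needs one.")
    else:
        print("Environment looks ready for 4-bit loading.")
```

Running the script before the `from_pretrained` call gives a quick failure message instead of a long download followed by an import or device error. It only checks that packages are present, not that their versions are recent enough.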