Update README.md
README.md
CHANGED
@@ -28,6 +28,9 @@ language:
 <a><img src="octopus-v4-logo.png" alt="nexa-octopus" style="width: 40%; min-width: 300px; display: block; margin: auto;"></a>
 </p>
 
+## Quantized Octopus V4
+To run the model on-device, we have prepared [quantized models](https://huggingface.co/NexaAIDev/octopus-v4-gguf) in gguf format for you.
+
 ## Introduction
 
 Octopus-V4-3B, an advanced open-source language model with 3 billion parameters, serves as the master node in Nexa AI's envisioned graph of language models. Tailored specifically for the MMLU benchmark topics, this model efficiently translates user queries into formats that specialized models can effectively process. It excels at directing these queries to the appropriate specialized model, ensuring precise and effective query handling.