Update README.md
README.md
CHANGED
@@ -48,10 +48,10 @@ it contains the following quantizations of the original weights from Together's
 > the model was fine-tuned for that size), it is recommended that you keep your context as small as possible
 
 > If you need quantizations for Together Computer's
-> [Llama-2-7B-32K-Instruct](https://huggingface.co/togethercomputer/Llama-2-7B-32K-Instruct
+> [Llama-2-7B-32K-Instruct](https://huggingface.co/togethercomputer/Llama-2-7B-32K-Instruct)
 > model, then look for
-> [LLaMA-2-7B-32K-Instruct_GGUF](https://huggingface.co/rozek/LLaMA-2-7B-32K-Instruct_GGUF
-> which is currently being uploaded
+> [LLaMA-2-7B-32K-Instruct_GGUF](https://huggingface.co/rozek/LLaMA-2-7B-32K-Instruct_GGUF)
+> which is currently being uploaded and should become available by the end of this day
 
 ## How Quantization was done ##
 