rozek
/

LLaMA-2-7B-32K_GGUF

Text Generation

text-generation-inference

togethercomputer

Inference Endpoints

Model card Files Files and versions Community

rozek commited on Aug 30, 2023

Commit

dd5e04c

·

1 Parent(s): 809f0ca

Update README.md

Files changed (1) hide show

README.md +3 -3

README.md CHANGED Viewed

@@ -48,10 +48,10 @@ it contains the following quantizations of the original weights from Together's
 > the model was fine-tuned for that size), it is recommended that you keep your context as small as possible
 > If you need quantizations for Together Computer's
-> [Llama-2-7B-32K-Instruct](https://huggingface.co/togethercomputer/Llama-2-7B-32K-Instruct/tree/main)
 > model, then look for
-> [LLaMA-2-7B-32K-Instruct_GGUF](https://huggingface.co/rozek/LLaMA-2-7B-32K-Instruct_GGUF/tree/main)
-> which is currently being uploaded
 ## How Quantization was done ##

 > the model was fine-tuned for that size), it is recommended that you keep your context as small as possible
 > If you need quantizations for Together Computer's
+> [Llama-2-7B-32K-Instruct](https://huggingface.co/togethercomputer/Llama-2-7B-32K-Instruct)
 > model, then look for
+> [LLaMA-2-7B-32K-Instruct_GGUF](https://huggingface.co/rozek/LLaMA-2-7B-32K-Instruct_GGUF)
+> which is currently being uploaded and should become available by the end of this day
 ## How Quantization was done ##