Update README.md
it contains a few quantizations of the original weights from Together's fine-tuned model (as indicated by
the file names)

## How the Quantization was done ##

Since the author does not want arbitrary Python stuff loitering on his computer, the quantization was done
using [Docker](https://www.docker.com/).

Assuming that you have [Docker Desktop](https://www.docker.com/products/docker-desktop/) installed on
your system and also have a basic knowledge of how it is used, you may just follow the instructions shown
below in order to generate your own quantizations:

> Nota bene: you will need 30+ GB of free disk space, at least - depending on your quantization

1. create a new folder called `llama.cpp_in_Docker`<br>this folder will later be mounted into the Docker
   container and store the quantization results
2. download the weights for the fine-tuned LLaMA-2 model from
   [Hugging Face](https://huggingface.co/togethercomputer/LLaMA-2-7B-32K) into a subfolder of `llama.cpp_in_Docker`
   (let's call the new folder `LLaMA-2-7B-32K`)
3. within the Docker Desktop, search for and download a `basic-python` image - just use one of the
   most popular ones
4. from a terminal session on your host computer (i.e., not a Docker container!), start a new container for the
   downloaded image which mounts the folder we created before:<br> <br>``
   ...
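
The command itself is truncated above. Purely as an illustration - the image name, the mount paths, and the exact llama.cpp tool invocations below are assumptions, not taken from this README, and llama.cpp's tooling names change between versions - the container start and the subsequent conversion/quantization steps might look roughly like this:

```shell
# HYPOTHETICAL SKETCH - image name, paths and tool invocations are assumptions.

# on the host: start a container with the prepared folder bind-mounted,
# so the quantization results land on the host filesystem
docker run -it --rm \
  -v /path/to/llama.cpp_in_Docker:/llama.cpp_in_Docker \
  python:3.11 /bin/bash

# inside the container: fetch and build llama.cpp
git clone https://github.com/ggerganov/llama.cpp
cd llama.cpp && make

# convert the downloaded weights to GGUF, then quantize (here: Q4_0)
python3 convert.py /llama.cpp_in_Docker/LLaMA-2-7B-32K
./quantize /llama.cpp_in_Docker/LLaMA-2-7B-32K/ggml-model-f16.gguf \
           /llama.cpp_in_Docker/LLaMA-2-7B-32K/model-Q4_0.gguf Q4_0
```

Because the folder is bind-mounted, the resulting `.gguf` files appear directly in `llama.cpp_in_Docker` on the host once the container exits.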

## License ##

Concerning the license(s):

* the [original model](https://ai.meta.com/llama/) (from Meta AI) was released under a rather [permissive
  license](https://ai.meta.com/llama/license/)
* the fine-tuned model from Together Computer uses the
  [same license](https://huggingface.co/togethercomputer/LLaMA-2-7B-32K/blob/main/README.md)