rozek
/

LLaMA-2-7B-32K_GGUF

Text Generation

text-generation-inference

togethercomputer

Inference Endpoints

Model card Files Files and versions Community

rozek commited on Aug 28, 2023

Commit

42000cf

·

1 Parent(s): e88d5a9

Update README.md

Files changed (1) hide show

README.md +6 -3

README.md CHANGED Viewed

@@ -48,7 +48,8 @@ for the downloaded image which mounts the folder we crated before:<br>&nbsp;<br>
   -v ./llama.cpp_in_Docker:/llama.cpp \
   -t basic-python /bin/bash`<br>&nbsp;<br>(you may have to adjust the path to your local folder)
 5. back in the <u>Docker Desktop</u>, open the "Terminal" tab of the started container and enter the
-following commands:<br>&nbsp;<br>```
 apt update
 apt-get install software-properties-common -y
 apt-get update
@@ -63,12 +64,14 @@ choose "Edit file"
 change `ifneq` to `ifeq`
 8. save your change using the disk icon in the upper right corner of the editor pane and open the "Terminal"
 tab again
-9. now enter the following commands:<br>&nbsp;<br>```
 make
 python3 -m pip install -r requirements.txt
 python3 convert.py ../LLaMA-2-7B-32K
 ```
-10. you are now ready to run the actual quantization, e.g., using<br>&nbsp;<br>```
 ./quantize ../LLaMA-2-7B-32K/ggml-model-f16.gguf \
    ../LLaMA-2-7B-32K/LLaMA-2-7B-32K-Q4_0.gguf Q4_0
 ```

   -v ./llama.cpp_in_Docker:/llama.cpp \
   -t basic-python /bin/bash`<br>&nbsp;<br>(you may have to adjust the path to your local folder)
 5. back in the <u>Docker Desktop</u>, open the "Terminal" tab of the started container and enter the
+following commands:<br>&nbsp;<br>
+```
 apt update
 apt-get install software-properties-common -y
 apt-get update
 change `ifneq` to `ifeq`
 8. save your change using the disk icon in the upper right corner of the editor pane and open the "Terminal"
 tab again
+9. now enter the following commands:<br>&nbsp;<br>
+```
 make
 python3 -m pip install -r requirements.txt
 python3 convert.py ../LLaMA-2-7B-32K
 ```
+10. you are now ready to run the actual quantization, e.g., using<br>&nbsp;<br>
+```
 ./quantize ../LLaMA-2-7B-32K/ggml-model-f16.gguf \
    ../LLaMA-2-7B-32K/LLaMA-2-7B-32K-Q4_0.gguf Q4_0
 ```