asedmammad
/

longchat-7b-v1.5-32k-GGML

text-generation-inference

Model card Files Files and versions Community

asedmammad commited on Aug 20, 2023

Commit

9caecfe

•

1 Parent(s): 796ba97

Update README.md

Files changed (1) hide show

README.md +1 -1

README.md CHANGED Viewed

@@ -23,7 +23,7 @@ GGML files are for CPU + GPU inference using [llama.cpp](https://github.com/gger
 I use the following command line, adjust for your tastes and needs:
 ```
-./main -t 2 -ngl 32 -m longchat-7b-v1.5-32k.ggmlv3.q4_0.bin --color --rope-freq-scale 0.25 --rope-freq-base 240000 -c 32768 --temp 0.7 --repeat_penalty 1.1 -n -1 -p "prompt goes here"
 ```
 Change `-t 2` to the number of physical CPU cores you have. For example if your system has 8 cores/16 threads, use `-t 8`.

 I use the following command line, adjust for your tastes and needs:
 ```
+./main -t 2 -ngl 32 -m longchat-7b-v1.5-32k.ggmlv3.q4_0.bin --color --rope-freq-scale 0.25 --rope-freq-base 347620 -c 32768 --temp 0.7 --repeat_penalty 1.1 -n -1 -p "prompt goes here"
 ```
 Change `-t 2` to the number of physical CPU cores you have. For example if your system has 8 cores/16 threads, use `-t 8`.