Update README.md
README.md
@@ -40,7 +40,7 @@ apt-get install build-essential cmake curl libcurl4-openssl-dev -y
 git clone https://github.com/ggerganov/llama.cpp
 cmake llama.cpp -B llama.cpp/build \
     -DBUILD_SHARED_LIBS=OFF -DGGML_CUDA=ON -DLLAMA_CURL=ON
-cmake --build llama.cpp/build --config Release -j --clean-first --target llama-quantize llama-cli
+cmake --build llama.cpp/build --config Release -j --clean-first --target llama-quantize llama-cli llama-gguf-split
 cp llama.cpp/build/bin/llama-* llama.cpp
 ```
 3. It's best to use `--min-p 0.05` to counteract very rare token predictions - I found this to work well especially for the 1.58bit model.
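The added `llama-gguf-split` target builds the tool for working with GGUF models distributed as multiple split files. A minimal sketch of how the built binaries might be used together, assuming the split GGUF shards are already downloaded; the file names and the `-ngl` value below are placeholders, not part of the README:

```bash
# Merge the split GGUF shards into a single file with the newly built tool
# (the shard names here are hypothetical examples).
./llama.cpp/llama-gguf-split --merge \
    model-00001-of-00003.gguf \
    model-merged.gguf

# Run the 1.58bit quant with the recommended --min-p 0.05 sampling floor;
# adjust -ngl (GPU layer offload) to fit your VRAM.
./llama.cpp/llama-cli \
    --model model-merged.gguf \
    --min-p 0.05 \
    -ngl 40 \
    --prompt "Why is the sky blue?"
```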