Update README.md
README.md
CHANGED

@@ -23,6 +23,22 @@ Install llama.cpp through brew (works on Mac and Linux)
brew install llama.cpp

```

# Or compile it to take advantage of Nvidia CUDA hardware:

```bash
git clone https://github.com/ggerganov/llama.cpp.git
cd llama*
# Look at the docs for other hardware builds, or to make sure none of this has changed.

cmake -B build -DGGML_CUDA=ON
CMAKE_ARGS="-DGGML_CUDA=on" cmake --build build --config Release # -j6 (optional: use a number less than the number of cores)

# If your version of gcc is > 12 and the build gives errors, use conda to install gcc-12 and activate it.
# Run the above cmake commands again.
# Then run conda deactivate and re-run the last line once more to link the build outside of conda.
```
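The gcc-12 workaround above is described only in comments; a minimal sketch of what that sequence could look like is shown below. The environment name and the conda-forge package names (`gcc`, `gxx`) are assumptions, not part of the README, so check conda-forge for the compiler packages available on your platform.

```bash
# Assumed workaround when the system gcc is newer than 12 (package names are an assumption):
conda create -n gcc12 -c conda-forge gcc=12 gxx=12
conda activate gcc12

# Re-run the configure and build steps with gcc-12 on the PATH:
cmake -B build -DGGML_CUDA=ON
cmake --build build --config Release

# Deactivate conda and run the build step once more so the result links
# against the system libraries rather than conda's:
conda deactivate
cmake --build build --config Release
```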
Invoke the llama.cpp server or the CLI.
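For the server, a minimal sketch of an invocation is given below; the model path and port are placeholders, and the current flags should be confirmed with `llama-server --help`.

```bash
# Start the llama.cpp HTTP server on port 8080; the GGUF model path is a placeholder.
llama-server -m ./models/my-model.gguf --port 8080
```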
### CLI: