mradermacher
/

OpenBioLLM-Llama3-8B-GGUF

Inference Endpoints

Model card Files Files and versions Community

mradermacher commited on May 9

Commit

8115011

•

1 Parent(s): cb76929

Update README.md

Files changed (1) hide show

README.md +2 -1

README.md CHANGED Viewed

@@ -25,8 +25,9 @@ tags:
 <!-- ### vocab_type:  -->
 static quants of https://huggingface.co/aaditya/OpenBioLLM-Llama3-8B
 <!-- provided-files -->
-weighted/imatrix quants are available at https://huggingface.co/mradermacher/OpenBioLLM-Llama3-8B-i1-GGUF
 ## Usage
 If you are unsure how to use GGUF files, refer to one of [TheBloke's

 <!-- ### vocab_type:  -->
 static quants of https://huggingface.co/aaditya/OpenBioLLM-Llama3-8B
+You should use `--override-kv tokenizer.ggml.pre=str:llama3` and a current llama.cpp version to work around a bug in llama.cpp that made these quants.
 <!-- provided-files -->
 ## Usage
 If you are unsure how to use GGUF files, refer to one of [TheBloke's