mradermacher
committed
Commit 8a60569
1 Parent(s): 8115011
auto-patch README.md
README.md CHANGED
```diff
@@ -28,6 +28,7 @@ static quants of https://huggingface.co/aaditya/OpenBioLLM-Llama3-8B
 You should use `--override-kv tokenizer.ggml.pre=str:llama3` and a current llama.cpp version to work around a bug in llama.cpp that made these quants.
 
 <!-- provided-files -->
+weighted/imatrix quants seem not to be available (by me) at this time. If they do not show up a week or so after the static ones, I have probably not planned for them. Feel free to request them by opening a Community Discussion.
 ## Usage
 
 If you are unsure how to use GGUF files, refer to one of [TheBloke's
```
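As context for the `--override-kv` line in the README above, here is a minimal usage sketch with a current llama.cpp build. The `llama-cli` binary name and the quant filename are assumptions for illustration; substitute the GGUF file you actually downloaded from this repository.

```sh
# Sketch: run a static quant with the pre-tokenizer override recommended in the README.
# OpenBioLLM-Llama3-8B.Q4_K_M.gguf is an assumed example filename, not a confirmed artifact name.
./llama-cli -m OpenBioLLM-Llama3-8B.Q4_K_M.gguf \
  --override-kv tokenizer.ggml.pre=str:llama3 \
  -p "Hello"
```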