Omarrran
/

gemma2_model_9B_q4_km

Model card Files Files and versions Community

Omarrran commited on Jan 23

Commit

8e2aed5

·

verified ·

1 Parent(s): 2480f59

Update README.md

Files changed (1) hide show

README.md +21 -0

README.md CHANGED Viewed

@@ -7,6 +7,27 @@ license: mit
 ----------------------------------------------
 # llama.cpp Conversion, Quantization, & Merging

+## To use this model directly import the following
+```
+from llama_cpp import Llama
+llm = Llama.from_pretrained(
+	repo_id="Omarrran/gemma2_model_9B_q4_km",
+	filename="unsloth.Q4_K_M.gguf",
+)
+```
+ To know more about how to use llama_cpp Inferenece mode  see here : [Link to llama_cpp](https://github.com/ggerganov/llama.cpp)
 ----------------------------------------------
 # llama.cpp Conversion, Quantization, & Merging