InferenceIllusionist
commited on
Update README.md
Browse files
README.md
CHANGED
@@ -9,7 +9,7 @@ license: cc-by-nc-4.0
|
|
9 |
* Model creator: [Sao10K](https://huggingface.co/Sao10K/)
|
10 |
* Original model: [Fimbulvetr-11B-v2](https://huggingface.co/Sao10K/Fimbulvetr-11B-v2)
|
11 |
|
12 |
-
<b>Important: </b> Inferencing for newer formats
|
13 |
|
14 |
|
15 |
All credits to Sao10K for the original model. This is just a quick test of the new quantization types such as IQ_3S in an attempt to further reduce VRAM requirements.
|
|
|
9 |
* Model creator: [Sao10K](https://huggingface.co/Sao10K/)
|
10 |
* Original model: [Fimbulvetr-11B-v2](https://huggingface.co/Sao10K/Fimbulvetr-11B-v2)
|
11 |
|
12 |
+
<b>Important: </b> Inferencing for newer formats such as IQ3_S, IQ4_NL tested on latest llama.cpp & koboldcpp v.1.59.1. IQ1_S is only functional on llama.cpp as of 2/26/24.
|
13 |
|
14 |
|
15 |
All credits to Sao10K for the original model. This is just a quick test of the new quantization types such as IQ_3S in an attempt to further reduce VRAM requirements.
|