InferenceIllusionist/Fimbulvetr-11B-v2-iMat-GGUF


InferenceIllusionist committed on Feb 26, 2024

Commit be559da (verified) · 1 Parent(s): 55b4921

Update README.md

Files changed (1)

README.md +1 -1

@@ -9,7 +9,7 @@ license: cc-by-nc-4.0
 * Model creator: [Sao10K](https://huggingface.co/Sao10K/)
 * Original model: [Fimbulvetr-11B-v2](https://huggingface.co/Sao10K/Fimbulvetr-11B-v2)
-<b>Important: </b> Inferencing for newer formats i.e. IQ1_S, IQ3_S, IQ4_NL tested on latest llama.cpp & koboldcpp v.1.59.1
+<b>Important: </b> Inferencing for newer formats such as IQ3_S, IQ4_NL tested on latest llama.cpp & koboldcpp v.1.59.1. IQ1_S is only functional on llama.cpp as of 2/26/24.
 All credits to Sao10K for the original model. This is just a quick test of the new quantization types such as IQ_3S in an attempt to further reduce VRAM requirements.