Spaces: ggml-org/gguf-my-repo

Support creating IQ*_NL and IQ*_XS quants

#46

by rinaldow - opened Apr 22

Discussion

rinaldow

Apr 22

After reading up on these newer quantization methods, my understanding is that they perform better than the quant types this Space currently offers, and that llama.cpp already fully supports them as an optional parameter. Is that right, and could they be added here?
I'd appreciate any feedback, thanks!
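
To make the request concrete, here is a rough sketch of how a Space might assemble the quantize call for these types. It assumes the llama.cpp quantize tool is available as `llama-quantize` and accepts `IQ4_NL` and `IQ4_XS` as type names, which should be checked against `llama-quantize --help` for the build in use. The helper `build_quantize_cmd` and the file names are hypothetical, and the sketch only builds the argument list without running anything.

```python
# Hypothetical helper: builds (but does not run) a llama.cpp quantize command.
# Assumption: the binary is named `llama-quantize` and takes
#   [--imatrix FILE] INPUT.gguf OUTPUT.gguf TYPE
# Assumption: IQ4_NL and IQ4_XS are valid type names in your build.
IQ_TYPES = {"IQ4_NL", "IQ4_XS"}


def build_quantize_cmd(src, dst, qtype, imatrix=None, binary="llama-quantize"):
    """Return the argv list for quantizing `src` into `dst` with `qtype`."""
    qtype = qtype.upper()
    if qtype not in IQ_TYPES:
        raise ValueError(f"unsupported quant type: {qtype}")
    cmd = [binary]
    # An importance matrix is optional here; pass it only when one is given.
    if imatrix is not None:
        cmd += ["--imatrix", str(imatrix)]
    cmd += [str(src), str(dst), qtype]
    return cmd


if __name__ == "__main__":
    # Example file names are placeholders.
    print(" ".join(build_quantize_cmd("model-f16.gguf", "model-iq4_nl.gguf", "iq4_nl")))
```

The resulting list could be handed to `subprocess.run` once the type names are confirmed for the installed llama.cpp version.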

rinaldow changed discussion status to closed Apr 22
