Support creating IQ*_NL and IQ*_XS quants

#46
by rinaldow - opened

After reading into these new quantization methods I have come to believe that these new quants perform better than the current implementation and are fully supported by llama.cpp as an optional parameter?
Hope I'll receive any feedback, thanks!

rinaldow changed discussion status to closed

Sign up or log in to comment