TroyDoesAI/Codestral-RAG-19B-Pruned

Hey, sorry to get back to you so late. I haven't really played with imatrix quantization, so you are more of an expert in that department.
=)

I try to prune my models so they keep full precision and still fit under 24 GB.
Thank you for your kind words. I wish I could help; maybe message someone who releases imatrix quants.
I am very interested in how a pruned model performs after being quantized, since I feel I removed many redundant layers, and that redundancy might be what lets quantized models perform OK. Please stay in touch. I am very curious and want to learn more! :D

Reach out to me:
https://www.linkedin.com/in/troyandrewschultz/

Quant with imatrix?