Can it be quantized to a 4-bit version and support Ollama?
#2 by alleniver - opened
Could this model be quantized to a 4-bit version and made to work with Ollama? Ollama is used by many people now, is more convenient to deploy, and supports many interfaces. Please consider it, thank you!