ISTA-DASLab/C4-tokenized-llama2
Updated
•
242
None defined yet.
Bridging the Gap Between Promise and Performance for Microscaling FP4 Quantization
The Geometry of LLM Quantization: GPTQ as Babai's Nearest Plane Algorithm