Extreme Compression of Large Language Models via Additive Quantization • Paper • 2401.06118 • Published Jan 11, 2024 • 12
ISTA-DASLab/Meta-Llama-3-70B-Instruct-AQLM-2Bit-1x16 • Text Generation • Updated May 13, 2024 • 160 • 20
ISTA-DASLab/Mixtral-8x7B-Instruct-v0_1-AQLM-2Bit-1x16-hf • Text Generation • Updated Feb 27, 2024 • 49 • 18