Models prequantized with [HIGGS](https://arxiv.org/abs/2411.17525) zero-shot quantization. Requires the latest `transformers` to run.
-
ISTA-DASLab/Llama-3.3-70B-Instruct-HIGGS-GPTQ-4bit
Updated • 16 • 1 -
ISTA-DASLab/Llama-3.1-8B-Instruct-HIGGS-GPTQ-4bit
Text Generation • Updated • 11 -
ISTA-DASLab/Llama-3.1-8B-Instruct-HIGGS-GPTQ-3bit
Text Generation • Updated • 10 -
ISTA-DASLab/Llama-3.1-8B-HIGGS-GPTQ-4bit
Text Generation • Updated • 10