twhoool02
/

Llama-2-7b-hf-AutoGPTQ

Text Generation

text-generation-inference

4-bit precision

Model card Files Files and versions Community

Model Card for twhoool02/Llama-2-7b-chat-hf-AutoGPTQ

Model Details

This model is a GPTQ quantized version of the meta-llama/Llama-2-7b-chat-hf model.

Developed by: Ted Whooley
Library: Transformers, GPTQ
Model type: llama
Model name: Llama-2-7b-chat-hf-AutoGPTQ
Pipeline tag: text-generation
Qunatized by: twhoool02
Language(s) (NLP): en
License: other

Downloads last month: 0

Safetensors

Model size

1.13B params

Tensor type

I32

·

FP16

·

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for twhoool02/Llama-2-7b-hf-AutoGPTQ

Base model

meta-llama/Llama-2-7b-chat-hf

Quantized

(66)

this model