graelo/Qwen2.5-7B-Instruct-1M-AWQ

4-bit precision


Quantized from Qwen/Qwen2.5-7B-Instruct-1M to 4-bit precision with AWQ (GEMM variant).
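To make "4 bits" concrete, here is a minimal sketch of how 4-bit weights can be stored inside 32-bit integers, eight values per word. This kind of packing is one plausible reason the checkpoint lists I32 tensors and reports fewer parameters than the nominal 7B. The function names `pack_int4` and `unpack_int4` are hypothetical, and the sequential nibble order is an assumption for illustration; the actual AWQ GEMM layout may order values differently.

```python
# Illustrative sketch only: the nibble order used here is an assumption
# and is not claimed to match the layout used by AWQ GEMM kernels.

def pack_int4(values):
    """Pack unsigned 4-bit integers (0..15) into 32-bit words, eight per word."""
    if len(values) % 8 != 0:
        raise ValueError("number of values must be a multiple of 8")
    words = []
    for start in range(0, len(values), 8):
        word = 0
        for slot, value in enumerate(values[start:start + 8]):
            if not 0 <= value <= 15:
                raise ValueError(f"value {value} does not fit in 4 bits")
            # Place each value in its own 4-bit slot, lowest slot first.
            word |= value << (4 * slot)
        words.append(word)
    return words


def unpack_int4(words):
    """Recover the 4-bit values from words produced by pack_int4."""
    return [(word >> (4 * slot)) & 0xF for word in words for slot in range(8)]


weights = list(range(16))           # sixteen toy 4-bit "weights"
packed = pack_int4(weights)         # stored as two 32-bit words
assert unpack_int4(packed) == weights
print(len(weights), "4-bit values stored in", len(packed), "32-bit words")
```

Each stored integer carries eight quantized weights, so the number of stored elements in the packed tensors is roughly one eighth of the number of original weights. Scales and any unquantized layers are kept in 16-bit formats and are counted separately.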

Downloads last month: 348

Safetensors

Model size: 1.96B params

Tensor type: I32 · BF16 · FP16


Model tree for graelo/Qwen2.5-7B-Instruct-1M-AWQ

Base model: Qwen/Qwen2.5-7B

Finetuned: Qwen/Qwen2.5-7B-Instruct-1M

Quantized (64): this model