alamios
/

QwQ-32B-DRAFT-0.5B-GGUF

Text Generation

Model card Files Files and versions Community

QwQ-32B-DRAFT-0.5B-GGUF

PREVIEW

Draft model for speculative decoding with Qwen/QwQ-32B, preview version.

Downloads last month: 295

GGUF

Model size

494M params

Architecture

qwen2

Hardware compatibility

Log In to view the estimation

4-bit

16-bit

Inference Providers NEW

Text Generation

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for alamios/QwQ-32B-DRAFT-0.5B-GGUF

Base model

Qwen/Qwen2.5-0.5B

Quantized

(60)

this model

Collection including alamios/QwQ-32B-DRAFT-0.5B-GGUF

Draft GGUFs

5 items • Updated Mar 19 • 1