Qwen2.5 1.5B Instruct Draft

This model is exactly the same as Qwen2.5 1.5B Instruct, but the vocabulary is padded to the same size as larger Qwen models (like Qwen2.5 72B Instruct). This allows it to be used as a draft model in speculative decoding.

Downloads last month
4
Safetensors
Model size
1.54B params
Tensor type
BF16
·
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.