Pixtral-12B-2409: 2:4 sparse

Example VLLM usage

vllm serve nintwentydo/pixtral-12b-2409-2of4-sparse --max-model-len 131072 --limit-mm-per-prompt 'image=4'

If you want a more advanced/fully featured chat template you can use this jinja template

Safetensors

Model size

12.7B params

Tensor type

BF16

Inference Providers NEW

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for nintwentydo/pixtral-12b-2409-2of4-sparse

Base model

Quantized

(5)

this model