nintwentydo/pixtral-12b-2409-2of4-sparse

Image-Text-to-Text

compressed-tensors

nintwentydo committed 6 days ago

Commit 2b60200 · verified · 1 Parent(s): c60426f

Update README.md

Files changed (1)

README.md +1 -2

@@ -16,14 +16,13 @@ license: apache-2.0
 library_name: vllm
 base_model:
 - mistral-community/pixtral-12b
-- mgoin/pixtral-12b
 - mistralai/Pixtral-12B-2409
 base_model_relation: quantized
 ---
 # Pixtral-12B-2409: 2:4 sparse
-2:4 sparse version of [mistral-community/pixtral-12b](https://huggingface.co/mgoin/pixtral-12b) using [kylesayrs/gptq-partition branch of LLM Compressor](https://github.com/vllm-project/llm-compressor/tree/kylesayrs/gptq-partition) for optimised inference on VLLM.
+2:4 sparse version of [mistral-community/pixtral-12b](https://huggingface.co/mistral-community/pixtral-12b) using [kylesayrs/gptq-partition branch of LLM Compressor](https://github.com/vllm-project/llm-compressor/tree/kylesayrs/gptq-partition) for optimised inference on VLLM.
 Example VLLM usage
 ```