| license: mit | |
| Original model: https://huggingface.co/Qwen/Qwen2-VL-2B-Instruct | |
| Quantitation documentation: https://docs.openvino.ai/nightly/notebooks/qwen2-vl-with-output.html | |
| Quantitation config: | |
| ```python | |
| import nncf | |
| compression_configuration = { | |
| "mode": nncf.CompressWeightsMode.INT4_ASYM, | |
| "group_size": 128, | |
| "ratio": 0.5, | |
| } | |
| ``` |