metadata
base_model: HuggingFaceTB/SmolLM2-1.7B-Instruct
language:
- en
library_name: transformers
license: apache-2.0
tags:
- openvino
- nncf
- fp16
This model is a quantized version of HuggingFaceTB/SmolLM2-1.7B-Instruct
and is converted to the OpenVINO format. This model was obtained via the nncf-quantization space with optimum-intel.
First make sure you have optimum-intel
installed:
pip install optimum[openvino]
To load your model you can do as follows:
from optimum.intel import OVModelForCausalLM
model_id = "AIFunOver/SmolLM2-1.7B-Instruct-openvino-fp16"
model = OVModelForCausalLM.from_pretrained(model_id)