Statically quantized DistilBERT base uncased finetuned MPRC

Table of Contents

Model Details

Model Description: This model is a DistilBERT fine-tuned on MPRC statically quantized with optimum-intel through the usage of huggingface/optimum-intel through the usage of Intel® Neural Compressor.

  • Model Type: Text Classification
  • Language(s): English
  • License: Apache-2.0
  • Parent Model: For more details on the original model, we encourage users to check out this model card.

How to Get Started With the Model

PyTorch

To load the quantized model, you can do as follows:

from optimum.intel import INCModelForSequenceClassification

model_id = "Intel/distilbert-base-uncased-MRPC-int8-static"
model = INCModelForSequenceClassification.from_pretrained(model_id)

Test result

INT8 FP32
Accuracy (eval-f1) 0.9007 0.9027
Model size (MB) 242 268
Downloads last month
23
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Dataset used to train Intel/distilbert-base-uncased-MRPC-int8-static-inc

Collection including Intel/distilbert-base-uncased-MRPC-int8-static-inc