Multi-Intent Detection (MID) Model

This model was fine-tuned for the task of Multi-Intent Detection (MID), a type of multi-label classification where each input can have multiple labels assigned. The dataset used for fine-tuning is specifically designed to simplify the MID task, with the number of labels limited to two per instance.

Model Details

  • Base Model: DeBERTa-v3-base
  • Task: Multi-label classification
  • Number of Labels: 2
  • Fine-tuning Framework: Hugging Face Transformers

Training Configuration

  • Training Arguments:
    • Learning Rate: 2e-5
    • Batch Size (Train): 16
    • Batch Size (Eval): 16
    • Gradient Accumulation Steps: 2
    • Number of Epochs: 8
    • Weight Decay: 0.01
    • Warmup Ratio: 10%
    • Learning Rate Scheduler Type: Cosine
    • Mixed Precision Training: Enabled (FP16)
    • Logging Steps: 50

Performance Metrics

Epoch Training Loss Validation Loss Precision Recall F1 Score Accuracy
0 0.069100 0.069115 0.000000 0.000000 0.000000 0.000000
2 0.024100 0.022929 0.952334 0.316920 0.475576 0.078652
4 0.009200 0.010799 0.959768 0.819894 0.884334 0.653668
6 0.006300 0.008773 0.963243 0.883344 0.921565 0.770654
7 0.006200 0.008707 0.961635 0.886319 0.922442 0.775281

Final Evaluation Metrics (Epoch 8):

  • Validation Loss: 0.0087
  • Precision: 0.9616
  • Recall: 0.8863
  • F1 Score: 0.9224
  • Accuracy: 0.7753

Limitations

  • Simplified Multi-Label Setting: This model assumes a fixed number of two labels per instance, which may not generalize to datasets with more complex multi-label settings.
  • Performance on Unseen Data: The model's performance may degrade if applied to data distributions significantly different from the training dataset.
Downloads last month
3
Safetensors
Model size
185M params
Tensor type
F32
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.