metadata

language: en
license: mit
tags:
  - VIT
  - image-classification
  - drowsiness-detection

VIT_Drowsiness

This model is a fine-tuned version of google/vit-base-patch16-224 for drowsiness detection.

Model description

The base model is a Vision Transformer (ViT) that uses 16×16 pixel patches and a 224×224 input resolution. After fine-tuning, it acts as a binary image classifier that assigns each input image to one of two classes: drowsy or not drowsy.

Intended uses & limitations

This model is intended for detecting drowsiness in images of faces. It should be used on facial images similar to those in the training dataset; its accuracy on images that differ from the training data is not documented in this card.
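A minimal inference sketch, assuming the checkpoint is published on the Hugging Face Hub and loads through the `transformers` image-classification pipeline. The helper names `top_prediction` and `classify_image` are illustrative and not part of this model. The repository id and image path are left as arguments because this card does not state them, and the exact label strings depend on the checkpoint's `id2label` configuration.

```python
from typing import Dict, List


def top_prediction(predictions: List[Dict]) -> Dict:
    """Return the highest-scoring entry from image-classification pipeline output.

    The pipeline returns a list of {"label": str, "score": float} dicts.
    """
    return max(predictions, key=lambda p: p["score"])


def classify_image(image_path: str, model_id: str) -> Dict:
    """Classify one face image with a fine-tuned ViT checkpoint.

    `model_id` is the Hub repository id of this model (not stated in the card).
    """
    # Imported lazily so top_prediction can be used without transformers installed.
    from transformers import pipeline

    classifier = pipeline("image-classification", model=model_id)
    return top_prediction(classifier(image_path))
```

Calling `classify_image(path, repo_id)` with a local image path and the model's repository id returns a single dict holding the predicted label and its score.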

Training data

The model was trained on a custom dataset located at /kaggle/input/nthuddd2/train_data. The dataset was split into 70% training, 15% validation, and 15% test sets.
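The 70/15/15 split can be sketched as follows. This is an illustration only: the card does not say whether the original split was random, stratified, or grouped by subject, so the seeded random shuffle and the function name `split_dataset` are assumptions.

```python
import random
from typing import List, Sequence, Tuple


def split_dataset(
    items: Sequence[str], seed: int = 42
) -> Tuple[List[str], List[str], List[str]]:
    """Shuffle items reproducibly and split them 70% / 15% / 15%."""
    shuffled = list(items)
    random.Random(seed).shuffle(shuffled)  # seeded for reproducibility (assumed)

    n = len(shuffled)
    n_train = n * 70 // 100  # integer arithmetic avoids float rounding
    n_val = n * 15 // 100

    train = shuffled[:n_train]
    val = shuffled[n_train:n_train + n_val]
    test = shuffled[n_train + n_val:]  # remainder goes to the test set
    return train, val, test
```

Because the test set takes the remainder, any items left over from integer rounding end up there rather than being dropped.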

Training procedure

The model was trained for 10 epochs using the Lion optimizer with a learning rate of 0.0001 and a weight decay of 0.01. A cosine learning rate scheduler with a warmup ratio of 0.1 was used.
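To make the schedule concrete, the sketch below computes the learning rate at a given step for a cosine schedule with a 0.1 warmup ratio and a peak of 0.0001. It assumes linear warmup from zero followed by a half-cosine decay to zero, which is a common form of this scheduler; the card does not confirm the exact variant, and the function name `lr_at_step` is hypothetical.

```python
import math


def lr_at_step(
    step: int,
    total_steps: int,
    base_lr: float = 1e-4,      # learning rate from the card
    warmup_ratio: float = 0.1,  # warmup ratio from the card
) -> float:
    """Learning rate under linear warmup followed by cosine decay (assumed form)."""
    warmup_steps = round(total_steps * warmup_ratio)
    if step < warmup_steps:
        # Linear ramp from 0 up to base_lr over the warmup phase.
        return base_lr * step / max(1, warmup_steps)
    # Fraction of the decay phase completed, in [0, 1].
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    # Half-cosine decay from base_lr down to 0.
    return base_lr * 0.5 * (1.0 + math.cos(math.pi * progress))
```

With a hypothetical 100 steps per epoch, 10 epochs give 1000 total steps, so warmup covers the first 100 steps and the rate then decays over the remaining 900.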

Evaluation results

Evaluation results have not been reported yet.