Vision Transformer (ViT) for Music Genre Classification

Model Overview

It achieves the following results on the evaluation set:

  • Loss: 0.8358
  • Accuracy: 0.7460
Downloads last month
3
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ghermoso/vit-eGTZANplus

Finetuned
(2113)
this model

Dataset used to train ghermoso/vit-eGTZANplus