ghermoso
/

vit-eGTZANplus

Image Classification

Generated from Trainer

Model card Files Files and versions Community

Vision Transformer (ViT) for Music Genre Classification

Model Overview

Model Name: ghermoso/vit-eGTZANplus
Task: Image Classification
Dataset: egtzan_plus
Model Architecture: Vision Transformer (ViT)
Finetuned from model: This model is a fine-tuned version of google/vit-base-patch16-224-in21k on an egtzan_plus dataset.

It achieves the following results on the evaluation set:

Loss: 0.8358
Accuracy: 0.7460

Downloads last month: 3

Inference Providers NEW

Image Classification

This model isn't deployed by any Inference Provider. 🙋 Ask for provider support

Model tree for ghermoso/vit-eGTZANplus

Base model

google/vit-base-patch16-224-in21k

Finetuned

(2113)

this model

Dataset used to train ghermoso/vit-eGTZANplus

Evaluation results

Metadata error: specify a dataset to view leaderboard