metadata

tags:
  - autotrain
  - image-classification
  - eligapris
  - vision
base_model: microsoft/resnet-152
license: apache-2.0
language:
  - en
metrics:
  - accuracy
pipeline_tag: image-classification
library_name: transformers

Corn Leaf Disease Classification Model Analysis

Dataset Breakdown

The dataset consists of four classes with the following distribution:

Class	Number of Images
Healthy_Leaf	3021
Gray_Leaf_Spot	2478
Common_Rust	2949
Northern_Leaf_Blight	3303

Note: There is a slight class imbalance, with Gray_Leaf_Spot having notably fewer images compared to the other classes.

Model Performance Metrics

The model was trained using AutoTrain for image classification. Here's a breakdown of the validation metrics:

Metric	Value
Loss	0.3697
Accuracy	0.8585
F1 (Macro)	0.6843
F1 (Micro)	0.8585
F1 (Weighted)	0.8303
Precision (Macro)	0.8204
Precision (Micro)	0.8585
Precision (Weighted)	0.8821
Recall (Macro)	0.7170
Recall (Micro)	0.8585
Recall (Weighted)	0.8585

Metric Explanations

Loss (0.3697): This relatively low value indicates that the model is learning well.
Accuracy (0.8585): The model correctly classifies 85.85% of all instances across all classes.
F1 Score:
- Macro (0.6843): The unweighted mean of F1 scores for each class.
- Micro (0.8585): Calculated globally by counting the total true positives, false negatives, and false positives.
- Weighted (0.8303): The weighted average of F1 scores for each class, accounting for class imbalance.
Precision:
- Macro (0.8204): The unweighted mean of precision scores for each class.
- Micro (0.8585): The global precision across all classes.
- Weighted (0.8821): The weighted average of precision scores for each class.
Recall:
- Macro (0.7170): The unweighted mean of recall scores for each class.
- Micro (0.8585): The global recall across all classes.
- Weighted (0.8585): The weighted average of recall scores for each class.

Analysis

Class Imbalance: The difference between macro and micro scores suggests class imbalance, which aligns with our dataset breakdown. The Gray_Leaf_Spot class, having fewer images, likely contributes to this imbalance.
Precision vs Recall: Precision scores are generally higher than recall scores, especially for macro metrics. This suggests the model is more cautious in its predictions, preferring to be correct when it does predict a class.
Performance on Majority vs Minority Classes: The higher micro and weighted scores compared to macro scores indicate that the model performs better on more frequent classes. This is likely due to the class imbalance, with the model potentially struggling more with the Gray_Leaf_Spot class.
Overall Performance: With an accuracy of 85.85%, the model shows good overall performance. However, there's room for improvement, especially in handling the class imbalance.