metadata
language:
  - pt
  - en
tags:
  - aes
datasets:
  - kamel-usp/aes_enem_dataset
base_model: microsoft/phi-4
metrics:
  - accuracy
  - qwk
library_name: peft
model-index:
  - name: phi4-balanced-C2
    results:
      - task:
          type: text-classification
          name: Automated Essay Scoring
        dataset:
          name: Automated Essay Score ENEM Dataset
          type: kamel-usp/aes_enem_dataset
          config: JBCS2025
          split: test
        metrics:
          - name: Macro F1
            type: f1
            value: 0.2886714162530285
          - name: QWK
            type: qwk
            value: 0.3891803278688525
          - name: Weighted F1
            type: f1
            value: 0.4493051983710385

Model ID: phi4-balanced-C2

Results

| Metric | test_data |
|---|---|
| eval_accuracy | 0.471014 |
| eval_RMSE | 61.2905 |
| eval_QWK | 0.38918 |
| eval_Macro_F1 | 0.288671 |
| eval_Weighted_F1 | 0.449305 |
| eval_Micro_F1 | 0.471014 |
| eval_HDIV | 0.0869565 |
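
For readers who want to compute comparable figures on their own predictions, the following is a minimal sketch of QWK, macro F1, and RMSE in plain Python. It is not the evaluation code used for this model. The grade scale `GRADES` (0 to 200 in steps of 40) and the helper names are assumptions made for illustration, and macro F1 is averaged only over grades that occur in the labels or the predictions.

```python
import math

# Assumed grade scale for one ENEM competence (illustrative, not from this card).
GRADES = [0, 40, 80, 120, 160, 200]


def quadratic_weighted_kappa(y_true, y_pred, labels=GRADES):
    """Cohen's kappa with quadratic weights over an ordered label set."""
    n = len(labels)
    idx = {g: i for i, g in enumerate(labels)}
    observed = [[0] * n for _ in range(n)]
    for t, p in zip(y_true, y_pred):
        observed[idx[t]][idx[p]] += 1
    total = len(y_true)
    true_hist = [sum(row) for row in observed]
    pred_hist = [sum(observed[i][j] for i in range(n)) for j in range(n)]
    num = den = 0.0
    for i in range(n):
        for j in range(n):
            w = (i - j) ** 2 / (n - 1) ** 2  # quadratic disagreement weight
            num += w * observed[i][j]
            den += w * true_hist[i] * pred_hist[j] / total
    # Convention chosen here: a single shared class counts as perfect agreement.
    return 1.0 if den == 0 else 1.0 - num / den


def macro_f1(y_true, y_pred):
    """Unweighted mean of per-class F1 over classes seen in either list."""
    classes = sorted(set(y_true) | set(y_pred))
    scores = []
    for c in classes:
        tp = sum(1 for t, p in zip(y_true, y_pred) if t == c and p == c)
        fp = sum(1 for t, p in zip(y_true, y_pred) if t != c and p == c)
        fn = sum(1 for t, p in zip(y_true, y_pred) if t == c and p != c)
        denom = 2 * tp + fp + fn
        scores.append(2 * tp / denom if denom else 0.0)
    return sum(scores) / len(scores)


def rmse(y_true, y_pred):
    """Root mean squared error, in the same units as the grades."""
    return math.sqrt(sum((t - p) ** 2 for t, p in zip(y_true, y_pred)) / len(y_true))


if __name__ == "__main__":
    # Toy data, invented for the example.
    gold = [40, 40, 80, 80]
    pred = [40, 80, 80, 80]
    print("QWK:", quadratic_weighted_kappa(gold, pred))
    print("Macro F1:", macro_f1(gold, pred))
    print("RMSE:", rmse(gold, pred))
```

Note that for single-label classification, micro F1 equals accuracy, which is why the two values in the results above coincide.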