---
license: llama3
datasets:
- DeepMount00/llm_ita_ultra
language:
- it
---
Evaluation
For a detailed comparison of model performance, check out the Leaderboard for Italian Language Models.
Here's a breakdown of the performance metrics:
| Metric | hellaswag_it acc_norm | arc_it acc_norm | m_mmlu_it 5-shot acc | Average |
|---|---|---|---|---|
| Accuracy Normalized | 0.6483 | 0.5329 | XXXXXXXX | XXXXXX |
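
The Average column can be reproduced with a few lines of Python. This is a minimal sketch that assumes the average is the unweighted mean of the three benchmark scores; the card does not state how it is computed. The helper name `average_score` is hypothetical, and the `m_mmlu_it` score is left as `None` because its value is not available here.

```python
from statistics import fmean
from typing import Mapping, Optional


def average_score(scores: Mapping[str, Optional[float]]) -> Optional[float]:
    """Return the unweighted mean of the scores, or None if any score is missing.

    Assumption: the Average column is a plain arithmetic mean.
    """
    values = list(scores.values())
    if not values or any(v is None for v in values):
        return None
    return fmean(values)


# Scores copied from the table; m_mmlu_it is unknown here, so it stays None.
scores = {
    "hellaswag_it": 0.6483,
    "arc_it": 0.5329,
    "m_mmlu_it": None,
}

print(average_score(scores))  # prints None until the missing score is filled in
```

Returning `None` for an incomplete row avoids reporting a mean over only two benchmarks, which would not be comparable with the three-benchmark averages of other models.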