xMaulana committed on
Commit d62b945
1 Parent(s): df43708

Adding Evaluation Results

This is an automated PR created with https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr

The purpose of this PR is to add evaluation results from the Open LLM Leaderboard to your model card.

If you encounter any issues, please report them to https://huggingface.co/spaces/Weyaxi/open-llm-leaderboard-results-pr/discussions

Files changed (1)
  1. README.md +19 -6
README.md CHANGED
@@ -1,17 +1,17 @@
  ---
- base_model:
- - meta-llama/Llama-3.2-3B-Instruct
- datasets:
- - NekoFi/alpaca-gpt4-indonesia-cleaned
  language:
  - id
  license: apache-2.0
- pipeline_tag: text-generation
  tags:
  - Indonesian
  - Chat
  - Instruct
  - unsloth
+ base_model:
+ - meta-llama/Llama-3.2-3B-Instruct
+ datasets:
+ - NekoFi/alpaca-gpt4-indonesia-cleaned
+ pipeline_tag: text-generation
  model-index:
  - name: FinMatcha-3B-Instruct
  results:
@@ -186,4 +186,17 @@ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-le
  |MATH Lvl 5 (4-Shot)|10.20|
  |GPQA (0-shot) | 0.34|
  |MuSR (0-shot) | 6.62|
- |MMLU-PRO (5-shot) |16.04|
+ |MMLU-PRO (5-shot) |16.04|
+ # [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
+ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_xMaulana__FinMatcha-3B-Instruct)
+
+ | Metric |Value|
+ |-------------------|----:|
+ |Avg. |11.47|
+ |IFEval (0-Shot) |48.08|
+ |BBH (3-Shot) | 4.28|
+ |MATH Lvl 5 (4-Shot)| 3.85|
+ |GPQA (0-shot) | 1.34|
+ |MuSR (0-shot) | 5.74|
+ |MMLU-PRO (5-shot) | 5.54|
+
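
Reviewer note: the `Avg.` row in the added table appears to be the unweighted mean of the six benchmark scores. The diff itself does not state how the value is computed, so treat that as an assumption. A minimal Python sketch to check it, with the scores copied from the added rows and variable names chosen only for illustration:

```python
# Scores copied from the rows added by this PR.
# Assumption: "Avg." is the plain arithmetic mean of these six values.
scores = {
    "IFEval (0-Shot)": 48.08,
    "BBH (3-Shot)": 4.28,
    "MATH Lvl 5 (4-Shot)": 3.85,
    "GPQA (0-shot)": 1.34,
    "MuSR (0-shot)": 5.74,
    "MMLU-PRO (5-shot)": 5.54,
}

# Unweighted mean across all benchmarks.
average = sum(scores.values()) / len(scores)

# Print with two decimals to compare against the "Avg." row.
print(f"Avg. = {average:.2f}")
```

Under that assumption the result matches the 11.47 reported in the table when rounded to two decimals.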