AIJapanese committed
Commit • 4a207a8 (1 Parent(s): 881cbc2)
Update README.md
README.md CHANGED
@@ -20,7 +20,7 @@ base_model:
 We used the [lm-evaluation-harness](https://github.com/Stability-AI/lm-evaluation-harness/tree/jp-stable) repo to evaluate across 8 tasks, and the results are as follows:
 
 
-|Model|JCommonsenseQA|JNLI|JMARC|JSQuAD|JAQKET-V2|XL-SUM|XWINOGRAD|MGSM|JA AVG|
+|Model|JCommonsenseQA|JNLI|JMARC|JSQuAD|JAQKET-V2|XL-SUM|XWINOGRAD|MGSM|JA AVG (8 tasks)|
 |---|---|---|---|---|---|---|---|---|---|
 | |3-shot|3-shot|0-shot|2-shot|1-shot|1-shot|0-shot|5-shot| |
 | |Acc.|Balanced Acc.|Balanced Acc.|Char-F1|Char-F1|ROUGE-2|Acc.|Acc.| |
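
The renamed column makes explicit that the average covers all eight tasks in the table. As a minimal sketch, assuming "JA AVG (8 tasks)" is the unweighted mean of the eight per-task scores (the diff does not state the formula), it could be computed as below. The function name `ja_avg` and the score values are hypothetical placeholders, not results from the README.

```python
from statistics import mean

# Task columns exactly as they appear in the README table header.
TASKS = [
    "JCommonsenseQA", "JNLI", "JMARC", "JSQuAD",
    "JAQKET-V2", "XL-SUM", "XWINOGRAD", "MGSM",
]


def ja_avg(scores: dict[str, float]) -> float:
    """Unweighted mean over the eight tasks (assumed definition of JA AVG)."""
    missing = [task for task in TASKS if task not in scores]
    if missing:
        # Refuse to average over fewer than eight tasks.
        raise ValueError(f"missing scores for: {', '.join(missing)}")
    return mean(scores[task] for task in TASKS)


# Placeholder values for illustration only: 10.0, 20.0, ..., 80.0.
example = {task: float(i * 10) for i, task in enumerate(TASKS, start=1)}
print(ja_avg(example))  # 45.0
```

Raising on a missing task keeps the label honest: a value reported as an 8-task average is never computed from fewer scores.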