AIJapanese commited on
Commit
36b32c3
·
verified ·
1 Parent(s): 1781c0a

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +6 -6
README.md CHANGED
@@ -33,12 +33,12 @@ We used the [lm-evaluation-harness](https://github.com/Stability-AI/lm-evaluatio
33
  |---|---|---|---|---|---|---|---|---|---|
34
  | |3-shot|3-shot|0-shot|2-shot|1-shot|1-shot|0-shot|5-shot| |
35
  | |Acc.|Balanced Acc.|Balanced Acc.|Char-F1|Char-F1|ROUGE-2|Acc.|Acc.| |
36
- | Moriyasu_Qwen2_JP_7B (OURS) | **94.91** | **91.11** | 95.50 | 87.48 | 89.24 | 19.66 | **82.38** | 55.60 | **76.99** |
37
- | Qwen2-7B-Instruct | 90.80 | 78.07 | 93.29 | 92.90 | 83.34 | 19.05 | 72.16 | **61.20** | 73.85 |
38
- | SakanaAI/EvoLLM-JP-v1-7B | 89.19 | 66.02 | 95.55 | 92.10 | 86.41 | **23.31** | 81.65 | 47.60 | 72.73 |
39
- | Llama-3-ELYZA-JP-8B |92.40 | 64.85 | **95.67** | 92.04 | 87.43 | 21.35 | 78.21 | 49.20 | 72.64 |
40
- | Llama-3-Swallow-8B-Instruct-v0.1 | 92.49 | 62.12 | 94.27 | **93.73** | **90.83** | 19.61 | 74.04 | 50.00 | 72.14 |
41
- | Tanuki-8B-dpo-v1.0| 79.18 | 43.05 | 92.26 | 82.29 | 77.99 | 11.68 | 70.39 | 43.60 | 62.56 |
42
 
43
 
44
  ### Japanese tasks
 
33
  |---|---|---|---|---|---|---|---|---|---|
34
  | |3-shot|3-shot|0-shot|2-shot|1-shot|1-shot|0-shot|5-shot| |
35
  | |Acc.|Balanced Acc.|Balanced Acc.|Char-F1|Char-F1|ROUGE-2|Acc.|Acc.| |
36
+ | Moriyasu_Qwen2_JP_7B (OURS) | **0.9491** | **0.9111** | 0.9550 | 0.8748 | 0.8924 | 0.1966 | **0.8238** | 0.5560 | **0.7699** |
37
+ | Qwen2-7B-Instruct | 0.9080 | 0.7807 | 0.9329 | 0.9290 | 0.8334 | 0.1905 | 0.7216 | **0.6120** | 0.7385 |
38
+ | SakanaAI/EvoLLM-JP-v1-7B | 0.8919 | 0.6602 | 0.9555 | 0.9210 | 0.8641 | **0.2331** | 0.8165 | 0.4760 | 0.7273 |
39
+ | Llama-3-ELYZA-JP-8B |92.40 | 0.6485 | **0.9567** | 0.9204 | 0.8743 | 0.2135 | 0.7821 | 0.4920 | 0.7264 |
40
+ | Llama-3-Swallow-8B-Instruct-v0.1 | 0.9249 | 0.6212 | 0.9427 | **0.9373** | **0.9083** | 0.1961 | 0.7404 | 0.5000 | 0.7214 |
41
+ | Tanuki-8B-dpo-v1.0| 79.18 | 43.05 | 0.9226 | 0.8229 | 0.7799 | 0.1168 | 0.7039 | 0.4360 | 0.6256 |
42
 
43
 
44
  ### Japanese tasks