chenxingphh
commited on
Commit
•
8b987c0
1
Parent(s):
6f46bdc
Update README_en.md
Browse files- README_en.md +11 -11
README_en.md
CHANGED
@@ -53,17 +53,17 @@ We use [opencompass](https://opencompass.org.cn) to perform 5-shot on the follow
|
|
53 |
The evaluation results of other models are taken
|
54 |
from [opencompass leaderboard](https://opencompass.org.cn/leaderboard-llm).
|
55 |
|
56 |
-
| | C-Eval
|
57 |
-
|
58 |
-
| **GPT-4** | 69.9
|
59 |
-
| **ChatGPT** | 52.5
|
60 |
-
| **Claude-1** | 52
|
61 |
-
| **TigerBot-70B-Chat-V2** | 57.7
|
62 |
-
| **WeMix-LLaMA2-70B** | 55.2
|
63 |
-
| **LLaMA-2-70B-Chat** | 44.3
|
64 |
-
| **Qwen-14B-Chat** | 71.7
|
65 |
-
| **Baichuan-13B-Chat** |
|
66 |
-
| **OrionStar-Yi-34B-Chat** | 77.71
|
67 |
|
68 |
# Model Inference
|
69 |
|
|
|
53 |
The evaluation results of other models are taken
|
54 |
from [opencompass leaderboard](https://opencompass.org.cn/leaderboard-llm).
|
55 |
|
56 |
+
| | C-Eval | MMLU | CMMLU |
|
57 |
+
|---------------------------|-----------|--------|-----------|
|
58 |
+
| **GPT-4** | 69.9 | **83** | 71 |
|
59 |
+
| **ChatGPT** | 52.5 | 69.1 | 53.9 |
|
60 |
+
| **Claude-1** | 52 | 65.7 | - |
|
61 |
+
| **TigerBot-70B-Chat-V2** | 57.7 | 65.9 | 59.9 |
|
62 |
+
| **WeMix-LLaMA2-70B** | 55.2 | 71.3 | 56 |
|
63 |
+
| **LLaMA-2-70B-Chat** | 44.3 | 63.8 | 43.3 |
|
64 |
+
| **Qwen-14B-Chat** | 71.7 | 66.4 | 70 |
|
65 |
+
| **Baichuan-13B-Chat** | 53.4 | 50.5 | 50.7 |
|
66 |
+
| **OrionStar-Yi-34B-Chat** | **77.71** | 78.32 | **73.52** |
|
67 |
|
68 |
# Model Inference
|
69 |
|