Update README.md
README.md
CHANGED
@@ -113,7 +113,20 @@ This is my first English & Chinese MoE Model based on
 * [SUSTech/SUS-Chat-34B]


-# [Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/
+# [New Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/open-llm-leaderboard/open_llm_leaderboard)
+Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_cloudyu__Mixtral_34Bx2_MoE_60B)
+
+| Metric |Value|
+|-------------------|----:|
+|Avg. |27.42|
+|IFEval (0-Shot) |45.38|
+|BBH (3-Shot) |41.21|
+|MATH Lvl 5 (4-Shot)| 6.57|
+|GPQA (0-shot) |11.74|
+|MuSR (0-shot) |17.78|
+|MMLU-PRO (5-shot) |41.85|
+
+# [Old New Open LLM Leaderboard Evaluation Results](https://huggingface.co/spaces/HuggingFaceH4/open_llm_leaderboard)
 Detailed results can be found [here](https://huggingface.co/datasets/open-llm-leaderboard/details_cloudyu__Mixtral_34Bx2_MoE_60B)

 | Metric |Value|
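
Review note: the `Avg.` row added in this hunk can be checked against the six benchmark rows added with it. The sketch below assumes the average is a plain unweighted mean of those six values, which the diff itself does not state. The names `scores` and `mean_score` are illustrative only and are not part of the README or of any leaderboard tooling.

```python
# Scores copied from the rows added in this diff.
scores = {
    "IFEval (0-Shot)": 45.38,
    "BBH (3-Shot)": 41.21,
    "MATH Lvl 5 (4-Shot)": 6.57,
    "GPQA (0-shot)": 11.74,
    "MuSR (0-shot)": 17.78,
    "MMLU-PRO (5-shot)": 41.85,
}


def mean_score(values):
    """Unweighted arithmetic mean, rounded to two decimals (assumed method)."""
    values = list(values)
    return round(sum(values) / len(values), 2)


average = mean_score(scores.values())
print(f"Avg. = {average:.2f}")  # prints "Avg. = 27.42"
```

Under that assumption the computed mean agrees with the `27.42` reported in the table.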