Spaces:
Sleeping
Sleeping
update content
Browse files- text_content.py +1 -1
text_content.py
CHANGED
@@ -37,5 +37,5 @@ Inspired from the [🤗 Open LLM Leaderboard](https://huggingface.co/spaces/Hugg
|
|
37 |
|
38 |
NOTES_TEXT = """
|
39 |
* On most models on base MMLU, we collected the results for their official technical report. For the models that have not been reported, we use opencompass for evaluation.
|
40 |
-
* For other 2 base benchmarks and all 3 structured benchmarks: for chat models, we evaluate them under 0-shot setting; for completion model, we evaluate them under 0-shot setting with ppl.
|
41 |
"""
|
|
|
37 |
|
38 |
NOTES_TEXT = """
|
39 |
* On most models on base MMLU, we collected the results for their official technical report. For the models that have not been reported, we use opencompass for evaluation.
|
40 |
+
* For other 2 base benchmarks and all 3 structured benchmarks: for chat models, we evaluate them under 0-shot setting; for completion model, we evaluate them under 0-shot setting with ppl. And we keep the prompt format consistent across all benchmarks.
|
41 |
"""
|