Update README.md
Browse files
README.md
CHANGED
@@ -200,6 +200,8 @@ srun -o $OUTFILE -e $ERRFILE --container-image="$CONTAINER" $MOUNTS bash -c "${c
|
|
200 |
#### MT-Bench (GPT-4-Turbo)
|
201 |
|
202 |
Evaluated using MT-Bench judging by GPT-4-Turbo as described in the [HelpSteer2 Dataset Paper](https://arxiv.org/abs/2406.08673)
|
|
|
|
|
203 |
|
204 |
| total | writing | roleplay | extraction | stem | humanities | reasoning | math | coding | turn 1 | turn 2 |
|
205 |
| ----- | ------- | -------- | ---------- | ---- | ---------- | --------- | ---- | ------ | ------ | ------ |
|
|
|
200 |
#### MT-Bench (GPT-4-Turbo)
|
201 |
|
202 |
Evaluated using MT-Bench judging by GPT-4-Turbo as described in the [HelpSteer2 Dataset Paper](https://arxiv.org/abs/2406.08673)
|
203 |
+
Please note that these numbers aren't comparable with original MT-bench as we corrected [some references](https://github.com/lm-sys/FastChat/pull/3158)
|
204 |
+
|
205 |
|
206 |
| total | writing | roleplay | extraction | stem | humanities | reasoning | math | coding | turn 1 | turn 2 |
|
207 |
| ----- | ------- | -------- | ---------- | ---- | ---------- | --------- | ---- | ------ | ------ | ------ |
|