Update README.md
Browse files
README.md
CHANGED
@@ -31,3 +31,5 @@ Detailed results can be found [here](https://huggingface.co/datasets/open-llm-le
|
|
31 |
| Winogrande (5-shot) | 77.35 |
|
32 |
| GSM8K (5-shot) | 5.76 |
|
33 |
| DROP (3-shot) | 5.87 |
|
|
|
|
|
|
31 |
| Winogrande (5-shot) | 77.35 |
|
32 |
| GSM8K (5-shot) | 5.76 |
|
33 |
| DROP (3-shot) | 5.87 |
|
34 |
+
|
35 |
+
I'm not sure what's going on with GSM8K. Since GSK8K (train split) data was included in the Ferret dataset, I suspect that either it is over-correcting itself or the eval is broken.
|