Unable to recreate MMLU-Pro Score
#34
by
acostea-ionos
- opened
Hello, we where unable to recreate the MMLU-Pro score. Our results gives us a score of 47% (using TIGER-LAB dataset) while the score in the model card is 66.3%. Are you able to tell us how the test was done? Or if there is a typo of any sort going on?