Running 51 51 R1-distilled leaderboard β‘ Generate a leaderboard for open-r1 models across various benchmarks