AI2 WildBench Leaderboard (V2)
Display and explore a leaderboard of language models
Note: The leaderboard for visualizing the results and collecting human feedback.
Note: Examples for evaluating LLMs.
Note: The model outputs for verified LLMs on the leaderboard.