AIR-Bench Leaderboard (66): Explore benchmark results for QA and long-document models
Open Chinese LLM Leaderboard (109): Display and filter LLM benchmark results
Open LLM Leaderboard (12.6k): Track, rank, and evaluate open LLMs and chatbots