Running on CPU Upgrade 69 69 AIR-Bench Leaderboard π₯ Explore benchmark results for QA and long doc models
Running on CPU Upgrade 115 115 Open Chinese LLM Leaderboard π Display and filter LLM benchmark results
Running on CPU Upgrade 12.9k 12.9k Open LLM Leaderboard π Track, rank and evaluate open LLMs and chatbots