BigCodeBench Leaderboard 🥇 (231): Explore code-generation model leaderboards and task details
Open LLM Leaderboard 🏆 (14k): Track, rank and evaluate open LLMs and chatbots
Open FinLLM Leaderboard 🥇 (136): Explore and compare LLM performance on financial benchmarks