Benchmarks - a hppdqdq Collection

Benchmarks

updated 11 days ago

🥇 MMLU Pro
More advanced and challenging multi-task evaluation
Status: Running on CPU Upgrade. Likes: 181.

🎭 Stick To Your Role! Leaderboard
Status: Running. Likes: 33.

📊 ZeroEval Leaderboard
Status: Running. Likes: 50.

🥇 Decentralized Arena Leaderboard
Status: Running. Likes: 23.

🥇 Open Medical-LLM Leaderboard
Status: Running on CPU Upgrade. Likes: 323.

🏆 GPU Poor LLM Arena
Compact LLM Battle Arena: Frugal AI Face-Off!
Status: Running. Likes: 168.

🌎 Open VLM Video Leaderboard
VLMEvalKit Eval Results in video understanding benchmark
Status: Running. Likes: 96.

🏆 Open LLM Leaderboard
Track, rank and evaluate open LLMs and chatbots
Status: Running on CPU Upgrade. Likes: 12.3k.

🤗🏆 TTS Spaces Arena
Vote on the top HF TTS models!
Status: Running on Zero. Likes: 260.
