OSQ Leaderboard
Display LLM leaderboard data
We introduce the Open-LLM-Leaderboard, which tracks the performance of various LLMs on open-style questions to better reflect their true capability. You can use the OSQ-bench questions and prompts to evaluate your own models automatically with an LLM-based evaluator.
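As a rough illustration of what automatic evaluation with an LLM-based evaluator can look like, the sketch below scores model answers to open-style questions by asking a judge for a Correct/Incorrect verdict. Everything in it is an assumption made for illustration: the prompt template, the record fields (`question`, `reference`, `answer`), and the helper names are hypothetical and are not the actual OSQ-bench prompts or code. The `judge` argument stands for any function that sends a prompt to an LLM and returns its reply; a simple non-LLM stand-in is included so the sketch runs locally.

```python
from typing import Callable, Dict, Iterable

# Hypothetical judge prompt; the real OSQ-bench evaluator prompt may differ.
JUDGE_TEMPLATE = (
    "Question: {question}\n"
    "Reference answer: {reference}\n"
    "Model answer: {answer}\n"
    "Is the model answer correct? Reply with exactly 'Correct' or 'Incorrect'."
)


def build_judge_prompt(question: str, reference: str, answer: str) -> str:
    """Fill the (assumed) template for one open-style question."""
    return JUDGE_TEMPLATE.format(
        question=question, reference=reference, answer=answer
    )


def parse_verdict(reply: str) -> bool:
    """Return True only if the judge reply begins with 'correct'."""
    return reply.strip().lower().startswith("correct")


def accuracy(
    records: Iterable[Dict[str, str]], judge: Callable[[str], str]
) -> float:
    """Fraction of records the judge marks as correct (0.0 if empty)."""
    items = list(records)
    if not items:
        return 0.0
    correct = 0
    for r in items:
        prompt = build_judge_prompt(r["question"], r["reference"], r["answer"])
        if parse_verdict(judge(prompt)):
            correct += 1
    return correct / len(items)


def exact_match_judge(prompt: str) -> str:
    """Stand-in judge for local testing; NOT an LLM.

    It reads the first three 'Label: value' lines of the prompt and
    compares the reference and model answers case-insensitively.
    """
    fields = dict(line.split(": ", 1) for line in prompt.splitlines()[:3])
    same = (
        fields["Reference answer"].strip().lower()
        == fields["Model answer"].strip().lower()
    )
    return "Correct" if same else "Incorrect"
```

To use a real evaluator, replace `exact_match_judge` with a function that calls your LLM of choice and returns its text reply; the scoring loop stays the same.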