Running • 2.49k • The Ultra-Scale Playbook • The ultimate guide to training LLM on large GPU Clusters
JudgeBench: A Benchmark for Evaluating LLM-based Judges • Paper • 2410.12784 • Published Oct 16, 2024 • 48
Running on CPU Upgrade • 13k • Open LLM Leaderboard • Track, rank and evaluate open LLMs and chatbots