Running 122 Berkeley Function Calling Leaderboard 🏃 122 Compare AI model performance on function calling tasks
Vikhrmodels/Qwen2.5-7B-Instruct-Tool-Planning-v0.1 Text Generation • 8B • Updated Feb 19, 2025 • 27 • • 13
docling-project/SmolDocling-256M-preview Image-Text-to-Text • 0.3B • Updated Sep 17, 2025 • 36.3k • 1.61k
InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU Paper • 2502.08910 • Published Feb 13, 2025 • 148
deepseek-ai/DeepSeek-R1-Distill-Qwen-32B Text Generation • 33B • Updated Feb 24, 2025 • 1.85M • • 1.51k