Running 18 TravelPlannerLeaderboard: Display and submit evaluation results for travel planning
Running on CPU Upgrade 12.8k Open LLM Leaderboard: Track, rank and evaluate open LLMs and chatbots