Running 98 Unlocking On-Policy Distillation for Any Model Family π 98 Visualize on-policy distillation for any model family
Running Agents 6 Dataset Length Profiler π 6 Estimate optimal max_length for SFT training with token analysis
Running 3.82k The Ultra-Scale Playbook π 3.82k The ultimate guide to training LLM on large GPU Clusters
Running Agents 88 Large Reasoning Models Leaderboard π³ 88 A leaderboard to rank large reasoning models
Running 596 Scaling test-time compute π 596 Run advanced search strategies to boost LLM problem solving
HuggingFaceH4/zephyr-7b-alpha Text Generation β’ 7B β’ Updated Oct 16, 2024 β’ 5.43k β’ β’ 1.12k
Runtime error Agents 103 Huggingface Leaderboard π 103 Generate Hugging Face author leaderboards and stats