Math GitBag/gemma-2-9b-it-gsm8k Viewer • Updated Nov 5, 2024 • 7.47k • 15 GitBag/llama-3_1-70b-it-gsm8k Viewer • Updated Nov 5, 2024 • 7.47k • 13 GitBag/gemma-2-27b-it-gsm8k Viewer • Updated Nov 5, 2024 • 7.47k • 18 GitBag/llama-3-8b-it-gsm8k Viewer • Updated Nov 5, 2024 • 7.47k • 17
GitBag/a_star_final_ds-distilled-qwen-1.5b-grpo-2-kl-1e-4-16384_actor Text Generation • Updated about 1 hour ago
GitBag/a_star_final_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-16384_critic Token Classification • Updated about 3 hours ago
GitBag/a_star_final_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-16384_actor Text Generation • Updated about 3 hours ago
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-14336_critic Token Classification • Updated 1 day ago • 27
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-14336_actor Text Generation • Updated 1 day ago • 43
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-16384_critic Token Classification • Updated 2 days ago • 21
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-16384_actor Text Generation • Updated 2 days ago • 37
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-good-1 Text Generation • Updated 2 days ago • 38
GitBag/open_r1_mar2_round_1_tokenized_DeepSeek-R1-Distill-Qwen-1.5B_eval Viewer • Updated about 6 hours ago • 5k
GitBag/open_r1_mar2_round_1_tokenized_DeepSeek-R1-Distill-Qwen-1.5B Viewer • Updated about 17 hours ago • 5k • 29
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-good-1_aime-25_expanded_prompt_0_eval Viewer • Updated 1 day ago • 15.4k • 3
GitBag/block-q-sharp_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-good-1_aime-24_expanded_prompt_0_eval Viewer • Updated 1 day ago • 15.4k • 12
GitBag/DeepSeek-R1-Distill-Qwen-1.5B_hmmt-feb-25_expanded_prompt_0_eval Viewer • Updated 2 days ago • 30 • 19
GitBag/DeepSeek-R1-Distill-Qwen-1.5B_hmmt-feb-24_expanded_prompt_0_eval Viewer • Updated 3 days ago • 30 • 20
GitBag/DeepSeek-R1-Distill-Qwen-1.5B_aime-25_expanded_prompt_0_eval Viewer • Updated 3 days ago • 30 • 18
GitBag/DeepSeek-R1-Distill-Qwen-1.5B_aime-24_expanded_prompt_1_eval Viewer • Updated 3 days ago • 30 • 20