Math GitBag/gemma-2-9b-it-gsm8k Viewer • Updated Nov 5, 2024 • 7.47k • 54 GitBag/llama-3_1-70b-it-gsm8k Viewer • Updated Nov 5, 2024 • 7.47k • 39 GitBag/gemma-2-27b-it-gsm8k Viewer • Updated Nov 5, 2024 • 7.47k • 44 GitBag/llama-3-8b-it-gsm8k Viewer • Updated Nov 5, 2024 • 7.47k • 37
Math GitBag/gemma-2-9b-it-gsm8k Viewer • Updated Nov 5, 2024 • 7.47k • 54 GitBag/llama-3_1-70b-it-gsm8k Viewer • Updated Nov 5, 2024 • 7.47k • 39 GitBag/gemma-2-27b-it-gsm8k Viewer • Updated Nov 5, 2024 • 7.47k • 44 GitBag/llama-3-8b-it-gsm8k Viewer • Updated Nov 5, 2024 • 7.47k • 37
GitBag/a_star_final_ds-distilled-qwen-1.5b-a-star-16384_actor Text Generation • Updated about 1 month ago • 25
GitBag/a_star_final_ds-distilled-qwen-1.5b-grpo-2-kl-1e-4-16384_actor Text Generation • Updated about 1 month ago • 33
GitBag/a_star_final_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-16384_critic Token Classification • Updated May 12 • 7
GitBag/a_star_final_ds-distilled-qwen-1.5b-ppo-kl-1e-4-ec-0.001-16384_actor Text Generation • Updated May 12 • 41