phi-4-14B-grpo-gsm8k-3e / model-00002-of-00006.safetensors

Commit History

Trained with Unsloth
d8874d5

mrm8488 committed on