qwen-2.5-3b-grpo-v2 / adapter_model.safetensors

Commit History

Trained with Unsloth
f4b2227
verified

underscore2 commited on