llama-3.1-8b-grpo / adapter_model.safetensors

Commit History

Trained with Unsloth
42a3b29
verified

DrishtiSharma commited on