phi-2-gpo-ultrafeedback-lora / adapter_model.safetensors

Commit History

Training in progress, step 100
01cfcf2
verified

lole25 commited on