phi-2-gpo-ultrafeedback-lora / runs / Mar04_23-03-52_gpu4-119-4

Commit History

Training in progress, step 100
01cfcf2
verified

lole25 committed on