zephyr-7b-dpo-lora-r16-20k / trainer_state.json

Commit History

Model save
32da1fc
verified

LaoRay commited on