ikimhope/pythia-1b-tldr-online-dpo-mark-peft

Generated from Trainer

Inference Endpoints

Commit History

End of training

646b0cf
verified

ikimhope committed on Oct 7, 2024

Model save

4eb4ea8
verified

ikimhope committed on Oct 7, 2024

Training in progress, step 684

f1c1be7
verified

ikimhope committed on Oct 7, 2024

Training in progress, step 500

ec18679
verified

ikimhope committed on Oct 6, 2024

initial commit

2616912
verified

ikimhope committed on Oct 6, 2024