lewtun/gemma-7b-dpo-full-ultrafeedback-beta-0.01

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Inference Endpoints

gemma-7b-dpo-full-ultrafeedback-beta-0.01/runs/Feb29_22-08-46_ip-26-0-161-178

1 contributor

History: 6 commits

Latest commit: lewtun (HF staff), "End of training", 099ea17 (verified), 8 months ago

File | Size | Storage | Last commit | Updated
events.out.tfevents.1709244785.ip-26-0-161-178.1167714.0 | 41.1 kB | LFS | Model save | 8 months ago
events.out.tfevents.1709250301.ip-26-0-161-178.1167714.1 | 828 Bytes | LFS | End of training | 8 months ago