ShenaoZ
/

0.001_4iters_bs128_declr_nodpo_useresponse_iter_1

Text Generation

alignment-handbook

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

0.001_4iters_bs128_declr_nodpo_useresponse_iter_1

Commit History

End of training

c031d0d
verified

ShenaoZ commited on Apr 26, 2024

Model save

60e746f
verified

ShenaoZ commited on Apr 26, 2024

Training in progress, step 100

ab68f9f
verified

ShenaoZ commited on Apr 25, 2024

initial commit

521bd4e
verified

ShenaoZ commited on Apr 25, 2024