YYYYYYibo
/

nash_dpo_rank4_on_vanilla_iter_1

alignment-handbook

Generated from Trainer

Model card Files Files and versions Community

nash_dpo_rank4_on_vanilla_iter_1 / tokenizer.json

YYYYYYibo's picture

Training in progress, epoch 0

93295ac verified 8 months ago

history contribute delete

1.8 MB

File too large to display, you can check the raw version instead.