two_agent_rdpo_iter_2 / config.json

Commit History

RDPO-7b-beta0.01-eta0.001
96ac2a4
verified

YYYYYYibo committed on

Training in progress, step 100
e9d1149
verified

YYYYYYibo committed on