reward-gpt-b6 / checkpoint-4000
bradmin's picture
Training in progress, step 4000, checkpoint
ee4c147