reward-gpt-b6 / checkpoint-3500
bradmin's picture
Training in progress, step 3500, checkpoint
77b5f70