Llama-2-7b-sft-dpo-10k / last-checkpoint
AmberYifan
Training in progress, epoch 2, checkpoint
2b0ba6e verified