Llama-2-7b-sft-dpo-10k / last-checkpoint
AmberYifan's picture
Training in progress, epoch 2, checkpoint
baa0469 verified