zephyr-7b-dpo-full-magpi-reward-scale-1 / special_tokens_map.json

Commit History

Training in progress, step 200
7165d2a
verified

sfulay committed on