XueyingJia/pythia-1b-deduped-hh-online-dpo-full

Generated from Trainer

pythia-1b-deduped-hh-online-dpo-full / tokenizer.json

Training in progress, step 2010

027fc84 verified about 1 month ago

3.56 MB

File too large to display; you can check the raw version instead.