Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
XueyingJia
/
pythia-1b-online-dpo-HH-merge-rewardmodel-resume
like
0
Transformers
Safetensors
XueyingJia/online_dpo_repo
Generated from Trainer
trl
online-dpo
Inference Endpoints
arxiv:
2402.04792
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
main
pythia-1b-online-dpo-HH-merge-rewardmodel-resume
Commit History
Training in progress, step 1100
86924cd
verified
XueyingJia
commited on
Dec 8, 2024
Training in progress, step 1000
df2852d
verified
XueyingJia
commited on
Dec 8, 2024
Training in progress, step 900
b67639b
verified
XueyingJia
commited on
Dec 8, 2024
Training in progress, step 800
e4da434
verified
XueyingJia
commited on
Dec 8, 2024
Training in progress, step 700
ff6a13b
verified
XueyingJia
commited on
Dec 8, 2024
Training in progress, step 600
0ba13ef
verified
XueyingJia
commited on
Dec 8, 2024
Training in progress, step 500
926e2f4
verified
XueyingJia
commited on
Dec 8, 2024
Training in progress, step 400
ee95181
verified
XueyingJia
commited on
Dec 8, 2024
Training in progress, step 300
37a7a2d
verified
XueyingJia
commited on
Dec 8, 2024
Training in progress, step 200
9bb9731
verified
XueyingJia
commited on
Dec 8, 2024
Training in progress, step 100
8564f6f
verified
XueyingJia
commited on
Dec 8, 2024
End of training
10435a5
verified
XueyingJia
commited on
Dec 8, 2024
Model save
bc0b347
verified
XueyingJia
commited on
Dec 8, 2024
Training in progress, step 4020
995bb26
verified
XueyingJia
commited on
Dec 8, 2024
Training in progress, step 3618
3056768
verified
XueyingJia
commited on
Dec 8, 2024
Training in progress, step 3216
02fa2ef
verified
XueyingJia
commited on
Dec 8, 2024
Training in progress, step 2814
4f46c17
verified
XueyingJia
commited on
Dec 8, 2024
Training in progress, step 2412
8905c8c
verified
XueyingJia
commited on
Dec 8, 2024
Training in progress, step 2010
27ed754
verified
XueyingJia
commited on
Dec 8, 2024
Training in progress, step 1608
936dd91
verified
XueyingJia
commited on
Dec 8, 2024
Training in progress, step 1206
e6ce9c4
verified
XueyingJia
commited on
Dec 8, 2024
Training in progress, step 804
e58153a
verified
XueyingJia
commited on
Dec 8, 2024
Training in progress, step 402
a429f4d
verified
XueyingJia
commited on
Dec 8, 2024
initial commit
1cc3f1f
verified
XueyingJia
commited on
Dec 8, 2024