DUAL-GPO/zephyr-7b-gpo-v5-i1
Tags: PEFT, TensorBoard, Safetensors, HuggingFaceH4/ultrafeedback_binarized, mistral, alignment-handbook, Generated from Trainer, trl, dpo
License: mit
Commit History: zephyr-7b-gpo-v5-i1 (branch: main)
Update README.md (40b4f06, verified), lole25 committed on May 7
End of training (60fcc69, verified), lole25 committed on May 7
Model save (301e147, verified), lole25 committed on May 7
Training in progress, step 5800 (efbc022, verified), lole25 committed on May 7
Training in progress, step 5600 (51cee4e, verified), lole25 committed on May 7
Training in progress, step 5200 (e61a6aa, verified), lole25 committed on May 7
Training in progress, step 5100 (5737845, verified), lole25 committed on May 7
Training in progress, step 5000 (1486ee5, verified), lole25 committed on May 7
Training in progress, step 4800 (8a0519c, verified), lole25 committed on May 7
Training in progress, step 4600 (e6b5114, verified), lole25 committed on May 7
Training in progress, step 4500 (82ebb32, verified), lole25 committed on May 7
Training in progress, step 4100 (3d2235e, verified), lole25 committed on May 7
Training in progress, step 4000 (c058472, verified), lole25 committed on May 7
Training in progress, step 3900 (c0d16e3, verified), lole25 committed on May 7
Training in progress, step 3600 (0ae8526, verified), lole25 committed on May 7
Training in progress, step 3300 (7190700, verified), lole25 committed on May 7
Training in progress, step 3100 (1b7c843, verified), lole25 committed on May 6
Training in progress, step 3000 (4f94c2e, verified), lole25 committed on May 6
Training in progress, step 2700 (02a7f4e, verified), lole25 committed on May 6
Training in progress, step 2600 (fe210d7, verified), lole25 committed on May 6
Training in progress, step 2500 (3ec4992, verified), lole25 committed on May 6
Training in progress, step 2300 (452c1e9, verified), lole25 committed on May 6
Training in progress, step 2200 (b5842bf, verified), lole25 committed on May 6
Training in progress, step 2100 (e487bcd, verified), lole25 committed on May 6
Training in progress, step 1500 (eecc287, verified), lole25 committed on May 6
Training in progress, step 1400 (3ea2337, verified), lole25 committed on May 6
Training in progress, step 1100 (78d616f, verified), lole25 committed on May 6
Training in progress, step 1000 (1793ad1, verified), lole25 committed on May 6
Training in progress, step 900 (0117f8d, verified), lole25 committed on May 6
Training in progress, step 800 (7045643, verified), lole25 committed on May 6
Training in progress, step 700 (2022909, verified), lole25 committed on May 6
Training in progress, step 600 (66b1085, verified), lole25 committed on May 6
Training in progress, step 500 (e825179, verified), lole25 committed on May 6
Training in progress, step 400 (eda5d26, verified), lole25 committed on May 6
Training in progress, step 300 (c104cb2, verified), lole25 committed on May 6
Training in progress, step 200 (8e9460c, verified), lole25 committed on May 6
Training in progress, step 100 (9182151, verified), lole25 committed on May 6
initial commit (b8c56fa, verified), lole25 committed on May 6