Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
DUAL-GPO
/
zephyr-7b-gpo-update4-i0
like
0
Follow
DUAL Group
2
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
trl
dpo
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
main
zephyr-7b-gpo-update4-i0
Commit History
Training in progress, step 4900
2061cf8
verified
lole25
commited on
Apr 5
Training in progress, step 4800
ad20a90
verified
lole25
commited on
Apr 5
Training in progress, step 4700
abb82da
verified
lole25
commited on
Apr 5
Training in progress, step 4600
41126a0
verified
lole25
commited on
Apr 5
Training in progress, step 4500
efd2695
verified
lole25
commited on
Apr 5
Training in progress, step 4300
8a1353f
verified
lole25
commited on
Apr 5
Training in progress, step 4200
fe25be0
verified
lole25
commited on
Apr 5
Training in progress, step 4100
d950d1c
verified
lole25
commited on
Apr 5
Training in progress, step 4000
4b92193
verified
lole25
commited on
Apr 5
Training in progress, step 3900
ef78500
verified
lole25
commited on
Apr 5
Training in progress, step 3800
c6b8be7
verified
lole25
commited on
Apr 5
Training in progress, step 3700
d26944a
verified
lole25
commited on
Apr 5
Training in progress, step 3600
b49d502
verified
lole25
commited on
Apr 5
Training in progress, step 3500
8243ac1
verified
lole25
commited on
Apr 5
Training in progress, step 3400
70dda2f
verified
lole25
commited on
Apr 5
Training in progress, step 3300
7f453c8
verified
lole25
commited on
Apr 5
Training in progress, step 3200
4baee8a
verified
lole25
commited on
Apr 5
Training in progress, step 3100
7044df5
verified
lole25
commited on
Apr 5
Training in progress, step 3000
0082d6e
verified
lole25
commited on
Apr 5
Training in progress, step 2900
9efec30
verified
lole25
commited on
Apr 5
Training in progress, step 2800
f947381
verified
lole25
commited on
Apr 5
Training in progress, step 2700
b445085
verified
lole25
commited on
Apr 4
Training in progress, step 2600
bd7e55e
verified
lole25
commited on
Apr 4
Training in progress, step 2500
1f606da
verified
lole25
commited on
Apr 4
Training in progress, step 2400
5bb5147
verified
lole25
commited on
Apr 4
Training in progress, step 2300
f274025
verified
lole25
commited on
Apr 4
Training in progress, step 2200
2ceafcf
verified
lole25
commited on
Apr 4
Training in progress, step 2100
9f7194e
verified
lole25
commited on
Apr 4
Training in progress, step 2000
0687175
verified
lole25
commited on
Apr 4
Training in progress, step 1900
9f82946
verified
lole25
commited on
Apr 4
Training in progress, step 1800
e05132d
verified
lole25
commited on
Apr 4
Training in progress, step 1700
b270776
verified
lole25
commited on
Apr 4
Training in progress, step 1600
5218d5a
verified
lole25
commited on
Apr 4
Training in progress, step 1500
dc156f3
verified
lole25
commited on
Apr 4
Training in progress, step 1400
774f160
verified
lole25
commited on
Apr 4
Training in progress, step 1300
b88037a
verified
lole25
commited on
Apr 4
Training in progress, step 1200
649026c
verified
lole25
commited on
Apr 4
Training in progress, step 1100
e377a54
verified
lole25
commited on
Apr 4
Training in progress, step 1000
9cb5798
verified
lole25
commited on
Apr 4
Training in progress, step 900
954eb28
verified
lole25
commited on
Apr 4
Training in progress, step 800
305ccdc
verified
lole25
commited on
Apr 4
Training in progress, step 700
63d840a
verified
lole25
commited on
Apr 4
Training in progress, step 500
8bfc650
verified
lole25
commited on
Apr 4
Training in progress, step 400
5a60c98
verified
lole25
commited on
Apr 4
Training in progress, step 300
95b30f6
verified
lole25
commited on
Apr 4
Training in progress, step 200
99aefcd
verified
lole25
commited on
Apr 4
Training in progress, step 100
e81b3cb
verified
lole25
commited on
Apr 4
initial commit
d19d40b
verified
lole25
commited on
Apr 4
Previous
1
2
3
Next