Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
DUAL-GPO
/
zephyr-7b-gpo-v8-i1
like
0
Follow
DUAL Group
2
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
trl
dpo
License:
mit
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
main
zephyr-7b-gpo-v8-i1
Commit History
End of training
1895846
verified
lole25
commited on
May 8
Model save
f90326a
verified
lole25
commited on
May 8
Training in progress, step 8300
55bbcdb
verified
lole25
commited on
May 8
Training in progress, step 8200
8d73f2b
verified
lole25
commited on
May 8
Training in progress, step 8100
e741d79
verified
lole25
commited on
May 8
Training in progress, step 8000
5e95aad
verified
lole25
commited on
May 8
Training in progress, step 7800
9ce6df8
verified
lole25
commited on
May 8
Training in progress, step 7700
7149205
verified
lole25
commited on
May 8
Training in progress, step 7400
86c3b2d
verified
lole25
commited on
May 8
Training in progress, step 7300
11ebd30
verified
lole25
commited on
May 8
Training in progress, step 7200
e77a7ab
verified
lole25
commited on
May 8
Training in progress, step 7100
791406d
verified
lole25
commited on
May 8
Training in progress, step 7000
38b89c2
verified
lole25
commited on
May 8
Training in progress, step 6900
50dfad0
verified
lole25
commited on
May 8
Training in progress, step 6800
4ad5e24
verified
lole25
commited on
May 8
Training in progress, step 6700
e27d8f3
verified
lole25
commited on
May 8
Training in progress, step 6600
45dd116
verified
lole25
commited on
May 8
Training in progress, step 6500
cf5faf3
verified
lole25
commited on
May 8
Training in progress, step 6300
8dc88a5
verified
lole25
commited on
May 8
Training in progress, step 6100
dc5c975
verified
lole25
commited on
May 8
Training in progress, step 5900
c873646
verified
lole25
commited on
May 8
Training in progress, step 5700
c86d3ac
verified
lole25
commited on
May 8
Training in progress, step 5500
d312c0e
verified
lole25
commited on
May 8
Training in progress, step 5400
01d76a9
verified
lole25
commited on
May 8
Training in progress, step 5300
2fc8ec5
verified
lole25
commited on
May 8
Training in progress, step 5200
89e3ef6
verified
lole25
commited on
May 8
Training in progress, step 5000
2f6c224
verified
lole25
commited on
May 8
Training in progress, step 4900
24461d4
verified
lole25
commited on
May 8
Training in progress, step 4800
abdbcd2
verified
lole25
commited on
May 8
Training in progress, step 4700
dc2320f
verified
lole25
commited on
May 8
Training in progress, step 4600
1d42b34
verified
lole25
commited on
May 8
Training in progress, step 4500
3432817
verified
lole25
commited on
May 8
Training in progress, step 4400
c9d56dd
verified
lole25
commited on
May 8
Training in progress, step 4300
97cb520
verified
lole25
commited on
May 8
Training in progress, step 4200
6ac3e59
verified
lole25
commited on
May 8
Training in progress, step 4100
89f41a6
verified
lole25
commited on
May 8
Training in progress, step 4000
48760eb
verified
lole25
commited on
May 8
Training in progress, step 3900
e799e19
verified
lole25
commited on
May 8
Training in progress, step 3800
0e660de
verified
lole25
commited on
May 8
Training in progress, step 3700
ed784d0
verified
lole25
commited on
May 8
Training in progress, step 3600
3eb4e17
verified
lole25
commited on
May 8
Training in progress, step 3500
5acb19d
verified
lole25
commited on
May 8
Training in progress, step 3300
e5067f8
verified
lole25
commited on
May 8
Training in progress, step 3200
85fc8b5
verified
lole25
commited on
May 8
Training in progress, step 3100
77dd4d7
verified
lole25
commited on
May 8
Training in progress, step 3000
5312189
verified
lole25
commited on
May 8
Training in progress, step 2900
785428e
verified
lole25
commited on
May 8
Training in progress, step 2800
8f43949
verified
lole25
commited on
May 8
Training in progress, step 2600
cf280bd
verified
lole25
commited on
May 8
Training in progress, step 2500
fd9b7e9
verified
lole25
commited on
May 7
Previous
1
2
Next