Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
DUAL-GPO
/
zephyr-7b-gpo-update3-i0
like
0
Follow
DUAL Group
2
PEFT
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
trl
dpo
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Use this model
ea7652f
zephyr-7b-gpo-update3-i0
Commit History
Training in progress, step 4900
ad1e980
verified
lole25
commited on
Apr 5
Training in progress, step 4800
0c3e875
verified
lole25
commited on
Apr 5
Training in progress, step 4700
41ef24a
verified
lole25
commited on
Apr 5
Training in progress, step 4600
7ad4d7c
verified
lole25
commited on
Apr 5
Training in progress, step 4500
3fdbb2d
verified
lole25
commited on
Apr 5
Training in progress, step 4400
2ceb864
verified
lole25
commited on
Apr 5
Training in progress, step 4300
539f571
verified
lole25
commited on
Apr 5
Training in progress, step 4200
9d33e59
verified
lole25
commited on
Apr 5
Training in progress, step 4000
43a5da4
verified
lole25
commited on
Apr 5
Training in progress, step 3900
c823e57
verified
lole25
commited on
Apr 5
Training in progress, step 3800
6ccbfd9
verified
lole25
commited on
Apr 5
Training in progress, step 3700
c8b95f5
verified
lole25
commited on
Apr 5
Training in progress, step 3600
b1acd7d
verified
lole25
commited on
Apr 5
Training in progress, step 3500
a01cdb4
verified
lole25
commited on
Apr 5
Training in progress, step 3400
7ee347f
verified
lole25
commited on
Apr 5
Training in progress, step 3200
6922a52
verified
lole25
commited on
Apr 5
Training in progress, step 3100
3bafca5
verified
lole25
commited on
Apr 5
Training in progress, step 2900
582da8e
verified
lole25
commited on
Apr 5
Training in progress, step 2800
12c5f60
verified
lole25
commited on
Apr 5
Training in progress, step 2700
a1dfc64
verified
lole25
commited on
Apr 4
Training in progress, step 2600
555c3eb
verified
lole25
commited on
Apr 4
Training in progress, step 2500
44c2ec9
verified
lole25
commited on
Apr 4
Training in progress, step 2400
469f57f
verified
lole25
commited on
Apr 4
Training in progress, step 2300
986610c
verified
lole25
commited on
Apr 4
Training in progress, step 2200
8930472
verified
lole25
commited on
Apr 4
Training in progress, step 2100
b877df6
verified
lole25
commited on
Apr 4
Training in progress, step 2000
88515de
verified
lole25
commited on
Apr 4
Training in progress, step 1900
0d346f5
verified
lole25
commited on
Apr 4
Training in progress, step 1800
c879b0a
verified
lole25
commited on
Apr 4
Training in progress, step 1700
0462095
verified
lole25
commited on
Apr 4
Training in progress, step 1600
4218669
verified
lole25
commited on
Apr 4
Training in progress, step 1500
218367d
verified
lole25
commited on
Apr 4
Training in progress, step 1400
b4413cc
verified
lole25
commited on
Apr 4
Training in progress, step 1300
ab1e891
verified
lole25
commited on
Apr 4
Training in progress, step 1200
8a348d5
verified
lole25
commited on
Apr 4
Training in progress, step 1000
82fca72
verified
lole25
commited on
Apr 4
Training in progress, step 900
5ee9430
verified
lole25
commited on
Apr 4
Training in progress, step 800
4f50e9c
verified
lole25
commited on
Apr 4
Training in progress, step 700
4f57f65
verified
lole25
commited on
Apr 4
Training in progress, step 600
42c4d3e
verified
lole25
commited on
Apr 4
Training in progress, step 500
665649f
verified
lole25
commited on
Apr 4
Training in progress, step 400
f201a30
verified
lole25
commited on
Apr 4
Training in progress, step 300
6f54be4
verified
lole25
commited on
Apr 4
Training in progress, step 200
583e0ab
verified
lole25
commited on
Apr 4
Training in progress, step 100
0deff58
verified
lole25
commited on
Apr 4
initial commit
0759bc3
verified
lole25
commited on
Apr 4
Previous
1
2
3
Next