Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
ondevicellm
/
zephyr-7b-dpo-full
like
0
Follow
On-Device-LLM
4
Text Generation
Transformers
TensorBoard
Safetensors
HuggingFaceH4/ultrafeedback_binarized
mistral
alignment-handbook
Generated from Trainer
trl
dpo
conversational
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
ce67d23
zephyr-7b-dpo-full
Commit History
End of training
ce67d23
verified
hushell
commited on
Jan 12
Model save
fac6a8d
verified
hushell
commited on
Jan 12
Training in progress, step 900
73940eb
verified
hushell
commited on
Jan 12
Training in progress, step 700
f7491e5
verified
hushell
commited on
Jan 12
Training in progress, step 600
81f9f70
verified
hushell
commited on
Jan 12
Training in progress, step 500
be4b355
verified
hushell
commited on
Jan 12
Training in progress, step 400
9d85750
verified
hushell
commited on
Jan 12
Training in progress, step 300
8cb5bf0
verified
hushell
commited on
Jan 12
Training in progress, step 200
3cbf3af
verified
hushell
commited on
Jan 12
Training in progress, step 100
bb9e240
verified
hushell
commited on
Jan 12
initial commit
96ebc5d
verified
hushell
commited on
Jan 12