Hugging Face
Models
Datasets
Spaces
Posts
Docs
Solutions
Pricing
Log In
Sign Up
qgallouedec
/
online-dpo-qwen2-2
like
0
Text Generation
Transformers
Safetensors
PEFT
dataset_name
qwen2
trl
online-dpo
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
Model card
Files
Files and versions
Community
Train
Deploy
Use this model
751bf01
online-dpo-qwen2-2
Commit History
Training in progress, epoch 3
751bf01
verified
qgallouedec
HF staff
commited on
Sep 25
Training in progress, epoch 1
576d65a
verified
qgallouedec
HF staff
commited on
Sep 25
Training in progress, epoch 3
6f2ee96
verified
qgallouedec
HF staff
commited on
Sep 25
Training in progress, epoch 2
0c898f5
verified
qgallouedec
HF staff
commited on
Sep 25
Training in progress, epoch 1
00dc223
verified
qgallouedec
HF staff
commited on
Sep 25
Update README.md
e4af263
verified
qgallouedec
HF staff
commited on
Sep 25
Update README.md
c16f001
verified
qgallouedec
HF staff
commited on
Sep 25
Update README.md
4cd717a
verified
qgallouedec
HF staff
commited on
Sep 25
End of training
4bc4863
verified
qgallouedec
HF staff
commited on
Sep 25
Training in progress, epoch 3
7bd44d6
verified
qgallouedec
HF staff
commited on
Sep 25
Training in progress, epoch 2
3765841
verified
qgallouedec
HF staff
commited on
Sep 25
Training in progress, epoch 1
23f6dc2
verified
qgallouedec
HF staff
commited on
Sep 25
initial commit
95e3732
verified
qgallouedec
HF staff
commited on
Sep 25