alvarobartt/Mistral-7B-v0.1-ORPO
Text Generation
Transformers
TensorBoard
Safetensors
alvarobartt/dpo-mix-7k-simplified
argilla/dpo-mix-7k
English
mistral
orpo
qlora
trl
conversational
text-generation-inference
arxiv: 2403.07691
License: apache-2.0
Commit History
Upload MistralForCausalLM · 2144467 · verified · alvarobartt (HF staff) · committed on Mar 23
Update README.md · 97460b7 · verified · alvarobartt (HF staff) · committed on Mar 22
Update README.md · 21eefb4 · verified · alvarobartt (HF staff) · committed on Mar 22
Update README.md · 8749fa0 · verified · alvarobartt (HF staff) · committed on Mar 22
Update README.md · 16f4d77 · verified · alvarobartt (HF staff) · committed on Mar 22
Update README.md · 4856a37 · verified · alvarobartt (HF staff) · committed on Mar 22
Create README.md · d8607aa · verified · alvarobartt (HF staff) · committed on Mar 22
Add `config.json` · 8483f73 · verified · alvarobartt (HF staff) · committed on Mar 22
Training in progress, epoch 2 · 89ccdfd · verified · alvarobartt (HF staff) · committed on Mar 22
Training in progress, epoch 1 · d9394e0 · verified · alvarobartt (HF staff) · committed on Mar 22
Training in progress, epoch 0 · 21038f9 · verified · alvarobartt (HF staff) · committed on Mar 22
initial commit · 2f0aac3 · verified · alvarobartt (HF staff) · committed on Mar 21