Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
pbevan11
/
Mistral-Nemo-MCAI-SFT-DPO
like
0
Text Generation
Transformers
TensorBoard
Safetensors
pbevan11/multilingual-constitutional-preference-pairs
pbevan11/ultrafeedback_binarized_multilingual
mistral
alignment-handbook
trl
dpo
Generated from Trainer
conversational
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Metrics
Training metrics
Community
Train
Deploy
Use this model
0736e78
Mistral-Nemo-MCAI-SFT-DPO
/
runs
/
Sep30_15-39-35_280ca1cd997c
1 contributor
History:
1 commit
pbevan11
Training in progress, step 83
93ba0d6
verified
4 months ago
events.out.tfevents.1727711310.280ca1cd997c.3414.0
Safe
12.7 kB
LFS
Training in progress, step 83
4 months ago