Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
Haleshot
/
Mathmate-7B-DELLA-ORPO
like
0
Safetensors
argilla/distilabel-math-preference-dpo
llama
finetuned
orpo
math
preference-learning
License:
apache-2.0
Model card
Files
Files and versions
Community
Train
main
Mathmate-7B-DELLA-ORPO
/
README.md
Commit History
Update README.md
81533a4
verified
Haleshot
commited on
Sep 29, 2024
Update README.md
80cab8f
verified
Haleshot
commited on
Sep 23, 2024
Create README.md
90127cd
verified
Haleshot
commited on
Sep 23, 2024