Hugging Face
Models
Datasets
Spaces
Posts
Docs
Enterprise
Pricing
Log In
Sign Up
simonveitner
/
MathHermes-2.5-Mistral-7B
like
1
Text Generation
Transformers
Safetensors
English
mistral
instruct
finetune
chatml
gpt4
synthetic data
distillation
dpo
rlhf
conversational
text-generation-inference
Inference Endpoints
License:
apache-2.0
Model card
Files
Files and versions
Community
1
Train
Deploy
Use this model
simonveitner
commited on
Dec 2, 2023
Commit
31d9ea8
·
1 Parent(s):
6f89370
Update README.md
Browse files
Files changed (1)
hide
show
README.md
+3
-0
README.md
CHANGED
Viewed
@@ -13,4 +13,7 @@ tags:
13
license: apache-2.0
14
language:
15
- en
16
---
13
license: apache-2.0
14
language:
15
- en
16
+
dataset: argilla/distilabel-math-preference-dpo
17
---
18
+
19
+
This model was finetuned with DPO technique.