Commit
•
4856a37
1
Parent(s):
d8607aa
Update README.md
Browse files
README.md
CHANGED
@@ -3,6 +3,7 @@ license: apache-2.0
|
|
3 |
datasets:
|
4 |
- alvarobartt/dpo-mix-7k-simplified
|
5 |
- argilla/dpo-mix-7k
|
|
|
6 |
language:
|
7 |
- en
|
8 |
library_name: peft
|
@@ -10,6 +11,7 @@ pipeline_tag: text-generation
|
|
10 |
tags:
|
11 |
- orpo
|
12 |
- qlora
|
|
|
13 |
---
|
14 |
|
15 |
## ORPO fine-tune of Mistral 7B v0.1 with DPO Mix 7K
|
|
|
3 |
datasets:
|
4 |
- alvarobartt/dpo-mix-7k-simplified
|
5 |
- argilla/dpo-mix-7k
|
6 |
+
base_model: mistralai/Mistral-7B-v0.1
|
7 |
language:
|
8 |
- en
|
9 |
library_name: peft
|
|
|
11 |
tags:
|
12 |
- orpo
|
13 |
- qlora
|
14 |
+
- trl
|
15 |
---
|
16 |
|
17 |
## ORPO fine-tune of Mistral 7B v0.1 with DPO Mix 7K
|