abhishekchohan
committed on
Commit 750e329 • 1 Parent(s): fed3999
Update README.md
README.md
CHANGED
@@ -9,6 +9,16 @@ language:
 library_name: transformers
 pipeline_tag: text-generation
 ---
+
+
+Introducing Mistral-7B-Forest-DPO, an LLM fine-tuned from the base model mistralai/Mistral-7B-v0.1 using direct preference optimization (DPO).
+This model shows strong performance across a range of natural language processing (NLP) tasks.
+
+A mixture of the following datasets was used for fine-tuning:
+* Intel/orca_dpo_pairs
+* nvidia/HelpSteer
+* jondurbin/truthy-dpo-v0.1
+
 💻 Usage
 
 ```python