abhishekchohan/mistral-7B-forest-dpo


abhishekchohan committed on Feb 14, 2024

Commit 750e329 · verified · 1 Parent(s): fed3999

Update README.md

Files changed (1)

README.md +10 -0


@@ -9,6 +9,16 @@ language:
 library_name: transformers
 pipeline_tag: text-generation
 ---
+Mistral-7B-Forest-DPO is an LLM fine-tuned from the base model mistralai/Mistral-7B-v0.1 using direct preference optimization (DPO).
+It is intended for a broad range of natural language processing (NLP) tasks.
+A mixture of the following datasets was used for fine-tuning:
+* Intel/orca_dpo_pairs
+* nvidia/HelpSteer
+* jondurbin/truthy-dpo-v0.1
 💻 Usage
 ```python