Update README.md
README.md (changed):

````diff
@@ -16,7 +16,7 @@ This is a crude proof of concept (PoC) demonstrating the feasibility of fine-tun
 - **Trained number of Tokens:** ca. 1T
 - **Created by:** Gökdeniz Gülmez
 - **Fine-Tune Dataset:** Offline private dataset
-- **DPO Dataset:** Offline private dataset
+- **DPO/ORPO Dataset:** Offline private dataset
 - **Prompt Template:**
 
 ```text
@@ -44,9 +44,9 @@ This is a crude proof of concept (PoC) demonstrating the feasibility of fine-tun
 - The training process may require significant memory and computational resources despite optimizations.
 - Further work is needed to explore distributed training and mixed-precision techniques for better performance on Apple Silicon.
 
-##
+## ORPO Training
 
-
+ORPO training is not yet available in the official `mlx-examples` repository. To use it, you will need to clone and work from my fork:
 [https://github.com/Goekdeniz-Guelmez/mlx-examples.git](https://github.com/Goekdeniz-Guelmez/mlx-examples.git)
 
 ## Future Improvements
````
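The added ORPO Training section tells readers to work from the fork. A minimal setup sketch follows; the `llms/` subdirectory and the editable install step are assumptions based on how upstream `mlx-examples` is laid out, not details taken from this README:

```shell
# Clone the fork that carries the ORPO training code (URL from the README)
git clone https://github.com/Goekdeniz-Guelmez/mlx-examples.git
cd mlx-examples

# Assumption: as in upstream mlx-examples, the LLM tooling lives under llms/;
# an editable install makes the fork's local ORPO changes importable
pip install -e ./llms
```

From there, training would be invoked with the fork's own scripts rather than the upstream ones, since ORPO support only exists in the fork.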