Goekdeniz-Guelmez
/

Josie-v6-2b-mlx-concept

Text Generation

Model card Files Files and versions Community

Goekdeniz-Guelmez commited on 14 days ago

Commit

48f3adf

·

verified ·

1 Parent(s): 31d78a7

Update README.md

Files changed (1) hide show

README.md +3 -2

README.md CHANGED Viewed

@@ -29,8 +29,9 @@ This is a crude proof of concept (PoC) demonstrating the feasibility of fine-tun
   ```
 - **Training Process:**
-  - First **10K steps** trained using **LoRA** (Low-Rank Adaptation) with **18 layers** selected.
-  - Final **1K steps** trained using **full weight training**.
 ## Hardware Used

   ```
 - **Training Process:**
+  - First **10K steps** trained using **LoRA** (Low-Rank Adaptation) with **22 layers** selected.
+  - Second **1K steps** trained using **full weight training**.
+  - Final **4K steps** ORPO training using **DoRA** with **22 layers** selected.
 ## Hardware Used