Update README.md
README.md
CHANGED
@@ -29,8 +29,9 @@ This is a crude proof of concept (PoC) demonstrating the feasibility of fine-tun
 ```
 
 - **Training Process:**
-- First **10K steps** trained using **LoRA** (Low-Rank Adaptation) with **
--
+- First **10K steps** trained using **LoRA** (Low-Rank Adaptation) with **22 layers** selected.
+- Second **1K steps** trained using **full weight training**.
+- Final **4K steps** ORPO training using **DoRA** with **22 layers** selected.
 
 ## Hardware Used
 
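
The three-stage training process added in this change can be written down as plain data, which makes the step arithmetic easy to check. The following is only a minimal sketch: the `Stage` class, its field names, the stage labels, and the `"unspecified"` objective value are illustrative assumptions and do not come from the repository. Only the step counts, the methods (LoRA, full weight training, DoRA with ORPO), and the 22 selected layers are taken from the README text.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass(frozen=True)
class Stage:
    """One phase of the schedule. Field names are hypothetical, not from the repo."""
    name: str
    steps: int
    method: str                            # "lora", "full", or "dora"
    objective: str = "unspecified"         # the README names an objective only for the last stage
    selected_layers: Optional[int] = None  # None: the README states no layer selection


# Values below mirror the three bullets in the README diff.
SCHEDULE: List[Stage] = [
    Stage("stage 1", 10_000, "lora", selected_layers=22),
    Stage("stage 2", 1_000, "full"),
    Stage("stage 3", 4_000, "dora", objective="orpo", selected_layers=22),
]


def stage_boundaries(schedule: List[Stage]) -> List[Tuple[str, int, int]]:
    """Return (name, first_step, end_step_exclusive) for each stage, in order."""
    bounds = []
    start = 0
    for stage in schedule:
        bounds.append((stage.name, start, start + stage.steps))
        start += stage.steps
    return bounds


total_steps = sum(stage.steps for stage in SCHEDULE)

if __name__ == "__main__":
    for name, first, end in stage_boundaries(SCHEDULE):
        print(f"{name}: steps {first} to {end - 1}")
    print(f"total: {total_steps}")
```

Under these assumptions the schedule adds up to 15K steps, with the full weight stage covering steps 10,000 through 10,999 and the ORPO stage starting at step 11,000.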