Goekdeniz-Guelmez commited on
Commit
48f3adf
·
verified ·
1 Parent(s): 31d78a7

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +3 -2
README.md CHANGED
@@ -29,8 +29,9 @@ This is a crude proof of concept (PoC) demonstrating the feasibility of fine-tun
29
  ```
30
 
31
  - **Training Process:**
32
- - First **10K steps** trained using **LoRA** (Low-Rank Adaptation) with **18 layers** selected.
33
- - Final **1K steps** trained using **full weight training**.
 
34
 
35
  ## Hardware Used
36
 
 
29
  ```
30
 
31
  - **Training Process:**
32
+ - First **10K steps** trained using **LoRA** (Low-Rank Adaptation) with **22 layers** selected.
33
+ - Second **1K steps** trained using **full weight training**.
34
+ - Final **4K steps** ORPO training using **DoRA** with **22 layers** selected.
35
 
36
  ## Hardware Used
37