NotASI commited on
Commit
0e0ca77
1 Parent(s): ac261b0

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +9 -1
README.md CHANGED
@@ -10,6 +10,8 @@ tags:
10
  - llama
11
  - trl
12
  - sft
 
 
13
  ---
14
 
15
  # Uploaded model
@@ -18,6 +20,12 @@ tags:
18
  - **License:** apache-2.0
19
  - **Finetuned from model :** unsloth/Llama-3.2-1B-Instruct-bnb-4bit
20
 
 
 
 
 
 
 
21
  This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
22
 
23
- [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)
 
10
  - llama
11
  - trl
12
  - sft
13
+ datasets:
14
+ - mlabonne/FineTome-100k
15
  ---
16
 
17
  # Uploaded model
 
20
  - **License:** apache-2.0
21
  - **Finetuned from model :** unsloth/Llama-3.2-1B-Instruct-bnb-4bit
22
 
23
+ # Details
24
+
25
+ This model was trained on **mlabonne/FineTome-100k** for *2* epochs with **rslora** + **qlora**, and achieve the final training loss: *0.796700*.
26
+
27
+ This model follows the same chat template as the base model one.
28
+
29
  This llama model was trained 2x faster with [Unsloth](https://github.com/unslothai/unsloth) and Huggingface's TRL library.
30
 
31
+ [<img src="https://raw.githubusercontent.com/unslothai/unsloth/main/images/unsloth%20made%20with%20love.png" width="200"/>](https://github.com/unslothai/unsloth)