Update README.md
Browse files
README.md
CHANGED
@@ -9,7 +9,7 @@ basemodel: Qwen/Qwen1.5-7B
|
|
9 |
We can't train Qwen2 with Unsloth, even though Qwen2 is popular in community.
|
10 |
|
11 |
It's exciting that we succeed to make Unsloth support Qwen2, it can speed up training and reduce much memory usage.
|
12 |
-
If you want to train Qwen2 with Unsloth, you can use our
|
13 |
|
14 |
Install our Unsloth:
|
15 |
```bash
|
@@ -166,6 +166,7 @@ The table below shows the full set of DPO training metrics:
|
|
166 |
|0.21|400|0.5624|0.7169|0.9575|-0.2447|-1.2022|0.7615|0.7497|-293.8072|-274.4167|
|
167 |
|0.26|500|0.5863|0.7|0.8908|-0.5283|-1.4191|0.537|0.5085|-284.3388|-267.9294|
|
168 |
|0.31|600|0.5612|0.7166|1.0791|-0.592|-1.6711|0.7121|0.7219|-293.2425|-278.5992|
|
|
|
169 |
|0.37|700|0.5741|0.7234|1.0742|-0.8469|-1.9211|0.6002|0.5769|-300.8099|-285.9137|
|
170 |
|0.42|800|0.582|0.7141|1.0414|-1.1658|-2.2072|0.7191|0.5934|-300.458|-286.1|
|
171 |
|0.47|900|0.5694|0.7178|1.2055|-1.7372|-2.9426|0.4226|0.316|-305.5303|-290.7548|
|
|
|
9 |
We can't train Qwen2 with Unsloth, even though Qwen2 is popular in community.
|
10 |
|
11 |
It's exciting that we succeed to make Unsloth support Qwen2, it can speed up training and reduce much memory usage.
|
12 |
+
If you want to train Qwen2 with Unsloth, you can use [our repo](https://github.com/yangjianxin1/unsloth) rather than the official one. And we will commit our code to the [official repo](https://github.com/unslothai/unsloth).
|
13 |
|
14 |
Install our Unsloth:
|
15 |
```bash
|
|
|
166 |
|0.21|400|0.5624|0.7169|0.9575|-0.2447|-1.2022|0.7615|0.7497|-293.8072|-274.4167|
|
167 |
|0.26|500|0.5863|0.7|0.8908|-0.5283|-1.4191|0.537|0.5085|-284.3388|-267.9294|
|
168 |
|0.31|600|0.5612|0.7166|1.0791|-0.592|-1.6711|0.7121|0.7219|-293.2425|-278.5992|
|
169 |
+
|
170 |
|0.37|700|0.5741|0.7234|1.0742|-0.8469|-1.9211|0.6002|0.5769|-300.8099|-285.9137|
|
171 |
|0.42|800|0.582|0.7141|1.0414|-1.1658|-2.2072|0.7191|0.5934|-300.458|-286.1|
|
172 |
|0.47|900|0.5694|0.7178|1.2055|-1.7372|-2.9426|0.4226|0.316|-305.5303|-290.7548|
|