End of training
Browse files- README.md +9 -9
- config.json +2 -2
- model-00001-of-00003.safetensors +1 -1
- model-00002-of-00003.safetensors +1 -1
- model-00003-of-00003.safetensors +1 -1
README.md
CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
|
|
15 |
|
16 |
This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
|
17 |
It achieves the following results on the evaluation set:
|
18 |
-
- Loss: 3.
|
19 |
|
20 |
## Model description
|
21 |
|
@@ -51,14 +51,14 @@ The following hyperparameters were used during training:
|
|
51 |
|
52 |
| Training Loss | Epoch | Step | Validation Loss |
|
53 |
|:-------------:|:-----:|:----:|:---------------:|
|
54 |
-
| 3.
|
55 |
-
| 3.
|
56 |
-
| 3.
|
57 |
-
| 3.
|
58 |
-
| 3.
|
59 |
-
| 3.
|
60 |
-
| 3.
|
61 |
-
| 3.
|
62 |
|
63 |
|
64 |
### Framework versions
|
|
|
15 |
|
16 |
This model is a fine-tuned version of [mistralai/Mistral-7B-v0.1](https://huggingface.co/mistralai/Mistral-7B-v0.1) on the None dataset.
|
17 |
It achieves the following results on the evaluation set:
|
18 |
+
- Loss: 3.1992
|
19 |
|
20 |
## Model description
|
21 |
|
|
|
51 |
|
52 |
| Training Loss | Epoch | Step | Validation Loss |
|
53 |
|:-------------:|:-----:|:----:|:---------------:|
|
54 |
+
| 3.6978 | 0.01 | 25 | 2.3995 |
|
55 |
+
| 3.651 | 0.02 | 50 | 2.3858 |
|
56 |
+
| 3.6755 | 0.02 | 75 | 2.3870 |
|
57 |
+
| 3.5904 | 0.03 | 100 | 2.4495 |
|
58 |
+
| 3.5636 | 0.04 | 125 | 2.4981 |
|
59 |
+
| 3.4337 | 0.05 | 150 | 2.5197 |
|
60 |
+
| 3.3356 | 0.06 | 175 | 2.5215 |
|
61 |
+
| 3.3829 | 0.06 | 200 | 2.5234 |
|
62 |
|
63 |
|
64 |
### Framework versions
|
config.json
CHANGED
@@ -29,7 +29,7 @@
|
|
29 |
0.10732196271419525,
|
30 |
0.12738214433193207,
|
31 |
0.1414242684841156,
|
32 |
-
0.
|
33 |
0.16349045932292938,
|
34 |
0.1675025075674057,
|
35 |
0.1675025075674057,
|
@@ -37,7 +37,7 @@
|
|
37 |
0.1735205501317978,
|
38 |
0.17552657425403595,
|
39 |
0.1775325983762741,
|
40 |
-
0.
|
41 |
0.1935807317495346,
|
42 |
0.19759276509284973,
|
43 |
0.21364091336727142,
|
|
|
29 |
0.10732196271419525,
|
30 |
0.12738214433193207,
|
31 |
0.1414242684841156,
|
32 |
+
0.15546639263629913,
|
33 |
0.16349045932292938,
|
34 |
0.1675025075674057,
|
35 |
0.1675025075674057,
|
|
|
37 |
0.1735205501317978,
|
38 |
0.17552657425403595,
|
39 |
0.1775325983762741,
|
40 |
+
0.18956869840621948,
|
41 |
0.1935807317495346,
|
42 |
0.19759276509284973,
|
43 |
0.21364091336727142,
|
model-00001-of-00003.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4943162336
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:266a466bbb9e54f96f47eef2c06d2735b44e8aa8ec13e8edec7a83439f5efa72
|
3 |
size 4943162336
|
model-00002-of-00003.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4999819336
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:5ef52d7f7c0a3c78947311fed9515046829e339d6ecd46f23fd3eacf34148535
|
3 |
size 4999819336
|
model-00003-of-00003.safetensors
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 4540516344
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:a912fda040ea3a959e05fee84c57ca990ed702145ce81691ec25877ce5680644
|
3 |
size 4540516344
|