cuba6112 commited on
Commit
64e99f1
1 Parent(s): 80647e8

Fine-tuned GPT-2 on Wikitext-2

Browse files
README.md CHANGED
@@ -1,6 +1,6 @@
1
  ---
2
  license: mit
3
- base_model: gpt2
4
  tags:
5
  - generated_from_trainer
6
  model-index:
@@ -13,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
13
 
14
  # orion
15
 
16
- This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 2.8502
19
 
20
  ## Model description
21
 
@@ -48,17 +48,17 @@ The following hyperparameters were used during training:
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:------:|:----:|:---------------:|
51
- | No log | 0.0871 | 400 | 3.0915 |
52
- | 3.6109 | 0.1743 | 800 | 2.9917 |
53
- | 3.2874 | 0.2614 | 1200 | 2.9542 |
54
- | 3.1807 | 0.3486 | 1600 | 2.9252 |
55
- | 3.1763 | 0.4357 | 2000 | 2.9056 |
56
- | 3.1763 | 0.5229 | 2400 | 2.8900 |
57
- | 3.1536 | 0.6100 | 2800 | 2.8740 |
58
- | 3.0856 | 0.6972 | 3200 | 2.8683 |
59
- | 3.1129 | 0.7843 | 3600 | 2.8619 |
60
- | 3.0838 | 0.8715 | 4000 | 2.8546 |
61
- | 3.0838 | 0.9586 | 4400 | 2.8511 |
62
 
63
 
64
  ### Framework versions
 
1
  ---
2
  license: mit
3
+ base_model: cuba6112/orion
4
  tags:
5
  - generated_from_trainer
6
  model-index:
 
13
 
14
  # orion
15
 
16
+ This model is a fine-tuned version of [cuba6112/orion](https://huggingface.co/cuba6112/orion) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 2.8471
19
 
20
  ## Model description
21
 
 
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:------:|:----:|:---------------:|
51
+ | No log | 0.0871 | 400 | 2.8882 |
52
+ | 2.9006 | 0.1743 | 800 | 2.9229 |
53
+ | 2.6909 | 0.2614 | 1200 | 2.9341 |
54
+ | 2.6634 | 0.3486 | 1600 | 2.9170 |
55
+ | 2.769 | 0.4357 | 2000 | 2.9012 |
56
+ | 2.769 | 0.5229 | 2400 | 2.8874 |
57
+ | 2.8258 | 0.6100 | 2800 | 2.8755 |
58
+ | 2.8313 | 0.6972 | 3200 | 2.8689 |
59
+ | 2.9336 | 0.7843 | 3600 | 2.8605 |
60
+ | 2.9614 | 0.8715 | 4000 | 2.8522 |
61
+ | 2.9614 | 0.9586 | 4400 | 2.8481 |
62
 
63
 
64
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c43959029126687848a7100b875f51ffb2e75fafa591b12acd4af6da48869d73
3
  size 497774208
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a1f30a820e2f6e6b63f81d908c76eb0e7944df8fcf3f84c50e260fa555559199
3
  size 497774208
runs/Jun30_18-04-46_Delta6112/events.out.tfevents.1719785087.Delta6112.24560.2 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:258ca1070dd771014049f438704001c7be80f942cea604fcafb5cb69c5b9d206
3
- size 9361
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d701666be8c6647668bdcf64dc2e0898787897780c0d2fb6c0423f6e54eb85b5
3
+ size 10197
runs/Jun30_18-04-46_Delta6112/events.out.tfevents.1719785933.Delta6112.24560.3 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ad5a4f2d5cdbb21003b96a85609b62152cd72e345181f8f58955fe6d7db419cc
3
+ size 359