Fine-tuned GPT-2 on Wikitext-2

Files changed (4) hide show

README.md CHANGED Viewed

@@ -1,6 +1,6 @@
 ---
 license: mit
-base_model: gpt2
 tags:
 - generated_from_trainer
 model-index:
@@ -13,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
 # orion
-This model is a fine-tuned version of [gpt2](https://huggingface.co/gpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.8502
 ## Model description
@@ -48,17 +48,17 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
-| No log        | 0.0871 | 400  | 3.0915          |
-| 3.6109        | 0.1743 | 800  | 2.9917          |
-| 3.2874        | 0.2614 | 1200 | 2.9542          |
-| 3.1807        | 0.3486 | 1600 | 2.9252          |
-| 3.1763        | 0.4357 | 2000 | 2.9056          |
-| 3.1763        | 0.5229 | 2400 | 2.8900          |
-| 3.1536        | 0.6100 | 2800 | 2.8740          |
-| 3.0856        | 0.6972 | 3200 | 2.8683          |
-| 3.1129        | 0.7843 | 3600 | 2.8619          |
-| 3.0838        | 0.8715 | 4000 | 2.8546          |
-| 3.0838        | 0.9586 | 4400 | 2.8511          |
 ### Framework versions

 ---
 license: mit
+base_model: cuba6112/orion
 tags:
 - generated_from_trainer
 model-index:
 # orion
+This model is a fine-tuned version of [cuba6112/orion](https://huggingface.co/cuba6112/orion) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.8471
 ## Model description
 | Training Loss | Epoch  | Step | Validation Loss |
 |:-------------:|:------:|:----:|:---------------:|
+| No log        | 0.0871 | 400  | 2.8882          |
+| 2.9006        | 0.1743 | 800  | 2.9229          |
+| 2.6909        | 0.2614 | 1200 | 2.9341          |
+| 2.6634        | 0.3486 | 1600 | 2.9170          |
+| 2.769         | 0.4357 | 2000 | 2.9012          |
+| 2.769         | 0.5229 | 2400 | 2.8874          |
+| 2.8258        | 0.6100 | 2800 | 2.8755          |
+| 2.8313        | 0.6972 | 3200 | 2.8689          |
+| 2.9336        | 0.7843 | 3600 | 2.8605          |
+| 2.9614        | 0.8715 | 4000 | 2.8522          |
+| 2.9614        | 0.9586 | 4400 | 2.8481          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c43959029126687848a7100b875f51ffb2e75fafa591b12acd4af6da48869d73
 size 497774208

 version https://git-lfs.github.com/spec/v1
+oid sha256:a1f30a820e2f6e6b63f81d908c76eb0e7944df8fcf3f84c50e260fa555559199
 size 497774208

runs/Jun30_18-04-46_Delta6112/events.out.tfevents.1719785087.Delta6112.24560.2 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:258ca1070dd771014049f438704001c7be80f942cea604fcafb5cb69c5b9d206
-size 9361

 version https://git-lfs.github.com/spec/v1
+oid sha256:d701666be8c6647668bdcf64dc2e0898787897780c0d2fb6c0423f6e54eb85b5
+size 10197

runs/Jun30_18-04-46_Delta6112/events.out.tfevents.1719785933.Delta6112.24560.3 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:ad5a4f2d5cdbb21003b96a85609b62152cd72e345181f8f58955fe6d7db419cc
+size 359