End of training

Files changed (3)

README.md CHANGED

@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the eli5_category dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.7078
+- Loss: 3.7245
 ## Model description
@@ -48,9 +48,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 3.743         | 1.0   | 500  | 3.7099          |
-| 3.7019        | 2.0   | 1000 | 3.7074          |
-| 3.6787        | 3.0   | 1500 | 3.7078          |
+| 3.866         | 1.0   | 500  | 3.7400          |
+| 3.7814        | 2.0   | 1000 | 3.7267          |
+| 3.7498        | 3.0   | 1500 | 3.7245          |
 ### Framework versions

runs/Jul15_21-18-24_b55a46f00eed/events.out.tfevents.1721078348.b55a46f00eed.3144.0 CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:019f9c46080368ea4c97d69d26d9bee68f4a7c1982ea4f68cef5cef93c5af0a9
-size 6333
+oid sha256:93ec3800a1be5ee64f1c6369d9ce1dd1e9e5cc2634d475b82772a509ee7b5dab
+size 6958

runs/Jul15_21-18-24_b55a46f00eed/events.out.tfevents.1721078819.b55a46f00eed.3144.1 ADDED

+version https://git-lfs.github.com/spec/v1
+oid sha256:d8a3189d23343c8a5e001afa14f310764af7d055fe2ada0123aba4fb78b43d68
+size 359