End of training

Files changed (3) hide show

README.md CHANGED Viewed

@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the eli5_category dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.7140
 ## Model description
@@ -48,9 +48,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 3.8682        | 1.0   | 500  | 3.7314          |
-| 3.7854        | 2.0   | 1000 | 3.7164          |
-| 3.7538        | 3.0   | 1500 | 3.7140          |
 ### Framework versions

 This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the eli5_category dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.7078
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 3.743         | 1.0   | 500  | 3.7099          |
+| 3.7019        | 2.0   | 1000 | 3.7074          |
+| 3.6787        | 3.0   | 1500 | 3.7078          |
 ### Framework versions

runs/Jul15_20-08-42_600aa654b321/events.out.tfevents.1721074487.600aa654b321.2882.1 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:558a361f431286c7ea5089f6cbd4a78d1bfa3d939a83f159b2ebc29a31ab584e
-size 6632

 version https://git-lfs.github.com/spec/v1
+oid sha256:3bab0d5c3cc369d415f320208b2045913d97ea1c01fc998f7a7a68572aa99cab
+size 7257

runs/Jul15_20-08-42_600aa654b321/events.out.tfevents.1721075553.600aa654b321.2882.2 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:87f0e97bfd21a369dc9e093074e21eea6b2b911f2887094208bdc8157be5bf43
+size 359