Pyro-X2 commited on
Commit
cb3701e
1 Parent(s): bff66bc

End of training

Browse files
README.md CHANGED
@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
17
 
18
  This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the eli5_category dataset.
19
  It achieves the following results on the evaluation set:
20
- - Loss: 3.7140
21
 
22
  ## Model description
23
 
@@ -48,9 +48,9 @@ The following hyperparameters were used during training:
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:-----:|:----:|:---------------:|
51
- | 3.8682 | 1.0 | 500 | 3.7314 |
52
- | 3.7854 | 2.0 | 1000 | 3.7164 |
53
- | 3.7538 | 3.0 | 1500 | 3.7140 |
54
 
55
 
56
  ### Framework versions
 
17
 
18
  This model is a fine-tuned version of [distilbert/distilgpt2](https://huggingface.co/distilbert/distilgpt2) on the eli5_category dataset.
19
  It achieves the following results on the evaluation set:
20
+ - Loss: 3.7078
21
 
22
  ## Model description
23
 
 
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:-----:|:----:|:---------------:|
51
+ | 3.743 | 1.0 | 500 | 3.7099 |
52
+ | 3.7019 | 2.0 | 1000 | 3.7074 |
53
+ | 3.6787 | 3.0 | 1500 | 3.7078 |
54
 
55
 
56
  ### Framework versions
runs/Jul15_20-08-42_600aa654b321/events.out.tfevents.1721074487.600aa654b321.2882.1 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:558a361f431286c7ea5089f6cbd4a78d1bfa3d939a83f159b2ebc29a31ab584e
3
- size 6632
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:3bab0d5c3cc369d415f320208b2045913d97ea1c01fc998f7a7a68572aa99cab
3
+ size 7257
runs/Jul15_20-08-42_600aa654b321/events.out.tfevents.1721075553.600aa654b321.2882.2 ADDED
@@ -0,0 +1,3 @@
 
 
 
 
1
+ version https://git-lfs.github.com/spec/v1
2
+ oid sha256:87f0e97bfd21a369dc9e093074e21eea6b2b911f2887094208bdc8157be5bf43
3
+ size 359