ammonbro
/

up_down_sp_model

Text Generation

Generated from Trainer

text-generation-inference

Inference Endpoints

Model card Files Files and versions Community

ammonbro commited on Oct 13

Commit

da46356

•

1 Parent(s): ff9239d

End of training

Files changed (2) hide show

README.md +4 -4
model.safetensors +1 -1

README.md CHANGED Viewed

@@ -16,7 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.1398
 ## Model description
@@ -47,9 +47,9 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.1468        | 1.0   | 1429 | 0.1404          |
-| 0.1387        | 2.0   | 2858 | 0.1395          |
-| 0.1378        | 3.0   | 4287 | 0.1398          |
 ### Framework versions

 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.7062
 ## Model description
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 0.6954        | 1.0   | 1429 | 0.7057          |
+| 0.691         | 2.0   | 2858 | 0.7039          |
+| 0.6887        | 3.0   | 4287 | 0.7062          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:f3f9ebc3ddcb86ce22b7ebf332764dcf1805ff6713f2e99d002336f7bf70d30b
 size 327657928

 version https://git-lfs.github.com/spec/v1
+oid sha256:d27e47abc52e219b1cf0267510ef56cfbe33c4a20441e64959f34d40f8955eb9
 size 327657928