ninagroot/GPT2-705M

Files changed (4)

README.md +22 -6
model.safetensors +1 -1
runs/Apr29_16-13-26_gcn21.local.snellius.surf.nl/events.out.tfevents.1714400015.gcn21.local.snellius.surf.nl.467660.0 +3 -0
training_args.bin +1 -1

README.md CHANGED

@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 5.1361
 ## Model description
@@ -41,17 +41,33 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 300
-- num_epochs: 4
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 8.1174        | 1.0   | 3    | 7.4247          |
-| 6.2796        | 2.0   | 6    | 6.3362          |
-| 5.6153        | 3.0   | 9    | 5.6666          |
-| 4.8481        | 4.0   | 12   | 5.1361          |
 ### Framework versions

 This model is a fine-tuned version of [](https://huggingface.co/) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.6046
 ## Model description
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 300
+- num_epochs: 20
 - mixed_precision_training: Native AMP
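
Note on the schedule: the training results table in this README logs 3 optimizer steps per epoch, so 20 epochs come to 60 steps in total, which is fewer than the 300 warmup steps configured above. The sketch below writes out a linear-warmup-then-cosine multiplier by hand to show what that implies. It assumes the run used that common schedule shape, which this diff does not confirm. The function name and the constants are illustrative, and the base learning rate is not visible in this hunk, so only the multiplier is computed.

```python
import math


def cosine_with_warmup_multiplier(step, warmup_steps, total_steps, num_cycles=0.5):
    """Learning-rate multiplier in [0, 1] for a linear warmup followed by cosine decay.

    Hand-written illustration (an assumption about the schedule shape),
    not code taken from the training run.
    """
    if step < warmup_steps:
        # Linear ramp from 0 up to 1 over the warmup phase.
        return step / max(1, warmup_steps)
    # Cosine decay over the steps that remain after warmup.
    progress = (step - warmup_steps) / max(1, total_steps - warmup_steps)
    return max(0.0, 0.5 * (1.0 + math.cos(math.pi * num_cycles * 2.0 * progress)))


WARMUP_STEPS = 300  # lr_scheduler_warmup_steps from the README
TOTAL_STEPS = 60    # 20 epochs x 3 steps per epoch, read off the results table

multipliers = [
    cosine_with_warmup_multiplier(step, WARMUP_STEPS, TOTAL_STEPS)
    for step in range(TOTAL_STEPS + 1)
]
peak = max(multipliers)
print(f"largest multiplier reached during the run: {peak:.2f}")
```

Under these assumptions the run ends while still in warmup, so the cosine branch is never reached and the learning rate never gets to its configured peak.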
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 8.0336        | 1.0   | 3    | 7.3770          |
+| 6.2535        | 2.0   | 6    | 6.3128          |
+| 5.6213        | 3.0   | 9    | 5.6716          |
+| 4.8242        | 4.0   | 12   | 5.1521          |
+| 4.6266        | 5.0   | 15   | 4.9789          |
+| 4.4097        | 6.0   | 18   | 4.7306          |
+| 4.0358        | 7.0   | 21   | 4.5332          |
+| 4.0027        | 8.0   | 24   | 4.4014          |
+| 3.8638        | 9.0   | 27   | 4.1175          |
+| 3.5414        | 10.0  | 30   | 4.0355          |
+| 3.4701        | 11.0  | 33   | 3.8834          |
+| 3.4822        | 12.0  | 36   | 3.8336          |
+| 3.0602        | 13.0  | 39   | 3.7213          |
+| 3.1109        | 14.0  | 42   | 3.7379          |
+| 2.9087        | 15.0  | 45   | 3.7389          |
+| 2.7124        | 16.0  | 48   | 3.6220          |
+| 2.5867        | 17.0  | 51   | 3.7192          |
+| 2.4577        | 18.0  | 54   | 3.5953          |
+| 2.279         | 19.0  | 57   | 3.7648          |
+| 2.3218        | 20.0  | 60   | 3.6046          |
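
A quick way to read the 20-epoch table is to find the epoch with the lowest validation loss and compare it with the final row, which is the value the README reports as `Loss`. The sketch below does this with the numbers copied from the added rows. The perplexity conversion assumes the loss is a mean per-token cross-entropy in nats, which the README does not state, and the variable names are illustrative.

```python
import math

# (epoch, training loss, validation loss) copied from the added table rows.
RESULTS = [
    (1, 8.0336, 7.3770), (2, 6.2535, 6.3128), (3, 5.6213, 5.6716),
    (4, 4.8242, 5.1521), (5, 4.6266, 4.9789), (6, 4.4097, 4.7306),
    (7, 4.0358, 4.5332), (8, 4.0027, 4.4014), (9, 3.8638, 4.1175),
    (10, 3.5414, 4.0355), (11, 3.4701, 3.8834), (12, 3.4822, 3.8336),
    (13, 3.0602, 3.7213), (14, 3.1109, 3.7379), (15, 2.9087, 3.7389),
    (16, 2.7124, 3.6220), (17, 2.5867, 3.7192), (18, 2.4577, 3.5953),
    (19, 2.279, 3.7648), (20, 2.3218, 3.6046),
]

# Epoch with the lowest validation loss.
best_epoch, _, best_val = min(RESULTS, key=lambda row: row[2])

# Final row: this is the value the README reports under "Loss".
final_epoch, final_train, final_val = RESULTS[-1]

# Assumption: loss is mean per-token cross-entropy in nats, so exp(loss) is perplexity.
final_perplexity = math.exp(final_val)

# Gap between validation and training loss at the end of training.
final_gap = final_val - final_train

print(f"best validation loss {best_val} at epoch {best_epoch}")
print(f"final validation loss {final_val} at epoch {final_epoch}")
print(f"final perplexity (under the stated assumption): {final_perplexity:.2f}")
print(f"final validation/training gap: {final_gap:.4f}")
```

The lowest validation loss falls at epoch 18 rather than at the final epoch, while training loss keeps falling, so the last few epochs may be fitting the training set more than improving held-out performance.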
 ### Framework versions

model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:1a9af2005cc75218ca636dbe06bbd65e30abdc4a5573abfb5df5688d6e3cec52
 size 2748401440

 version https://git-lfs.github.com/spec/v1
+oid sha256:ac64e7d51b66be7b695ce5f77da518218607444f579fe548439a3ad9fb4164f1
 size 2748401440
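
The entries for `model.safetensors` are Git LFS pointer files, not the weights themselves: each records the spec version, a SHA-256 object ID, and the size in bytes. Only the `oid` changes here, while the size stays at 2748401440 bytes, which fits a retrain of the same architecture. Below is a minimal sketch of parsing such a pointer and checking a downloaded file against it. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the code handles only the three fields shown in this diff.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text):
    """Parse the 'key value' lines of a Git LFS pointer into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer, chunk_size=1 << 20):
    """Return True if the file at `path` has the size and hash the pointer records."""
    path = Path(path)
    if path.stat().st_size != pointer["size"]:
        return False
    hasher = hashlib.new(pointer["algo"])
    with path.open("rb") as handle:
        # Read in chunks so multi-gigabyte weight files are not loaded at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            hasher.update(chunk)
    return hasher.hexdigest() == pointer["digest"]


# The new pointer for model.safetensors, copied from the diff above.
NEW_POINTER = """\
version https://git-lfs.github.com/spec/v1
oid sha256:ac64e7d51b66be7b695ce5f77da518218607444f579fe548439a3ad9fb4164f1
size 2748401440
"""

pointer = parse_lfs_pointer(NEW_POINTER)
print(pointer["algo"], pointer["size"])
```

With a local copy of the weights, `matches_pointer("model.safetensors", pointer)` would confirm that the download corresponds to this commit. The path is an example only.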

runs/Apr29_16-13-26_gcn21.local.snellius.surf.nl/events.out.tfevents.1714400015.gcn21.local.snellius.surf.nl.467660.0 ADDED

@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:dee410478de99ff6bbe4a90fdae59dbe432a47cd6712dae857716a580a017a39
+size 22767

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a30852277f467352b48aea78723b61253783bc15f07264ebd423e1691017aae0
 size 4984

 version https://git-lfs.github.com/spec/v1
+oid sha256:4425104d4ab90f66d74aec7b2ddb7ff0c53d6e50186c2d9c2928e9c6a711a4cc
 size 4984