End of training

Files changed (8) hide show

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [openai-community/gpt2](https://huggingface.co/openai-community/gpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 3.3517
 ## Model description
@@ -35,8 +35,8 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
-- train_batch_size: 192
-- eval_batch_size: 192
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
@@ -47,16 +47,16 @@ The following hyperparameters were used during training:
 ### Training results
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| 3.6325        | 1.0   | 1086 | 3.4211          |
-| 3.5084        | 2.0   | 2172 | 3.3519          |
-| 3.4583        | 3.0   | 3258 | 3.3415          |
 ### Framework versions
-- Transformers 4.42.3
 - Pytorch 2.3.0+cu121
 - Datasets 2.20.0
 - Tokenizers 0.19.1

 This model is a fine-tuned version of [openai-community/gpt2](https://huggingface.co/openai-community/gpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 3.2082
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 2e-05
+- train_batch_size: 32
+- eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 ### Training results
+| Training Loss | Epoch | Step  | Validation Loss |
+|:-------------:|:-----:|:-----:|:---------------:|
+| 3.381         | 1.0   | 6516  | 3.2650          |
+| 3.2617        | 2.0   | 13032 | 3.2063          |
+| 3.2142        | 3.0   | 19548 | 3.1986          |
 ### Framework versions
+- Transformers 4.42.4
 - Pytorch 2.3.0+cu121
 - Datasets 2.20.0
 - Tokenizers 0.19.1

config.json CHANGED Viewed

@@ -34,7 +34,7 @@
     }
   },
   "torch_dtype": "float32",
-  "transformers_version": "4.42.3",
   "use_cache": true,
   "vocab_size": 50259
 }

     }
   },
   "torch_dtype": "float32",
+  "transformers_version": "4.42.4",
   "use_cache": true,
   "vocab_size": 50259
 }

generation_config.json CHANGED Viewed

@@ -2,5 +2,5 @@
   "_from_model_config": true,
   "bos_token_id": 50256,
   "eos_token_id": 50256,
-  "transformers_version": "4.42.3"
 }

   "_from_model_config": true,
   "bos_token_id": 50256,
   "eos_token_id": 50256,
+  "transformers_version": "4.42.4"
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c9465943760784494a1af1b9c1f23fe47ca8cab81ff0ae1e85b5beb987fb5dfb
 size 497780352

 version https://git-lfs.github.com/spec/v1
+oid sha256:50dd230b60dd3f128ac5c636d2f0f1e3700945ecae80da25ee421c15eee8a4ab
 size 497780352

runs/Jul12_09-49-58_06f378f9c407/events.out.tfevents.1720777807.06f378f9c407.5254.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:0d5d6cafd96ea4a68aa59d77fc9cdf7f36a202c312f31ac0d4cac3d84a211ba4
+size 5505

runs/Jul12_09-52-21_06f378f9c407/events.out.tfevents.1720777945.06f378f9c407.6523.0 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:35ba56ddf1279bb1ab63deb49c888df609c6f3fafe999f09ddf5b7d2b3e35662
+size 14674

runs/Jul12_09-52-21_06f378f9c407/events.out.tfevents.1720787118.06f378f9c407.6523.1 ADDED Viewed

+version https://git-lfs.github.com/spec/v1
+oid sha256:4533f50efac44bb9d51c41479b7863e760cb4867471a6feacbc44ab35f038f1c
+size 364

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:771a248bb3127eb967831ca732d8af2553fd3670d4610178a68640f5aa27e0e9
 size 5240

 version https://git-lfs.github.com/spec/v1
+oid sha256:6c5a40843c26a73b213741685b0053b7cde405f00fa4684c3a3507a45c641fd1
 size 5240