Harshatheeswar/gpt2-scratch

@@ -1,5 +1,6 @@
 ---
 library_name: transformers
 tags:
 - generated_from_trainer
 model-index:
@@ -12,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
 # gpt2-scratch
-This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 4.4209
 ## Model description
@@ -41,20 +42,18 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 64
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
-- num_epochs: 2
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 4.6168        | 1.0   | 1390 | 4.5492          |
-| 4.4783        | 2.0   | 2780 | 4.4209          |
 ### Framework versions
 - Transformers 4.44.2
 - Pytorch 2.4.1+cu121
-- Datasets 3.0.1
 - Tokenizers 0.19.1

 ---
 library_name: transformers
+base_model: Harshatheeswar/gpt2-scratch
 tags:
 - generated_from_trainer
 model-index:
 # gpt2-scratch
+This model is a fine-tuned version of [Harshatheeswar/gpt2-scratch](https://huggingface.co/Harshatheeswar/gpt2-scratch) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 4.2516
 ## Model description
 - total_train_batch_size: 64
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
+- num_epochs: 1
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 4.3012        | 1.0   | 1390 | 4.2516          |
 ### Framework versions
 - Transformers 4.44.2
 - Pytorch 2.4.1+cu121
 - Tokenizers 0.19.1
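
The evaluation loss in this diff drops from 4.4209 to 4.2516. A minimal sketch of how those figures could be read as perplexity, assuming the reported loss is the mean per-token cross-entropy in nats (the model card does not state this, and the helper name `perplexity` is hypothetical):

```python
import math


def perplexity(mean_cross_entropy_nats: float) -> float:
    """Convert a mean per-token cross-entropy (in nats) to perplexity."""
    return math.exp(mean_cross_entropy_nats)


# Loss values copied from the removed and added lines of the model card.
before = perplexity(4.4209)
after = perplexity(4.2516)

print(f"perplexity before: {before:.1f}")
print(f"perplexity after:  {after:.1f}")
print(f"relative change:   {(after - before) / before:.1%}")
```

Under that assumption, the extra epoch takes perplexity from roughly 83 to roughly 70. The two runs are only comparable if both were scored on the same evaluation set, which the card does not confirm.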

runs/Oct12_06-45-21_c41c63f4f435/events.out.tfevents.1728715560.c41c63f4f435.185.0

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:6a556aff275778ebba414b07d2f05977025201a59c87fb6fbc4e80c6cedb2ea2
-size 34388

 version https://git-lfs.github.com/spec/v1
+oid sha256:19e2775d304988bc394b928bf37c754d134bf09995ffec6eba4c948d4aac82b3
+size 35013
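
The event file is tracked through Git LFS, so the diff only shows the pointer file changing (a new `oid` and `size`), not the TensorBoard data itself. A minimal sketch of reading the two pointers shown above, assuming the three-line `key value` layout visible in this diff; `parse_lfs_pointer` is a hypothetical helper, not part of any Git LFS tooling:

```python
def parse_lfs_pointer(text: str) -> dict:
    """Parse a Git LFS pointer file made of 'key value' lines."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    # The oid field has the form '<hash algorithm>:<hex digest>'.
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "hash_algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents copied from the removed and added sides of the diff.
old = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:6a556aff275778ebba414b07d2f05977025201a59c87fb6fbc4e80c6cedb2ea2
size 34388
""")
new = parse_lfs_pointer("""
version https://git-lfs.github.com/spec/v1
oid sha256:19e2775d304988bc394b928bf37c754d134bf09995ffec6eba4c948d4aac82b3
size 35013
""")

print(new["size"] - old["size"])  # 625 bytes larger
print(old["digest"] != new["digest"])  # True: the file contents changed
```

The 625-byte growth matches a log file that was rewritten with additional events, although the pointer alone cannot show what was logged.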