Harshatheeswar commited on
Commit
3ab89b4
1 Parent(s): 0f9e6be

End of training

Browse files
README.md CHANGED
@@ -1,5 +1,6 @@
1
  ---
2
  library_name: transformers
 
3
  tags:
4
  - generated_from_trainer
5
  model-index:
@@ -12,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
12
 
13
  # gpt2-scratch
14
 
15
- This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
16
  It achieves the following results on the evaluation set:
17
- - Loss: 4.4209
18
 
19
  ## Model description
20
 
@@ -41,20 +42,18 @@ The following hyperparameters were used during training:
41
  - total_train_batch_size: 64
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: cosine
44
- - num_epochs: 2
45
  - mixed_precision_training: Native AMP
46
 
47
  ### Training results
48
 
49
  | Training Loss | Epoch | Step | Validation Loss |
50
  |:-------------:|:-----:|:----:|:---------------:|
51
- | 4.6168 | 1.0 | 1390 | 4.5492 |
52
- | 4.4783 | 2.0 | 2780 | 4.4209 |
53
 
54
 
55
  ### Framework versions
56
 
57
  - Transformers 4.44.2
58
  - Pytorch 2.4.1+cu121
59
- - Datasets 3.0.1
60
  - Tokenizers 0.19.1
 
1
  ---
2
  library_name: transformers
3
+ base_model: Harshatheeswar/gpt2-scratch
4
  tags:
5
  - generated_from_trainer
6
  model-index:
 
13
 
14
  # gpt2-scratch
15
 
16
+ This model is a fine-tuned version of [Harshatheeswar/gpt2-scratch](https://huggingface.co/Harshatheeswar/gpt2-scratch) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 4.2516
19
 
20
  ## Model description
21
 
 
42
  - total_train_batch_size: 64
43
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
44
  - lr_scheduler_type: cosine
45
+ - num_epochs: 1
46
  - mixed_precision_training: Native AMP
47
 
48
  ### Training results
49
 
50
  | Training Loss | Epoch | Step | Validation Loss |
51
  |:-------------:|:-----:|:----:|:---------------:|
52
+ | 4.3012 | 1.0 | 1390 | 4.2516 |
 
53
 
54
 
55
  ### Framework versions
56
 
57
  - Transformers 4.44.2
58
  - Pytorch 2.4.1+cu121
 
59
  - Tokenizers 0.19.1
runs/Oct12_06-45-21_c41c63f4f435/events.out.tfevents.1728715560.c41c63f4f435.185.0 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:6a556aff275778ebba414b07d2f05977025201a59c87fb6fbc4e80c6cedb2ea2
3
- size 34388
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:19e2775d304988bc394b928bf37c754d134bf09995ffec6eba4c948d4aac82b3
3
+ size 35013