AmalNlal committed on
Commit 25f802a · verified · 1 Parent(s): f23d610

End of training

Files changed (1)
  1. README.md +8 -8
README.md CHANGED
@@ -13,7 +13,7 @@ should probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [](https://huggingface.co/) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 5.6008
+ - Loss: 7.9301

  ## Model description

@@ -33,8 +33,8 @@ More information needed

  The following hyperparameters were used during training:
  - learning_rate: 0.01
- - train_batch_size: 32
- - eval_batch_size: 32
+ - train_batch_size: 16
+ - eval_batch_size: 16
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
@@ -44,11 +44,11 @@ The following hyperparameters were used during training:

  | Training Loss | Epoch | Step | Validation Loss |
  |:-------------:|:-----:|:----:|:---------------:|
- | 5.8219 | 0.12 | 100 | 5.6243 |
- | 5.6177 | 0.25 | 200 | 5.6153 |
- | 5.6121 | 0.37 | 300 | 5.6094 |
- | 5.6131 | 0.49 | 400 | 5.6055 |
- | 5.6 | 0.62 | 500 | 5.6008 |
+ | 9.2035 | 0.06 | 100 | 8.1180 |
+ | 8.0969 | 0.12 | 200 | 8.0501 |
+ | 8.002 | 0.19 | 300 | 8.0142 |
+ | 8.0026 | 0.25 | 400 | 7.9677 |
+ | 7.9075 | 0.31 | 500 | 7.9301 |


  ### Framework versions
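
The Epoch and Step columns in the two training tables of this diff line up with the batch size change: at step 500 the old run (batch size 32) had covered 0.62 of an epoch, while the new run (batch size 16) had covered 0.31. The sketch below is a rough consistency check, not part of the commit. The helper `estimate_examples_per_epoch` is a hypothetical name, and it assumes a single device and no gradient accumulation, neither of which is shown in the diff. Because the logged epoch values are rounded to two decimals, the result is only an approximation.

```python
def estimate_examples_per_epoch(step: int, epoch: float, batch_size: int) -> float:
    """Estimate training examples per epoch from one logged (step, epoch) pair.

    Assumes one optimizer step consumes exactly `batch_size` examples
    (single device, no gradient accumulation). This is an assumption,
    not something the diff confirms.
    """
    steps_per_epoch = step / epoch
    return steps_per_epoch * batch_size


# Last logged row of each table in the diff.
old_run = estimate_examples_per_epoch(step=500, epoch=0.62, batch_size=32)
new_run = estimate_examples_per_epoch(step=500, epoch=0.31, batch_size=16)

print(f"old run (batch 32): ~{old_run:.0f} examples per epoch")
print(f"new run (batch 16): ~{new_run:.0f} examples per epoch")
```

Under those assumptions both runs give the same estimate of the training set size, which suggests the two runs used the same data and that the new run, at step 500, had seen half as many examples as the old one.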