thorirhrafn commited on
Commit
b057e91
·
verified ·
1 Parent(s): 8c7ed89

End of training

Browse files
Files changed (1) hide show
  1. README.md +9 -7
README.md CHANGED
@@ -20,8 +20,8 @@ should probably proofread and complete it, then remove this comment. -->
20
 
21
  This model is a fine-tuned version of [AI-Sweden-Models/gpt-sw3-1.3b](https://huggingface.co/AI-Sweden-Models/gpt-sw3-1.3b) on an unknown dataset.
22
  It achieves the following results on the evaluation set:
23
- - Loss: 0.0017
24
- - Accuracy: 1.0
25
 
26
  ## Model description
27
 
@@ -48,16 +48,18 @@ The following hyperparameters were used during training:
48
  - total_train_batch_size: 16
49
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
50
  - lr_scheduler_type: linear
51
- - num_epochs: 2
52
 
53
  ### Training results
54
 
55
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
56
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
57
- | 0.1303 | 0.41 | 50 | 0.1400 | 0.9697 |
58
- | 0.0606 | 0.83 | 100 | 0.0171 | 0.9865 |
59
- | 0.0005 | 1.24 | 150 | 0.0036 | 1.0 |
60
- | 0.0 | 1.65 | 200 | 0.0017 | 1.0 |
 
 
61
 
62
 
63
  ### Framework versions
 
20
 
21
  This model is a fine-tuned version of [AI-Sweden-Models/gpt-sw3-1.3b](https://huggingface.co/AI-Sweden-Models/gpt-sw3-1.3b) on an unknown dataset.
22
  It achieves the following results on the evaluation set:
23
+ - Loss: 0.0078
24
+ - Accuracy: 0.9966
25
 
26
  ## Model description
27
 
 
48
  - total_train_batch_size: 16
49
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
50
  - lr_scheduler_type: linear
51
+ - num_epochs: 1
52
 
53
  ### Training results
54
 
55
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
56
  |:-------------:|:-----:|:----:|:---------------:|:--------:|
57
+ | 0.3164 | 0.17 | 20 | 0.2708 | 0.9461 |
58
+ | 0.1799 | 0.33 | 40 | 0.1111 | 0.9697 |
59
+ | 0.0577 | 0.5 | 60 | 0.0276 | 0.9899 |
60
+ | 0.0064 | 0.66 | 80 | 0.0119 | 0.9933 |
61
+ | 0.0036 | 0.83 | 100 | 0.0099 | 0.9933 |
62
+ | 0.0035 | 0.99 | 120 | 0.0078 | 0.9966 |
63
 
64
 
65
  ### Framework versions