santis2 committed on
Commit 3f65a04
1 Parent(s): d19f926

End of training

Files changed (4)
  1. README.md +9 -3
  2. adapter_model.bin +1 -1
  3. tokenizer.json +14 -2
  4. training_args.bin +1 -1
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [distilgpt2](https://huggingface.co/distilgpt2) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.7285
+- Loss: 2.7754
 
 ## Model description
 
@@ -41,13 +41,19 @@ The following hyperparameters were used during training:
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: cosine
 - lr_scheduler_warmup_steps: 1000
-- num_epochs: 1
+- num_epochs: 5
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 3.3117 | 0.66 | 1000 | 2.7285 |
+| 3.6372 | 0.66 | 1000 | 2.8398 |
+| 3.0548 | 1.32 | 2000 | 2.7754 |
+| 3.0124 | 1.99 | 3000 | 2.7832 |
+| 3.0056 | 2.65 | 4000 | 2.8105 |
+| 2.9911 | 3.31 | 5000 | 2.7812 |
+| 2.983 | 3.97 | 6000 | 2.7676 |
+| 2.9735 | 4.64 | 7000 | 2.7754 |
 
 
 ### Framework versions
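The updated results table in README.md can be read more easily with a few lines of Python. The sketch below copies the seven new rows, finds the row with the lowest validation loss, and converts losses to perplexity. It assumes the reported loss is a mean token-level cross-entropy in nats, which the README does not state; the names `ROWS`, `perplexity`, and `best_row` are illustrative only.

```python
import math

# (training_loss, epoch, step, validation_loss), copied from the new README table.
ROWS = [
    (3.6372, 0.66, 1000, 2.8398),
    (3.0548, 1.32, 2000, 2.7754),
    (3.0124, 1.99, 3000, 2.7832),
    (3.0056, 2.65, 4000, 2.8105),
    (2.9911, 3.31, 5000, 2.7812),
    (2.983, 3.97, 6000, 2.7676),
    (2.9735, 4.64, 7000, 2.7754),
]


def perplexity(loss: float) -> float:
    """Convert a loss to perplexity, ASSUMING mean cross-entropy in nats."""
    return math.exp(loss)


def best_row(rows):
    """Return the row with the lowest validation loss."""
    return min(rows, key=lambda row: row[3])


best = best_row(ROWS)
final = ROWS[-1]

print(f"lowest validation loss: {best[3]} at step {best[2]}")
print(f"last logged validation loss: {final[3]} at step {final[2]}")
print(f"perplexity at last logged step: {perplexity(final[3]):.2f}")
```

The headline `Loss: 2.7754` in the README matches the last logged row (step 7000), while the lowest validation loss in the table, 2.7676, occurs at step 6000.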
adapter_model.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:aae7b1c77a407bcafeee2fc69e48abddb4dc0247d43fd0e94910d3060183088a
+oid sha256:deab2a915f6881025fbc99c13f13b338d94cc8248f0076f261b80f3193525820
 size 418013
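The binary files in this commit are stored as Git LFS pointer files: three `key value` lines giving the spec version, a SHA-256 object id, and the size in bytes. A minimal sketch of parsing such a pointer and checking a downloaded blob against it follows. The helper names `parse_lfs_pointer` and `matches_pointer` are hypothetical, and the sample pointer is the new `training_args.bin` pointer from this commit.

```python
import hashlib
from typing import Dict


def parse_lfs_pointer(text: str) -> Dict[str, str]:
    """Split each 'key value' line of a Git LFS pointer into a dict entry."""
    fields: Dict[str, str] = {}
    for line in text.strip().splitlines():
        key, _, value = line.partition(" ")
        fields[key] = value
    return fields


def matches_pointer(data: bytes, pointer: Dict[str, str]) -> bool:
    """Check that data has the size and SHA-256 digest recorded in the pointer."""
    algo, _, digest = pointer["oid"].partition(":")
    if algo != "sha256":
        return False  # this sketch only handles sha256 object ids
    same_size = len(data) == int(pointer["size"])
    same_hash = hashlib.sha256(data).hexdigest() == digest
    return same_size and same_hash


# New pointer for training_args.bin, copied from this commit.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:830f091b7b9ed63b66cc6d35c8205ab488e9c026944bd3b61de3c7d3896f62cb
size 4091
"""

pointer = parse_lfs_pointer(POINTER)
print(pointer["oid"], pointer["size"])
```

In both pointer diffs the `size` line is unchanged and only the `oid` differs, so the file contents changed while the byte length stayed the same.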
tokenizer.json CHANGED
@@ -1,7 +1,19 @@
 {
   "version": "1.0",
-  "truncation": null,
-  "padding": null,
+  "truncation": {
+    "direction": "Right",
+    "max_length": 1024,
+    "strategy": "LongestFirst",
+    "stride": 0
+  },
+  "padding": {
+    "strategy": "BatchLongest",
+    "direction": "Right",
+    "pad_to_multiple_of": null,
+    "pad_id": 50256,
+    "pad_type_id": 0,
+    "pad_token": "<|endoftext|>"
+  },
   "added_tokens": [
     {
       "id": 50256,
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:506f6e20cd8c3db2c7ae9c5c31139cf0ee81ce8f7aa587ee77d824b22310b13b
+oid sha256:830f091b7b9ed63b66cc6d35c8205ab488e9c026944bd3b61de3c7d3896f62cb
 size 4091
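The tokenizer.json change in this commit replaces `"truncation": null` and `"padding": null` with concrete settings. The pure-Python sketch below illustrates what those settings are understood to mean for single sequences: cut each sequence to at most 1024 ids from the right, then right-pad every sequence to the longest one in the batch using id 50256 (`<|endoftext|>`). It is an illustration under those assumptions, not the tokenizers library's implementation; the function names are hypothetical, and the `LongestFirst` strategy and `stride` only matter for sequence pairs and overflow, which are not modeled here.

```python
from typing import List, Tuple

MAX_LENGTH = 1024  # "truncation.max_length" in the new tokenizer.json
PAD_ID = 50256     # "padding.pad_id", the id of "<|endoftext|>"


def truncate_right(ids: List[int], max_length: int = MAX_LENGTH) -> List[int]:
    """Keep the first max_length ids, dropping the excess from the right."""
    return ids[:max_length]


def pad_batch_longest(
    batch: List[List[int]], pad_id: int = PAD_ID
) -> Tuple[List[List[int]], List[List[int]]]:
    """Right-pad every sequence to the longest length found in the batch.

    Returns the padded ids and an attention mask (1 = real token, 0 = padding).
    """
    longest = max((len(ids) for ids in batch), default=0)
    padded = [ids + [pad_id] * (longest - len(ids)) for ids in batch]
    mask = [[1] * len(ids) + [0] * (longest - len(ids)) for ids in batch]
    return padded, mask


def encode_batch(batch: List[List[int]]) -> Tuple[List[List[int]], List[List[int]]]:
    """Apply truncation first, then batch-longest padding."""
    return pad_batch_longest([truncate_right(ids) for ids in batch])


ids, mask = encode_batch([[10, 11, 12], [20]])
print(ids)
print(mask)
```

Because the pad token is the same `<|endoftext|>` token the model already uses, the attention mask is what distinguishes padding from a genuine end-of-text token.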