hsmith-morganhill commited on
Commit
a29735a
·
verified ·
1 Parent(s): 9f08618

End of training

Browse files
Files changed (4) hide show
  1. README.md +10 -7
  2. config.json +1 -1
  3. model.safetensors +1 -1
  4. training_args.bin +2 -2
README.md CHANGED
@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [deepset/roberta-base-squad2](https://huggingface.co/deepset/roberta-base-squad2) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
- - Loss: 1.4789
19
 
20
  ## Model description
21
 
@@ -40,20 +40,23 @@ The following hyperparameters were used during training:
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
- - num_epochs: 3
44
 
45
  ### Training results
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
- | 1.9432 | 1.0 | 72 | 1.9258 |
50
- | 2.2068 | 2.0 | 144 | 1.5627 |
51
- | 1.3339 | 3.0 | 216 | 1.4789 |
 
 
 
52
 
53
 
54
  ### Framework versions
55
 
56
- - Transformers 4.43.4
57
- - Pytorch 2.4.0+cu121
58
  - Datasets 2.20.0
59
  - Tokenizers 0.19.1
 
15
 
16
  This model is a fine-tuned version of [deepset/roberta-base-squad2](https://huggingface.co/deepset/roberta-base-squad2) on the None dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Loss: 1.1706
19
 
20
  ## Model description
21
 
 
40
  - seed: 42
41
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
42
  - lr_scheduler_type: linear
43
+ - num_epochs: 6
44
 
45
  ### Training results
46
 
47
  | Training Loss | Epoch | Step | Validation Loss |
48
  |:-------------:|:-----:|:----:|:---------------:|
49
+ | 2.3938 | 1.0 | 72 | 1.8482 |
50
+ | 1.5735 | 2.0 | 144 | 1.4687 |
51
+ | 2.0029 | 3.0 | 216 | 1.3260 |
52
+ | 1.6424 | 4.0 | 288 | 1.2291 |
53
+ | 1.2187 | 5.0 | 360 | 1.1848 |
54
+ | 0.5262 | 6.0 | 432 | 1.1706 |
55
 
56
 
57
  ### Framework versions
58
 
59
+ - Transformers 4.41.2
60
+ - Pytorch 2.4.0
61
  - Datasets 2.20.0
62
  - Tokenizers 0.19.1
config.json CHANGED
@@ -23,7 +23,7 @@
23
  "pad_token_id": 1,
24
  "position_embedding_type": "absolute",
25
  "torch_dtype": "float32",
26
- "transformers_version": "4.43.4",
27
  "type_vocab_size": 1,
28
  "use_cache": true,
29
  "vocab_size": 50265
 
23
  "pad_token_id": 1,
24
  "position_embedding_type": "absolute",
25
  "torch_dtype": "float32",
26
+ "transformers_version": "4.41.2",
27
  "type_vocab_size": 1,
28
  "use_cache": true,
29
  "vocab_size": 50265
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:f102435f4f80f0c68fc7b295a31e2a6a3d146e436f235962f04449f2fbc6542d
3
  size 496250232
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:35e5b0428008e59ee0ee7a5d84c8bc7e2a9b91f551a27c2f19da99eb30e59c62
3
  size 496250232
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:be6e32daa03b7c861330289f350452edca2db39fc5fb685c7c0fd18225ad0de3
3
- size 5176
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fe674f5550a770b4439a4694756bb78e65dfaaafd65c486a726b0cf6942a562e
3
+ size 5048