Mohammed Aly committed on
Commit
0c75ac4
1 Parent(s): c649a9d

Commit Successfully!

Files changed (5)
  1. README.md +16 -18
  2. config.json +1 -1
  3. generation_config.json +1 -1
  4. model.safetensors +1 -1
  5. training_args.bin +2 -2
README.md CHANGED
@@ -3,8 +3,6 @@ license: apache-2.0
 base_model: t5-small
 tags:
 - generated_from_trainer
-datasets:
-- squad
 model-index:
 - name: t5-small-squad-qg
   results: []
@@ -15,9 +13,9 @@ should probably proofread and complete it, then remove this comment. -->
 
 # t5-small-squad-qg
 
-This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the squad dataset.
+This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.0170
+- Loss: 2.1731
 
 ## Model description
 
@@ -37,31 +35,31 @@ More information needed
 
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size: 32
-- eval_batch_size: 128
+- train_batch_size: 64
+- eval_batch_size: 64
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- num_epochs: 4
+- num_epochs: 3
 
 ### Training results
 
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 2.0686 | 0.48 | 300 | 2.0513 |
-| 2.0064 | 0.96 | 600 | 2.0541 |
-| 2.0108 | 1.44 | 900 | 2.0439 |
-| 2.0272 | 1.92 | 1200 | 2.0337 |
-| 2.0112 | 2.4 | 1500 | 2.0252 |
-| 2.0202 | 2.88 | 1800 | 2.0223 |
-| 2.0219 | 3.36 | 2100 | 2.0174 |
-| 2.0024 | 3.84 | 2400 | 2.0170 |
+| 2.4623 | 0.37 | 500 | 2.3734 |
+| 2.4617 | 0.73 | 1000 | 2.2860 |
+| 2.3629 | 1.1 | 1500 | 2.2450 |
+| 2.2836 | 1.46 | 2000 | 2.2154 |
+| 2.2393 | 1.83 | 2500 | 2.1966 |
+| 2.2242 | 2.19 | 3000 | 2.1849 |
+| 2.2134 | 2.56 | 3500 | 2.1760 |
+| 2.2058 | 2.92 | 4000 | 2.1731 |
 
 
 ### Framework versions
 
-- Transformers 4.37.0
+- Transformers 4.38.1
 - Pytorch 2.1.2
-- Datasets 2.1.0
-- Tokenizers 0.15.1
+- Datasets 2.13.1
+- Tokenizers 0.15.2
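The Epoch and Step columns in the two training tables, together with `train_batch_size`, give a rough estimate of how many training examples each run saw per epoch. The sketch below is only back-of-the-envelope arithmetic: it assumes a single device and no gradient accumulation (neither is stated in the diff), and the helper names are hypothetical. The epoch values in the table are rounded, so the results are approximate.

```python
def steps_per_epoch(step: int, epoch: float) -> float:
    """Optimizer steps in one epoch, inferred from one (Epoch, Step) table row."""
    return step / epoch


def approx_examples_per_epoch(step: int, epoch: float, batch_size: int) -> float:
    """Rough training-set size, assuming one device and no gradient accumulation."""
    return steps_per_epoch(step, epoch) * batch_size


# Last table row of each run, paired with that run's train_batch_size.
old_run = approx_examples_per_epoch(step=2400, epoch=3.84, batch_size=32)
new_run = approx_examples_per_epoch(step=4000, epoch=2.92, batch_size=64)

print(f"old run: ~{old_run:,.0f} examples per epoch")
print(f"new run: ~{new_run:,.0f} examples per epoch")
```

If those assumptions hold, the two estimates differ by more than a factor of four, so the two validation losses may not be directly comparable.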
config.json CHANGED
@@ -55,7 +55,7 @@
     }
   },
   "torch_dtype": "float32",
-  "transformers_version": "4.37.0",
+  "transformers_version": "4.38.1",
   "use_cache": true,
   "vocab_size": 32128
 }
generation_config.json CHANGED
@@ -3,5 +3,5 @@
   "decoder_start_token_id": 0,
   "eos_token_id": 1,
   "pad_token_id": 0,
-  "transformers_version": "4.37.0"
+  "transformers_version": "4.38.1"
 }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ab1b031d2763756e6cdfb670091648c88d39f61d60f4d225659bb7d0a3293f14
+oid sha256:7dd9544a748206cf8b1025a6da8cba0e13f2959527f1b2cc4b537331a8b99752
 size 242041896
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:fe1f83ce663977195d35dd2d1d3783f1fa73041a0903046d873c41d664a46cb7
-size 4664
+oid sha256:1ef48a24c829eb3b92ab182c09204347d6fa5802805222aec5f4addd6cff092c
+size 4856
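
The `model.safetensors` and `training_args.bin` entries are Git LFS pointer files: each records the SHA-256 digest (`oid`) and byte size of the real file rather than its contents. A minimal sketch of checking a downloaded file against such a pointer follows. The function names and the local path in the usage note are hypothetical, and the pointer text is copied from the new `training_args.bin` entry in this commit.

```python
import hashlib
from pathlib import Path


def parse_lfs_pointer(text: str) -> dict:
    """Parse the 'key value' lines of a Git LFS pointer file."""
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(path, pointer: dict) -> bool:
    """Return True if the file at `path` has the size and digest the pointer records."""
    p = Path(path)
    if p.stat().st_size != pointer["size"]:
        return False  # cheap size check before hashing
    h = hashlib.new(pointer["algo"])
    with p.open("rb") as f:
        for chunk in iter(lambda: f.read(1 << 20), b""):  # hash in 1 MiB chunks
            h.update(chunk)
    return h.hexdigest() == pointer["digest"]


# Pointer text for the new training_args.bin, as shown in this commit.
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:1ef48a24c829eb3b92ab182c09204347d6fa5802805222aec5f4addd6cff092c
size 4856
"""

pointer = parse_lfs_pointer(POINTER)
print(pointer["algo"], pointer["size"])
```

With a local copy of the file, `matches_pointer("training_args.bin", pointer)` would report whether the download matches this commit; the path here is only a placeholder.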