bassemessam committed on
Commit 62350de
1 Parent(s): 28bd801

End of training

README.md CHANGED
@@ -2,8 +2,6 @@
  base_model: csebuetnlp/mT5_multilingual_XLSum
  tags:
  - generated_from_trainer
- metrics:
- - rouge
  model-index:
  - name: mT5_multilingual_XLSum-finetuned-wiki-lingua
  results: []
@@ -15,13 +13,6 @@ should probably proofread and complete it, then remove this comment. -->
  # mT5_multilingual_XLSum-finetuned-wiki-lingua
 
  This model is a fine-tuned version of [csebuetnlp/mT5_multilingual_XLSum](https://huggingface.co/csebuetnlp/mT5_multilingual_XLSum) on the None dataset.
- It achieves the following results on the evaluation set:
- - Loss: nan
- - Rouge1: 0.0
- - Rouge2: 0.0
- - Rougel: 0.0
- - Rougelsum: 0.0
- - Gen Len: 83.0
 
  ## Model description
 
@@ -46,18 +37,14 @@ The following hyperparameters were used during training:
  - seed: 42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  - lr_scheduler_type: linear
- - num_epochs: 5
+ - num_epochs: 1
  - mixed_precision_training: Native AMP
 
  ### Training results
 
- | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
- |:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:-------:|:---------:|:-------:|
- | No log | 1.0 | 160 | 3.5827 | 13.3527 | 3.7474 | 10.2363 | 12.3399 | 29.4615 |
- | No log | 2.0 | 320 | 3.5827 | 13.3637 | 3.7687 | 10.2693 | 12.3497 | 29.3223 |
- | No log | 3.0 | 480 | nan | 0.0 | 0.0 | 0.0 | 0.0 | 83.0 |
- | 3.9813 | 4.0 | 640 | nan | 0.0 | 0.0 | 0.0 | 0.0 | 83.0 |
- | 3.9813 | 5.0 | 800 | nan | 0.0 | 0.0 | 0.0 | 0.0 | 83.0 |
+ | Training Loss | Epoch | Step | Validation Loss | Rouge1 | Rouge2 | Rougel | Rougelsum | Gen Len |
+ |:-------------:|:-----:|:----:|:---------------:|:-------:|:------:|:------:|:---------:|:-------:|
+ | No log | 1.0 | 160 | 3.6384 | 11.8921 | 3.1437 | 9.029 | 10.7853 | 28.4615 |
 
 
  ### Framework versions
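In the results table this commit removes from README.md, the validation loss becomes nan from epoch 3 onward and every ROUGE score drops to 0.0, while the new run stops after a single epoch. As a rough illustration, a table like that could be scanned for the first diverged epoch as sketched below. The helper names `parse_rows` and `first_nan_epoch` are hypothetical and not part of the repository; the rows are copied from the removed table.

```python
import math

# Rows copied from the results table removed from README.md in this commit.
OLD_RESULTS = """\
| No log | 1.0 | 160 | 3.5827 | 13.3527 | 3.7474 | 10.2363 | 12.3399 | 29.4615 |
| No log | 2.0 | 320 | 3.5827 | 13.3637 | 3.7687 | 10.2693 | 12.3497 | 29.3223 |
| No log | 3.0 | 480 | nan | 0.0 | 0.0 | 0.0 | 0.0 | 83.0 |
| 3.9813 | 4.0 | 640 | nan | 0.0 | 0.0 | 0.0 | 0.0 | 83.0 |
| 3.9813 | 5.0 | 800 | nan | 0.0 | 0.0 | 0.0 | 0.0 | 83.0 |
"""


def parse_rows(table):
    """Turn Markdown table rows into dicts with epoch, step and validation loss."""
    rows = []
    for line in table.strip().splitlines():
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        rows.append(
            {
                "epoch": float(cells[1]),
                "step": int(cells[2]),
                "val_loss": float(cells[3]),  # float("nan") parses to NaN
            }
        )
    return rows


def first_nan_epoch(rows):
    """Return the first epoch whose validation loss is NaN, or None."""
    for row in rows:
        if math.isnan(row["val_loss"]):
            return row["epoch"]
    return None


rows = parse_rows(OLD_RESULTS)
print(first_nan_epoch(rows))  # 3.0
```

The step column advances by 160 per epoch in both the old and the new table, so the single row kept by this commit corresponds to the first evaluation point of the earlier run's schedule.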
generation_config.json CHANGED
@@ -1,6 +1,10 @@
  {
- "do_sample": true,
+ "decoder_start_token_id": 0,
  "eos_token_id": 1,
- "max_new_tokens": 50,
+ "length_penalty": 0.6,
+ "max_length": 84,
+ "no_repeat_ngram_size": 2,
+ "num_beams": 4,
+ "pad_token_id": 0,
  "transformers_version": "4.40.2"
  }
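The generation_config.json change drops the sampling settings (`do_sample`, `max_new_tokens`) and adds beam-search style settings (`num_beams`, `length_penalty`, `no_repeat_ngram_size`, `max_length`) plus explicit token ids. The sketch below summarizes that change using only the standard library. The two dictionaries are copied from the diff, and `diff_config` is a hypothetical helper, not something shipped with the repository or with transformers.

```python
import json

# Old and new generation_config.json contents, copied from the diff.
OLD = json.loads(
    '{"do_sample": true, "eos_token_id": 1, "max_new_tokens": 50,'
    ' "transformers_version": "4.40.2"}'
)
NEW = json.loads(
    '{"decoder_start_token_id": 0, "eos_token_id": 1, "length_penalty": 0.6,'
    ' "max_length": 84, "no_repeat_ngram_size": 2, "num_beams": 4,'
    ' "pad_token_id": 0, "transformers_version": "4.40.2"}'
)


def diff_config(old, new):
    """Report keys that were removed, added, or kept with a different value."""
    shared = set(old) & set(new)
    return {
        "removed": sorted(set(old) - set(new)),
        "added": sorted(set(new) - set(old)),
        "changed": sorted(key for key in shared if old[key] != new[key]),
    }


result = diff_config(OLD, NEW)
print(result["removed"])  # ['do_sample', 'max_new_tokens']
print(result["added"])
print(result["changed"])  # []
```

With `do_sample` no longer set and `num_beams` at 4, a transformers `generate()` call that picks up this file would be expected to use beam search rather than sampling, assuming the caller does not override these values.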
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:e6ca2a1c31c01f7f5847cc44ac97ab8aca5ad3c907d799fbe90bb5c45392a5f8
+ oid sha256:d72e54f189c8c935d74005b2807a7669f2da43136648bd3ea7265fd6df1a3b20
  size 2329638768
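model.safetensors is stored through Git LFS, so the diff shows only the three-line pointer file: the `oid` (a SHA-256 of the real file) changes while `size` stays at 2329638768 bytes, which is what one would expect when weights are retrained without changing the architecture. A minimal sketch of reading such pointers follows; `parse_lfs_pointer` is a hypothetical helper, and the pointer text is copied from the diff.

```python
def parse_lfs_pointer(text):
    """Parse a Git LFS pointer file into version, hash algorithm, digest and size."""
    # Each pointer line has the form "<key> <value>".
    fields = dict(line.split(" ", 1) for line in text.strip().splitlines())
    algo, digest = fields["oid"].split(":", 1)
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


# Pointer contents before and after this commit, copied from the diff.
OLD_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:e6ca2a1c31c01f7f5847cc44ac97ab8aca5ad3c907d799fbe90bb5c45392a5f8
size 2329638768
"""
NEW_POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:d72e54f189c8c935d74005b2807a7669f2da43136648bd3ea7265fd6df1a3b20
size 2329638768
"""

old = parse_lfs_pointer(OLD_POINTER)
new = parse_lfs_pointer(NEW_POINTER)
print(old["size"] == new["size"])      # True: same byte count
print(old["digest"] == new["digest"])  # False: different content
```

The same reading applies to the tokenizer.json and training_args.bin pointers in this commit, where the digest changes and the size is unchanged.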
runs/May12_09-39-07_36723006b0fb/events.out.tfevents.1715506779.36723006b0fb.4735.0 ADDED
@@ -0,0 +1,3 @@
+ version https://git-lfs.github.com/spec/v1
+ oid sha256:ac5bf3b9d4ffb8740dbce34cee64a849c5f50c0cbac0a2f1da210c200026e512
+ size 6048
tokenizer.json CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:84ec7ac09e74719df0d7ac26684f6bb9939553133a2b7916d91c08ff9d959a2f
+ oid sha256:573929f8d971fbe24f97f5e5dfb47d7e6e7f9ba43ae8dd35b424d61767660c6f
  size 16330638
training_args.bin CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:f624d65cdcfa4673f05a14d38feccb6f96f5857b4da6e7e2f1bd21c182fbdc5f
+ oid sha256:f7f7234b6dee15832fee763229852c4ef24c8ba092470c7cefce39b025c030e8
  size 5176