pakawadeep committed on
Commit 6c50202 · 1 Parent(s): 588d8b1

Training in progress epoch 4

README.md CHANGED
@@ -15,14 +15,14 @@ probably proofread and complete it, then remove this comment. -->

  This model is a fine-tuned version of [google/mt5-large](https://huggingface.co/google/mt5-large) on an unknown dataset.
  It achieves the following results on the evaluation set:
- - Train Loss: 1.4302
- - Validation Loss: 1.0860
+ - Train Loss: 1.2082
+ - Validation Loss: 0.9516
  - Train Rouge1: 8.3805
  - Train Rouge2: 2.4257
  - Train Rougel: 8.4158
  - Train Rougelsum: 8.4158
- - Train Gen Len: 11.8069
- - Epoch: 3
+ - Train Gen Len: 11.8861
+ - Epoch: 4

  ## Model description

@@ -52,6 +52,7 @@ The following hyperparameters were used during training:
  | 2.2977 | 1.5680 | 7.0014 | 1.0891 | 7.0651 | 6.9307 | 11.3267 | 1 |
  | 1.7363 | 1.2611 | 7.0674 | 1.0891 | 7.1287 | 7.0745 | 11.5545 | 2 |
  | 1.4302 | 1.0860 | 8.3805 | 2.4257 | 8.4158 | 8.4158 | 11.8069 | 3 |
+ | 1.2082 | 0.9516 | 8.3805 | 2.4257 | 8.4158 | 8.4158 | 11.8861 | 4 |


  ### Framework versions
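
The results-table rows in the README.md diff can be compared epoch to epoch with a few lines of Python. This is a minimal sketch, not part of the commit: the table header lies outside the diff hunk, so the column names in `COLUMNS` are an assumption inferred by matching the epoch 3 and epoch 4 rows against the metric list in the same README. The helper names `parse_rows` and `loss_deltas` are hypothetical.

```python
# Rows copied from the README.md results table shown in this diff.
ROWS = """
| 2.2977 | 1.5680 | 7.0014 | 1.0891 | 7.0651 | 6.9307 | 11.3267 | 1 |
| 1.7363 | 1.2611 | 7.0674 | 1.0891 | 7.1287 | 7.0745 | 11.5545 | 2 |
| 1.4302 | 1.0860 | 8.3805 | 2.4257 | 8.4158 | 8.4158 | 11.8069 | 3 |
| 1.2082 | 0.9516 | 8.3805 | 2.4257 | 8.4158 | 8.4158 | 11.8861 | 4 |
"""

# Assumed column order (the header row is not visible in the hunk).
COLUMNS = [
    "train_loss", "val_loss", "rouge1", "rouge2",
    "rougel", "rougelsum", "gen_len", "epoch",
]


def parse_rows(text):
    """Turn pipe-delimited table rows into a list of dicts keyed by COLUMNS."""
    rows = []
    for line in text.strip().splitlines():
        cells = [cell.strip() for cell in line.strip().strip("|").split("|")]
        row = dict(zip(COLUMNS, (float(cell) for cell in cells)))
        row["epoch"] = int(row["epoch"])
        rows.append(row)
    return rows


def loss_deltas(rows):
    """Return (epoch, train-loss change, validation-loss change) per step."""
    return [
        (
            cur["epoch"],
            round(cur["train_loss"] - prev["train_loss"], 4),
            round(cur["val_loss"] - prev["val_loss"], 4),
        )
        for prev, cur in zip(rows, rows[1:])
    ]


if __name__ == "__main__":
    for epoch, d_train, d_val in loss_deltas(parse_rows(ROWS)):
        print(f"epoch {epoch}: train {d_train:+.4f}, validation {d_val:+.4f}")
```

Both losses fall at every step in the table, while the four ROUGE columns of the epoch 4 row are identical to those of the epoch 3 row.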
logs/train/events.out.tfevents.1719321340.9ce680c6f5b2.1682.0.v2 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:8d5dde927941ea9f34743677a04b44f54ffcf3cee6a4c4e1eff53b1342a682bd
- size 13280096
+ oid sha256:e6df23010623d4d3a37b6f7b6b2e6f28855f173402c2b241d4efb446ce7d70b9
+ size 13280518
logs/validation/events.out.tfevents.1719321998.9ce680c6f5b2.1682.1.v2 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:c7db293ac6f9ff7ffe7cda1f64500bce5d2c5c284b20e09704aad75dfed8f590
- size 700
+ oid sha256:aecccfd9e57c907caeb19229f7fa8a64da586dc6748e67b676bbfebc2608405f
+ size 856
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:980f059f2b1db0a1839ad1cc57090f61fc6fa47c73105c69c7c0937b895a23e3
+ oid sha256:ec53682f56c8d287fd5a867d065ca6eb0d86b25f712f5347af7d89a0f85a0051
  size 6968370776
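
The three binary files in this commit are stored as Git LFS pointers: a `version` line, an `oid sha256:` digest, and a `size` in bytes. The sketch below is an illustration rather than anything shipped with the commit; it shows one way to parse such a pointer and check a downloaded file against it. The function names `parse_lfs_pointer` and `stream_matches_pointer` are hypothetical, and the file is read in chunks because `tf_model.h5` is about 7 GB.

```python
import hashlib


def parse_lfs_pointer(text):
    """Parse the key/value lines of a Git LFS pointer file into a dict."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    if algo != "sha256":
        raise ValueError(f"unsupported oid algorithm: {algo}")
    return {
        "version": fields["version"],
        "digest": digest,
        "size": int(fields["size"]),
    }


def stream_matches_pointer(stream, pointer, chunk_size=1 << 20):
    """Hash a binary stream in chunks and compare size and digest to the pointer."""
    hasher = hashlib.sha256()
    total = 0
    while True:
        chunk = stream.read(chunk_size)
        if not chunk:
            break
        hasher.update(chunk)
        total += len(chunk)
    return total == pointer["size"] and hasher.hexdigest() == pointer["digest"]


if __name__ == "__main__":
    # New pointer for tf_model.h5, copied from the diff.
    pointer = parse_lfs_pointer(
        "version https://git-lfs.github.com/spec/v1\n"
        "oid sha256:ec53682f56c8d287fd5a867d065ca6eb0d86b25f712f5347af7d89a0f85a0051\n"
        "size 6968370776\n"
    )
    # Assumes the weights were downloaded to the working directory.
    with open("tf_model.h5", "rb") as handle:
        print(stream_matches_pointer(handle, pointer))
```

For `tf_model.h5` the digest changes while the size stays at 6968370776 bytes, so a size check alone would not tell the two versions apart.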