pakawadeep commited on
Commit
4b2239e
·
1 Parent(s): 6c50202

Training in progress epoch 5

Browse files
README.md CHANGED
@@ -15,14 +15,14 @@ probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [google/mt5-large](https://huggingface.co/google/mt5-large) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Train Loss: 1.2082
19
- - Validation Loss: 0.9516
20
  - Train Rouge1: 8.3805
21
  - Train Rouge2: 2.4257
22
  - Train Rougel: 8.4158
23
  - Train Rougelsum: 8.4158
24
- - Train Gen Len: 11.8861
25
- - Epoch: 4
26
 
27
  ## Model description
28
 
@@ -53,6 +53,7 @@ The following hyperparameters were used during training:
53
  | 1.7363 | 1.2611 | 7.0674 | 1.0891 | 7.1287 | 7.0745 | 11.5545 | 2 |
54
  | 1.4302 | 1.0860 | 8.3805 | 2.4257 | 8.4158 | 8.4158 | 11.8069 | 3 |
55
  | 1.2082 | 0.9516 | 8.3805 | 2.4257 | 8.4158 | 8.4158 | 11.8861 | 4 |
 
56
 
57
 
58
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [google/mt5-large](https://huggingface.co/google/mt5-large) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Train Loss: 1.0516
19
+ - Validation Loss: 0.8511
20
  - Train Rouge1: 8.3805
21
  - Train Rouge2: 2.4257
22
  - Train Rougel: 8.4158
23
  - Train Rougelsum: 8.4158
24
+ - Train Gen Len: 12.0149
25
+ - Epoch: 5
26
 
27
  ## Model description
28
 
 
53
  | 1.7363 | 1.2611 | 7.0674 | 1.0891 | 7.1287 | 7.0745 | 11.5545 | 2 |
54
  | 1.4302 | 1.0860 | 8.3805 | 2.4257 | 8.4158 | 8.4158 | 11.8069 | 3 |
55
  | 1.2082 | 0.9516 | 8.3805 | 2.4257 | 8.4158 | 8.4158 | 11.8861 | 4 |
56
+ | 1.0516 | 0.8511 | 8.3805 | 2.4257 | 8.4158 | 8.4158 | 12.0149 | 5 |
57
 
58
 
59
  ### Framework versions
logs/train/events.out.tfevents.1719321340.9ce680c6f5b2.1682.0.v2 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:e6df23010623d4d3a37b6f7b6b2e6f28855f173402c2b241d4efb446ce7d70b9
3
- size 13280518
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:0a628e321ecedb19bc408b6c28993125197a2d6abacf07a10c82e8033f8a5f40
3
+ size 13280940
logs/validation/events.out.tfevents.1719321998.9ce680c6f5b2.1682.1.v2 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:aecccfd9e57c907caeb19229f7fa8a64da586dc6748e67b676bbfebc2608405f
3
- size 856
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:7cc99fbe4afda4b971460173f25104700c81b9b059ebdc657cdff43afd11d819
3
+ size 1012
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:ec53682f56c8d287fd5a867d065ca6eb0d86b25f712f5347af7d89a0f85a0051
3
  size 6968370776
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:ff77b8569a3200871b4846ac1c9b9a256d6212834efaf66e4096128b291473ea
3
  size 6968370776