pakawadeep commited on
Commit
3073edb
·
1 Parent(s): c20324c

Training in progress epoch 11

Browse files
README.md CHANGED
@@ -15,14 +15,14 @@ probably proofread and complete it, then remove this comment. -->
15
 
16
  This model is a fine-tuned version of [google/mt5-large](https://huggingface.co/google/mt5-large) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
- - Train Loss: 0.6330
19
- - Validation Loss: 0.6676
20
- - Train Rouge1: 8.6634
21
- - Train Rouge2: 1.7822
22
- - Train Rougel: 8.6810
23
- - Train Rougelsum: 8.6103
24
- - Train Gen Len: 11.9109
25
- - Epoch: 10
26
 
27
  ## Model description
28
 
@@ -59,6 +59,7 @@ The following hyperparameters were used during training:
59
  | 0.7521 | 0.7230 | 8.9109 | 2.3762 | 8.8755 | 8.9109 | 11.9406 | 8 |
60
  | 0.6888 | 0.6988 | 8.9109 | 2.3762 | 8.8755 | 8.9109 | 11.9307 | 9 |
61
  | 0.6330 | 0.6676 | 8.6634 | 1.7822 | 8.6810 | 8.6103 | 11.9109 | 10 |
 
62
 
63
 
64
  ### Framework versions
 
15
 
16
  This model is a fine-tuned version of [google/mt5-large](https://huggingface.co/google/mt5-large) on an unknown dataset.
17
  It achieves the following results on the evaluation set:
18
+ - Train Loss: 0.5835
19
+ - Validation Loss: 0.6465
20
+ - Train Rouge1: 7.7793
21
+ - Train Rouge2: 1.2871
22
+ - Train Rougel: 7.9208
23
+ - Train Rougelsum: 7.9208
24
+ - Train Gen Len: 11.9010
25
+ - Epoch: 11
26
 
27
  ## Model description
28
 
 
59
  | 0.7521 | 0.7230 | 8.9109 | 2.3762 | 8.8755 | 8.9109 | 11.9406 | 8 |
60
  | 0.6888 | 0.6988 | 8.9109 | 2.3762 | 8.8755 | 8.9109 | 11.9307 | 9 |
61
  | 0.6330 | 0.6676 | 8.6634 | 1.7822 | 8.6810 | 8.6103 | 11.9109 | 10 |
62
+ | 0.5835 | 0.6465 | 7.7793 | 1.2871 | 7.9208 | 7.9208 | 11.9010 | 11 |
63
 
64
 
65
  ### Framework versions
logs/train/events.out.tfevents.1719321340.9ce680c6f5b2.1682.0.v2 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:331364ef8a55240eca1d8e777a44df69cc462586cd59e2137ca82019c46d13f7
3
- size 13283050
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:e2411955e32a9b6c1cc22ed6eb4d13c81a4c274bcdd97bd7c061c78ceda572b2
3
+ size 13283472
logs/validation/events.out.tfevents.1719321998.9ce680c6f5b2.1682.1.v2 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:19d0d1cf31e07c95b492bb905d80dc0840ed16122a330c2acbdb84baa77f9855
3
- size 1792
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:d5ce8d69b56cf0f632ca09963a7cb97adec952340cde3088c2b7c5f91a7d8eb3
3
+ size 1948
tf_model.h5 CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:1ba44f29f4249ccb9cc2a4587294e2c9cfdb502285f6fdd22b07bcff9504c70e
3
  size 6968370776
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:a93ccc308c53d7504cb8cef08f336d037e029ce76fd00ef18ffc5bdba2bb0d0a
3
  size 6968370776