add model
Browse files- README.md +24 -5
- pytorch_model.bin +1 -1
- runs/Jul28_02-23-52_742332c5305d/1627439069.5438976/events.out.tfevents.1627439069.742332c5305d.64.1 +3 -0
- runs/Jul28_02-23-52_742332c5305d/events.out.tfevents.1627439069.742332c5305d.64.0 +3 -0
- runs/Jul28_02-25-34_742332c5305d/1627439150.1962867/events.out.tfevents.1627439150.742332c5305d.64.3 +3 -0
- runs/Jul28_02-25-34_742332c5305d/events.out.tfevents.1627439150.742332c5305d.64.2 +3 -0
- runs/Jul28_02-26-55_742332c5305d/1627439229.0293324/events.out.tfevents.1627439229.742332c5305d.64.5 +3 -0
- runs/Jul28_02-26-55_742332c5305d/events.out.tfevents.1627439229.742332c5305d.64.4 +3 -0
- runs/Jul28_02-30-25_742332c5305d/1627439438.9761453/events.out.tfevents.1627439438.742332c5305d.64.7 +3 -0
- runs/Jul28_02-30-25_742332c5305d/events.out.tfevents.1627439438.742332c5305d.64.6 +3 -0
- runs/Jul28_02-31-51_742332c5305d/1627439526.9561028/events.out.tfevents.1627439526.742332c5305d.64.9 +3 -0
- runs/Jul28_02-31-51_742332c5305d/events.out.tfevents.1627439526.742332c5305d.64.8 +3 -0
- runs/Jul28_02-34-02_742332c5305d/1627439645.1489418/events.out.tfevents.1627439645.742332c5305d.64.11 +3 -0
- runs/Jul28_02-34-02_742332c5305d/events.out.tfevents.1627439645.742332c5305d.64.10 +3 -0
- runs/Jul28_02-35-20_742332c5305d/1627439723.0324335/events.out.tfevents.1627439723.742332c5305d.64.13 +3 -0
- runs/Jul28_02-35-20_742332c5305d/events.out.tfevents.1627439723.742332c5305d.64.12 +3 -0
- runs/Jul28_02-37-15_742332c5305d/1627439838.335754/events.out.tfevents.1627439838.742332c5305d.64.15 +3 -0
- runs/Jul28_02-37-15_742332c5305d/events.out.tfevents.1627439838.742332c5305d.64.14 +3 -0
- runs/Jul28_02-41-34_742332c5305d/1627440097.181891/events.out.tfevents.1627440097.742332c5305d.64.17 +3 -0
- runs/Jul28_02-41-34_742332c5305d/events.out.tfevents.1627440097.742332c5305d.64.16 +3 -0
- training_args.bin +1 -1
README.md
CHANGED
@@ -1,12 +1,18 @@
|
|
1 |
---
|
2 |
tags:
|
3 |
- generated_from_trainer
|
|
|
|
|
4 |
model_index:
|
5 |
- name: opus-mt-ja-en-finetuned-ja-to-en_test
|
6 |
results:
|
7 |
- task:
|
8 |
name: Sequence-to-sequence Language Modeling
|
9 |
type: text2text-generation
|
|
|
|
|
|
|
|
|
10 |
---
|
11 |
|
12 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
@@ -15,6 +21,10 @@ should probably proofread and complete it, then remove this comment. -->
|
|
15 |
# opus-mt-ja-en-finetuned-ja-to-en_test
|
16 |
|
17 |
This model is a fine-tuned version of [Helsinki-NLP/opus-mt-ja-en](https://huggingface.co/Helsinki-NLP/opus-mt-ja-en) on an unkown dataset.
|
|
|
|
|
|
|
|
|
18 |
|
19 |
## Model description
|
20 |
|
@@ -33,20 +43,29 @@ More information needed
|
|
33 |
### Training hyperparameters
|
34 |
|
35 |
The following hyperparameters were used during training:
|
36 |
-
- learning_rate:
|
37 |
- train_batch_size: 16
|
38 |
- eval_batch_size: 16
|
39 |
- seed: 42
|
40 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
41 |
- lr_scheduler_type: linear
|
42 |
-
- num_epochs:
|
43 |
- mixed_precision_training: Native AMP
|
44 |
|
45 |
### Training results
|
46 |
|
47 |
-
| Training Loss | Epoch | Step | Validation Loss | Bleu
|
48 |
-
|
49 |
-
| No log | 1.0 | 20 | 2.
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
50 |
|
51 |
|
52 |
### Framework versions
|
|
|
1 |
---
|
2 |
tags:
|
3 |
- generated_from_trainer
|
4 |
+
metrics:
|
5 |
+
- bleu
|
6 |
model_index:
|
7 |
- name: opus-mt-ja-en-finetuned-ja-to-en_test
|
8 |
results:
|
9 |
- task:
|
10 |
name: Sequence-to-sequence Language Modeling
|
11 |
type: text2text-generation
|
12 |
+
metric:
|
13 |
+
name: Bleu
|
14 |
+
type: bleu
|
15 |
+
value: 41.7379
|
16 |
---
|
17 |
|
18 |
<!-- This model card has been generated automatically according to the information the Trainer had access to. You
|
|
|
21 |
# opus-mt-ja-en-finetuned-ja-to-en_test
|
22 |
|
23 |
This model is a fine-tuned version of [Helsinki-NLP/opus-mt-ja-en](https://huggingface.co/Helsinki-NLP/opus-mt-ja-en) on an unkown dataset.
|
24 |
+
It achieves the following results on the evaluation set:
|
25 |
+
- Loss: 2.2845
|
26 |
+
- Bleu: 41.7379
|
27 |
+
- Gen Len: 12.0337
|
28 |
|
29 |
## Model description
|
30 |
|
|
|
43 |
### Training hyperparameters
|
44 |
|
45 |
The following hyperparameters were used during training:
|
46 |
+
- learning_rate: 0.0001
|
47 |
- train_batch_size: 16
|
48 |
- eval_batch_size: 16
|
49 |
- seed: 42
|
50 |
- optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
|
51 |
- lr_scheduler_type: linear
|
52 |
+
- num_epochs: 10
|
53 |
- mixed_precision_training: Native AMP
|
54 |
|
55 |
### Training results
|
56 |
|
57 |
+
| Training Loss | Epoch | Step | Validation Loss | Bleu | Gen Len |
|
58 |
+
|:-------------:|:-----:|:----:|:---------------:|:-------:|:-------:|
|
59 |
+
| No log | 1.0 | 20 | 2.4396 | 18.3131 | 11.5506 |
|
60 |
+
| No log | 2.0 | 40 | 2.2467 | 27.0591 | 12.0562 |
|
61 |
+
| No log | 3.0 | 60 | 2.1419 | 33.5675 | 11.9888 |
|
62 |
+
| No log | 4.0 | 80 | 2.1646 | 37.728 | 12.5955 |
|
63 |
+
| No log | 5.0 | 100 | 2.1563 | 39.1659 | 12.191 |
|
64 |
+
| No log | 6.0 | 120 | 2.2355 | 38.2432 | 11.8652 |
|
65 |
+
| No log | 7.0 | 140 | 2.2503 | 41.4545 | 12.0 |
|
66 |
+
| No log | 8.0 | 160 | 2.2656 | 43.8778 | 12.2584 |
|
67 |
+
| No log | 9.0 | 180 | 2.2816 | 41.1478 | 12.2135 |
|
68 |
+
| No log | 10.0 | 200 | 2.2845 | 41.7379 | 12.0337 |
|
69 |
|
70 |
|
71 |
### Framework versions
|
pytorch_model.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 301227653
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:0a4902adef02be0c57efe2c87a90a4ca4d209d2c64d790b0e501b6f901af8095
|
3 |
size 301227653
|
runs/Jul28_02-23-52_742332c5305d/1627439069.5438976/events.out.tfevents.1627439069.742332c5305d.64.1
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f2a870ab9248761f700e71f80a4f52d00500ad3f7915a9bf76814d857d547eac
|
3 |
+
size 4339
|
runs/Jul28_02-23-52_742332c5305d/events.out.tfevents.1627439069.742332c5305d.64.0
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:8896f75c52376f084c971b8391d251451fa60ce39c8a3ef4ddda6b7edbd7ddc6
|
3 |
+
size 4445
|
runs/Jul28_02-25-34_742332c5305d/1627439150.1962867/events.out.tfevents.1627439150.742332c5305d.64.3
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:dcf76475530d8e70a3823e0842703702d13978cae9d04844e0d7f302d4fee181
|
3 |
+
size 4339
|
runs/Jul28_02-25-34_742332c5305d/events.out.tfevents.1627439150.742332c5305d.64.2
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:6935fb2aa85325876411269ddf2c4237f1592fbacb164a2b63f2561a8ed35e9b
|
3 |
+
size 4446
|
runs/Jul28_02-26-55_742332c5305d/1627439229.0293324/events.out.tfevents.1627439229.742332c5305d.64.5
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:bbf60f19c867587429843358612d18780fa6e08f2974a81f3b87ab3bedaa2331
|
3 |
+
size 4339
|
runs/Jul28_02-26-55_742332c5305d/events.out.tfevents.1627439229.742332c5305d.64.4
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:f7fed804e8a2b6ed9457ac3e845331675cd61cf6108fbff7c6e86128193dd01f
|
3 |
+
size 4445
|
runs/Jul28_02-30-25_742332c5305d/1627439438.9761453/events.out.tfevents.1627439438.742332c5305d.64.7
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:8a29ff98dae3fc696bad002af820930f229ba9ade74eefe988dfc38be577c1a9
|
3 |
+
size 4339
|
runs/Jul28_02-30-25_742332c5305d/events.out.tfevents.1627439438.742332c5305d.64.6
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:348571be619eb7b3a8c238586dbdc15d39d24729a3e214e9b247179f20021ac8
|
3 |
+
size 4446
|
runs/Jul28_02-31-51_742332c5305d/1627439526.9561028/events.out.tfevents.1627439526.742332c5305d.64.9
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:31fc9b531c5e628f5e6be803f64085cb847e09aeeaafddf758fd6d3facc625b0
|
3 |
+
size 4339
|
runs/Jul28_02-31-51_742332c5305d/events.out.tfevents.1627439526.742332c5305d.64.8
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:820407614988d9f9e1c286cdd8f90103337cdaebf9d29d007b00ba8828a79ad6
|
3 |
+
size 4446
|
runs/Jul28_02-34-02_742332c5305d/1627439645.1489418/events.out.tfevents.1627439645.742332c5305d.64.11
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:66ac436652f79f1fd16d92dcc90becb8956f619c51c640f0bf9a632f801f44da
|
3 |
+
size 4339
|
runs/Jul28_02-34-02_742332c5305d/events.out.tfevents.1627439645.742332c5305d.64.10
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:9f40a63469db1beba22ed50be9bf27e44e9ef7ff8cd910f5b58f39ae81f641c0
|
3 |
+
size 4446
|
runs/Jul28_02-35-20_742332c5305d/1627439723.0324335/events.out.tfevents.1627439723.742332c5305d.64.13
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:7c48c3bba9dc5b60826b6328b57bacda5d99171d58ece456a504e8b9c9926e1c
|
3 |
+
size 4339
|
runs/Jul28_02-35-20_742332c5305d/events.out.tfevents.1627439723.742332c5305d.64.12
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:56aefcad7e7aa69a2957320417499c6d21b84310f0b0474a7b8234eb3826affd
|
3 |
+
size 4445
|
runs/Jul28_02-37-15_742332c5305d/1627439838.335754/events.out.tfevents.1627439838.742332c5305d.64.15
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:a967e73f41ff8a2325de661f5a8ce83f98fb0bd998468fe4f754792d145bb977
|
3 |
+
size 4339
|
runs/Jul28_02-37-15_742332c5305d/events.out.tfevents.1627439838.742332c5305d.64.14
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:52be27562de860c0febe12e82616035f18b2b42da4112c94fc71fc7803c0d18c
|
3 |
+
size 4446
|
runs/Jul28_02-41-34_742332c5305d/1627440097.181891/events.out.tfevents.1627440097.742332c5305d.64.17
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:47e79beaab6cf56ddbc2a557f36b85da5a7a26252bc03a10a8ed1d2312eaeb31
|
3 |
+
size 4339
|
runs/Jul28_02-41-34_742332c5305d/events.out.tfevents.1627440097.742332c5305d.64.16
ADDED
@@ -0,0 +1,3 @@
|
|
|
|
|
|
|
|
|
1 |
+
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:c489c2e065035a9e148997dc5443e6ef39da8abbac4805247d33778449b17920
|
3 |
+
size 7748
|
training_args.bin
CHANGED
@@ -1,3 +1,3 @@
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
-
oid sha256:
|
3 |
size 2735
|
|
|
1 |
version https://git-lfs.github.com/spec/v1
|
2 |
+
oid sha256:ec13c49c86fb9c016aab7572241e4a540d7cd499eac5b45dba4439c358973d56
|
3 |
size 2735
|