qing-yao commited on
Commit
317c8fc
·
verified ·
1 Parent(s): 803e95c

Model save

Browse files
Files changed (5) hide show
  1. README.md +22 -22
  2. model.safetensors +1 -1
  3. tokenizer.json +0 -0
  4. training_args.bin +1 -1
  5. vocab.json +0 -0
README.md CHANGED
@@ -16,8 +16,8 @@ should probably proofread and complete it, then remove this comment. -->
16
 
17
  This model was trained from scratch on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
- - Loss: 3.1445
20
- - Accuracy: 0.4045
21
 
22
  ## Model description
23
 
@@ -52,26 +52,26 @@ The following hyperparameters were used during training:
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
54
  |:-------------:|:-------:|:-----:|:---------------:|:--------:|
55
- | 6.036 | 0.9998 | 1525 | 4.3721 | 0.2972 |
56
- | 3.946 | 1.9997 | 3050 | 3.8545 | 0.3376 |
57
- | 3.6934 | 2.9995 | 4575 | 3.5792 | 0.3611 |
58
- | 3.3956 | 4.0 | 6101 | 3.4280 | 0.3751 |
59
- | 3.287 | 4.9998 | 7626 | 3.3301 | 0.3844 |
60
- | 3.1683 | 5.9997 | 9151 | 3.2748 | 0.3895 |
61
- | 3.1069 | 6.9995 | 10676 | 3.2356 | 0.3933 |
62
- | 3.0496 | 8.0 | 12202 | 3.2118 | 0.3959 |
63
- | 3.0073 | 8.9998 | 13727 | 3.1902 | 0.3981 |
64
- | 2.9755 | 9.9997 | 15252 | 3.1811 | 0.3991 |
65
- | 2.9444 | 10.9995 | 16777 | 3.1732 | 0.4007 |
66
- | 2.9239 | 12.0 | 18303 | 3.1687 | 0.4017 |
67
- | 2.9028 | 12.9998 | 19828 | 3.1596 | 0.4023 |
68
- | 2.8881 | 13.9997 | 21353 | 3.1569 | 0.4028 |
69
- | 2.8729 | 14.9995 | 22878 | 3.1514 | 0.4032 |
70
- | 2.862 | 16.0 | 24404 | 3.1557 | 0.4036 |
71
- | 2.8532 | 16.9998 | 25929 | 3.1493 | 0.4037 |
72
- | 2.84 | 17.9997 | 27454 | 3.1471 | 0.4039 |
73
- | 2.8419 | 18.9995 | 28979 | 3.1467 | 0.4041 |
74
- | 2.8258 | 19.9967 | 30500 | 3.1445 | 0.4045 |
75
 
76
 
77
  ### Framework versions
 
16
 
17
  This model was trained from scratch on an unknown dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 3.5169
20
+ - Accuracy: 0.3627
21
 
22
  ## Model description
23
 
 
52
 
53
  | Training Loss | Epoch | Step | Validation Loss | Accuracy |
54
  |:-------------:|:-------:|:-----:|:---------------:|:--------:|
55
+ | 6.1054 | 0.9998 | 1507 | 4.5218 | 0.2801 |
56
+ | 4.1313 | 1.9997 | 3014 | 4.0830 | 0.3122 |
57
+ | 3.9303 | 2.9995 | 4521 | 3.8739 | 0.3273 |
58
+ | 3.6971 | 4.0 | 6029 | 3.7495 | 0.3384 |
59
+ | 3.6089 | 4.9998 | 7536 | 3.6714 | 0.3456 |
60
+ | 3.5019 | 5.9997 | 9043 | 3.6255 | 0.3494 |
61
+ | 3.44 | 6.9995 | 10550 | 3.5873 | 0.3534 |
62
+ | 3.3899 | 8.0 | 12058 | 3.5776 | 0.3544 |
63
+ | 3.3437 | 8.9998 | 13565 | 3.5597 | 0.3571 |
64
+ | 3.3226 | 9.9997 | 15072 | 3.5447 | 0.3587 |
65
+ | 3.2818 | 10.9995 | 16579 | 3.5331 | 0.3599 |
66
+ | 3.2776 | 12.0 | 18087 | 3.5290 | 0.3604 |
67
+ | 3.2409 | 12.9998 | 19594 | 3.5242 | 0.3616 |
68
+ | 3.2466 | 13.9997 | 21101 | 3.5150 | 0.3623 |
69
+ | 3.2122 | 14.9995 | 22608 | 3.5189 | 0.3618 |
70
+ | 3.2248 | 16.0 | 24116 | 3.5207 | 0.3620 |
71
+ | 3.1929 | 16.9998 | 25623 | 3.5077 | 0.3634 |
72
+ | 3.2095 | 17.9997 | 27130 | 3.5113 | 0.3637 |
73
+ | 3.1794 | 18.9995 | 28637 | 3.5053 | 0.3634 |
74
+ | 3.1995 | 19.9967 | 30140 | 3.5169 | 0.3627 |
75
 
76
 
77
  ### Framework versions
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:bf041a27301cdd5261bedbdb4252e66ad3fa7d8ed6feee6160eb9845e55203be
3
  size 441702288
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:c1e25e335884a441bb5f63b6d76b10f683c9b4c3d58b8dab7d35afab0e65c362
3
  size 441702288
tokenizer.json CHANGED
The diff for this file is too large to render. See raw diff
 
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:37bb9d5d6e20148575e8d83dffd5b5bfcad65756451c6aef79e915572ce80111
3
  size 5368
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:fb5ee132f30ac1b67a7c54692c6b1860967050800495f7a8e1f5d0f5baed6dcb
3
  size 5368
vocab.json CHANGED
The diff for this file is too large to render. See raw diff