lysandre HF staff committed on
Commit 1a527b6 · verified · 1 Parent(s): c901e2f

End of training

README.md CHANGED
@@ -16,29 +16,29 @@ should probably proofread and complete it, then remove this comment. -->
 
  This model is a fine-tuned version of [microsoft/conditional-detr-resnet-50](https://huggingface.co/microsoft/conditional-detr-resnet-50) on the None dataset.
  It achieves the following results on the evaluation set:
- - Loss: 0.8155
- - Map: 0.1291
- - Map 50: 0.1942
- - Map 75: 0.1499
- - Map Small: 0.1291
  - Map Medium: -1.0
  - Map Large: -1.0
- - Mar 1: 0.2282
- - Mar 10: 0.7027
- - Mar 100: 0.7855
- - Mar Small: 0.7855
  - Mar Medium: -1.0
  - Mar Large: -1.0
- - Map Left: 0.1364
- - Mar 100 Left: 0.8404
- - Map Right: 0.1038
- - Mar 100 Right: 0.8389
- - Map Up: 0.1453
- - Mar 100 Up: 0.7917
- - Map Down: 0.0569
- - Mar 100 Down: 0.7138
- - Map ?: 0.2029
- - Mar 100 ?: 0.7427
 
  ## Model description
 
@@ -69,41 +69,41 @@ The following hyperparameters were used during training:
 
  | Training Loss | Epoch | Step | Validation Loss | Map | Map 50 | Map 75 | Map Small | Map Medium | Map Large | Mar 1 | Mar 10 | Mar 100 | Mar Small | Mar Medium | Mar Large | Map Left | Mar 100 Left | Map Right | Mar 100 Right | Map Up | Mar 100 Up | Map Down | Mar 100 Down | Map ? | Mar 100 ? |
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:----------:|:---------:|:------:|:------:|:-------:|:---------:|:----------:|:---------:|:--------:|:------------:|:---------:|:-------------:|:------:|:----------:|:--------:|:------------:|:------:|:---------:|
- | No log | 1.0 | 8 | 11.3765 | 0.0004 | 0.0024 | 0.0 | 0.0004 | -1.0 | -1.0 | 0.0009 | 0.0069 | 0.0141 | 0.0141 | -1.0 | -1.0 | 0.0014 | 0.0085 | 0.0006 | 0.0444 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0177 |
- | No log | 2.0 | 16 | 4.0235 | 0.0098 | 0.0349 | 0.001 | 0.01 | -1.0 | -1.0 | 0.0094 | 0.1264 | 0.2197 | 0.2197 | -1.0 | -1.0 | 0.0204 | 0.3872 | 0.0001 | 0.0185 | 0.011 | 0.3104 | 0.0004 | 0.0483 | 0.0171 | 0.3339 |
- | No log | 3.0 | 24 | 2.1472 | 0.032 | 0.0868 | 0.0123 | 0.0321 | -1.0 | -1.0 | 0.0314 | 0.2188 | 0.3431 | 0.3431 | -1.0 | -1.0 | 0.083 | 0.483 | 0.0047 | 0.1889 | 0.0383 | 0.4917 | 0.0015 | 0.0966 | 0.0324 | 0.4556 |
- | No log | 4.0 | 32 | 1.6748 | 0.0318 | 0.0907 | 0.0132 | 0.0318 | -1.0 | -1.0 | 0.0476 | 0.2917 | 0.4206 | 0.4206 | -1.0 | -1.0 | 0.0572 | 0.566 | 0.0274 | 0.4889 | 0.0297 | 0.3479 | 0.0075 | 0.2172 | 0.0371 | 0.4831 |
- | No log | 5.0 | 40 | 1.4340 | 0.0396 | 0.1062 | 0.0206 | 0.0396 | -1.0 | -1.0 | 0.0477 | 0.3648 | 0.5125 | 0.5125 | -1.0 | -1.0 | 0.0642 | 0.6213 | 0.0432 | 0.587 | 0.0543 | 0.6042 | 0.0063 | 0.2276 | 0.0303 | 0.5226 |
- | No log | 6.0 | 48 | 1.2659 | 0.0501 | 0.1098 | 0.0332 | 0.0501 | -1.0 | -1.0 | 0.0785 | 0.4125 | 0.5584 | 0.5584 | -1.0 | -1.0 | 0.0872 | 0.6872 | 0.0415 | 0.6296 | 0.0654 | 0.5854 | 0.0106 | 0.2759 | 0.0459 | 0.6137 |
- | No log | 7.0 | 56 | 1.3187 | 0.0466 | 0.1349 | 0.0173 | 0.0466 | -1.0 | -1.0 | 0.0655 | 0.3572 | 0.4875 | 0.4875 | -1.0 | -1.0 | 0.0738 | 0.5617 | 0.0444 | 0.563 | 0.0572 | 0.5229 | 0.0206 | 0.231 | 0.0371 | 0.5589 |
- | No log | 8.0 | 64 | 1.1879 | 0.0692 | 0.1595 | 0.0365 | 0.0692 | -1.0 | -1.0 | 0.1132 | 0.4274 | 0.568 | 0.568 | -1.0 | -1.0 | 0.1319 | 0.6809 | 0.0434 | 0.6389 | 0.0953 | 0.5958 | 0.0216 | 0.3414 | 0.0536 | 0.5831 |
- | No log | 9.0 | 72 | 1.0866 | 0.0734 | 0.1604 | 0.0531 | 0.0734 | -1.0 | -1.0 | 0.1024 | 0.5221 | 0.6347 | 0.6347 | -1.0 | -1.0 | 0.0973 | 0.717 | 0.0706 | 0.6981 | 0.091 | 0.6146 | 0.0312 | 0.5034 | 0.077 | 0.6403 |
- | No log | 10.0 | 80 | 1.1141 | 0.0838 | 0.1778 | 0.0581 | 0.0838 | -1.0 | -1.0 | 0.1141 | 0.477 | 0.5995 | 0.5995 | -1.0 | -1.0 | 0.1254 | 0.7106 | 0.0469 | 0.5296 | 0.1313 | 0.6271 | 0.0639 | 0.4828 | 0.0517 | 0.6476 |
- | No log | 11.0 | 88 | 1.0441 | 0.1081 | 0.2059 | 0.113 | 0.1081 | -1.0 | -1.0 | 0.1468 | 0.5476 | 0.6598 | 0.6598 | -1.0 | -1.0 | 0.123 | 0.7596 | 0.0994 | 0.6926 | 0.2022 | 0.6979 | 0.0644 | 0.531 | 0.0513 | 0.6177 |
- | No log | 12.0 | 96 | 1.0162 | 0.0925 | 0.1924 | 0.0722 | 0.0925 | -1.0 | -1.0 | 0.1406 | 0.5546 | 0.6588 | 0.6588 | -1.0 | -1.0 | 0.1207 | 0.7447 | 0.0741 | 0.6481 | 0.1138 | 0.6708 | 0.0622 | 0.5586 | 0.0917 | 0.6718 |
- | No log | 13.0 | 104 | 1.0286 | 0.0654 | 0.1532 | 0.0471 | 0.0654 | -1.0 | -1.0 | 0.1208 | 0.4983 | 0.6221 | 0.6221 | -1.0 | -1.0 | 0.0899 | 0.7106 | 0.0484 | 0.613 | 0.0739 | 0.6313 | 0.0314 | 0.5103 | 0.0833 | 0.6452 |
- | No log | 14.0 | 112 | 1.0095 | 0.0779 | 0.1526 | 0.0658 | 0.078 | -1.0 | -1.0 | 0.1414 | 0.5457 | 0.6401 | 0.6401 | -1.0 | -1.0 | 0.0886 | 0.6894 | 0.0907 | 0.6981 | 0.088 | 0.6458 | 0.0389 | 0.5517 | 0.0834 | 0.6153 |
- | No log | 15.0 | 120 | 1.0553 | 0.0679 | 0.1489 | 0.0442 | 0.0679 | -1.0 | -1.0 | 0.0927 | 0.5145 | 0.6169 | 0.6169 | -1.0 | -1.0 | 0.0746 | 0.6766 | 0.0705 | 0.6204 | 0.0702 | 0.5854 | 0.0312 | 0.5931 | 0.0929 | 0.6089 |
- | No log | 16.0 | 128 | 0.9701 | 0.0871 | 0.1632 | 0.0832 | 0.0871 | -1.0 | -1.0 | 0.1328 | 0.5955 | 0.6803 | 0.6803 | -1.0 | -1.0 | 0.0797 | 0.734 | 0.0975 | 0.7315 | 0.0975 | 0.7167 | 0.0246 | 0.5862 | 0.136 | 0.6331 |
- | No log | 17.0 | 136 | 0.9433 | 0.0939 | 0.1694 | 0.0913 | 0.0939 | -1.0 | -1.0 | 0.1462 | 0.5953 | 0.6828 | 0.6828 | -1.0 | -1.0 | 0.0782 | 0.7234 | 0.1164 | 0.7685 | 0.0954 | 0.7167 | 0.0236 | 0.5448 | 0.156 | 0.6605 |
- | No log | 18.0 | 144 | 0.9000 | 0.1041 | 0.1756 | 0.116 | 0.1041 | -1.0 | -1.0 | 0.1777 | 0.6134 | 0.707 | 0.707 | -1.0 | -1.0 | 0.0824 | 0.7574 | 0.1205 | 0.7981 | 0.1057 | 0.7521 | 0.0248 | 0.5241 | 0.1868 | 0.7032 |
- | No log | 19.0 | 152 | 0.8657 | 0.1087 | 0.1706 | 0.1245 | 0.1087 | -1.0 | -1.0 | 0.1802 | 0.6586 | 0.7418 | 0.7418 | -1.0 | -1.0 | 0.0928 | 0.8255 | 0.0946 | 0.8037 | 0.116 | 0.7958 | 0.0286 | 0.5793 | 0.2114 | 0.7048 |
- | No log | 20.0 | 160 | 0.8645 | 0.1105 | 0.1855 | 0.1279 | 0.1105 | -1.0 | -1.0 | 0.1855 | 0.658 | 0.7452 | 0.7452 | -1.0 | -1.0 | 0.1074 | 0.8191 | 0.0912 | 0.8167 | 0.1259 | 0.7604 | 0.0375 | 0.6207 | 0.1906 | 0.7089 |
- | No log | 21.0 | 168 | 0.8434 | 0.1124 | 0.1785 | 0.1309 | 0.1124 | -1.0 | -1.0 | 0.2117 | 0.6712 | 0.7628 | 0.7628 | -1.0 | -1.0 | 0.1133 | 0.8404 | 0.0842 | 0.8167 | 0.1361 | 0.7833 | 0.0437 | 0.6552 | 0.1846 | 0.7185 |
- | No log | 22.0 | 176 | 0.8429 | 0.1141 | 0.1797 | 0.1332 | 0.1141 | -1.0 | -1.0 | 0.2087 | 0.6945 | 0.7789 | 0.7789 | -1.0 | -1.0 | 0.1176 | 0.817 | 0.0833 | 0.8278 | 0.1357 | 0.7979 | 0.0501 | 0.7379 | 0.1835 | 0.7137 |
- | No log | 23.0 | 184 | 0.8639 | 0.1106 | 0.1817 | 0.1213 | 0.1106 | -1.0 | -1.0 | 0.2023 | 0.6686 | 0.7507 | 0.7507 | -1.0 | -1.0 | 0.1275 | 0.8106 | 0.0756 | 0.7907 | 0.1272 | 0.7563 | 0.0457 | 0.6966 | 0.1767 | 0.6992 |
- | No log | 24.0 | 192 | 0.8237 | 0.1255 | 0.1907 | 0.1466 | 0.1255 | -1.0 | -1.0 | 0.2142 | 0.7028 | 0.7838 | 0.7838 | -1.0 | -1.0 | 0.1297 | 0.8277 | 0.0963 | 0.8352 | 0.1492 | 0.7979 | 0.0515 | 0.7241 | 0.2011 | 0.7339 |
- | No log | 25.0 | 200 | 0.8217 | 0.1305 | 0.1935 | 0.1579 | 0.1305 | -1.0 | -1.0 | 0.2163 | 0.7144 | 0.7838 | 0.7838 | -1.0 | -1.0 | 0.1317 | 0.8319 | 0.1076 | 0.8315 | 0.1577 | 0.7958 | 0.0541 | 0.7241 | 0.2014 | 0.7355 |
- | No log | 26.0 | 208 | 0.8197 | 0.1292 | 0.1955 | 0.1509 | 0.1292 | -1.0 | -1.0 | 0.2208 | 0.7083 | 0.7781 | 0.7781 | -1.0 | -1.0 | 0.1337 | 0.8277 | 0.1035 | 0.8389 | 0.1536 | 0.7854 | 0.0532 | 0.7 | 0.2023 | 0.7387 |
- | No log | 27.0 | 216 | 0.8266 | 0.1273 | 0.1942 | 0.1488 | 0.1273 | -1.0 | -1.0 | 0.2247 | 0.6947 | 0.7743 | 0.7743 | -1.0 | -1.0 | 0.1336 | 0.8277 | 0.1022 | 0.8315 | 0.1437 | 0.7792 | 0.0564 | 0.7 | 0.2004 | 0.7331 |
- | No log | 28.0 | 224 | 0.8207 | 0.1276 | 0.1941 | 0.1488 | 0.1276 | -1.0 | -1.0 | 0.2242 | 0.6992 | 0.7773 | 0.7773 | -1.0 | -1.0 | 0.1347 | 0.8319 | 0.1036 | 0.8296 | 0.1443 | 0.7833 | 0.0564 | 0.7069 | 0.1992 | 0.7347 |
- | No log | 29.0 | 232 | 0.8158 | 0.129 | 0.1942 | 0.1497 | 0.129 | -1.0 | -1.0 | 0.2275 | 0.7027 | 0.7851 | 0.7851 | -1.0 | -1.0 | 0.1364 | 0.8383 | 0.1038 | 0.8389 | 0.1451 | 0.7917 | 0.0569 | 0.7138 | 0.2028 | 0.7427 |
- | No log | 30.0 | 240 | 0.8155 | 0.1291 | 0.1942 | 0.1499 | 0.1291 | -1.0 | -1.0 | 0.2282 | 0.7027 | 0.7855 | 0.7855 | -1.0 | -1.0 | 0.1364 | 0.8404 | 0.1038 | 0.8389 | 0.1453 | 0.7917 | 0.0569 | 0.7138 | 0.2029 | 0.7427 |
 
 
  ### Framework versions
 
- - Transformers 4.47.1
  - Pytorch 2.5.1+cu124
  - Datasets 3.2.0
  - Tokenizers 0.21.0
 
 
  This model is a fine-tuned version of [microsoft/conditional-detr-resnet-50](https://huggingface.co/microsoft/conditional-detr-resnet-50) on the None dataset.
  It achieves the following results on the evaluation set:
+ - Loss: 1.0723
+ - Map: 0.2517
+ - Map 50: 0.4548
+ - Map 75: 0.257
+ - Map Small: 0.2517
  - Map Medium: -1.0
  - Map Large: -1.0
+ - Mar 1: 0.1521
+ - Mar 10: 0.6417
+ - Mar 100: 0.6542
+ - Mar Small: 0.6542
  - Mar Medium: -1.0
  - Mar Large: -1.0
+ - Map Left: 0.3868
+ - Mar 100 Left: 0.775
+ - Map Right: -1.0
+ - Mar 100 Right: -1.0
+ - Map Up: 0.3093
+ - Mar 100 Up: 0.7667
+ - Map Down: 0.0803
+ - Mar 100 Down: 0.7
+ - Map ?: 0.2304
+ - Mar 100 ?: 0.375
 
  ## Model description
 
 
 
  | Training Loss | Epoch | Step | Validation Loss | Map | Map 50 | Map 75 | Map Small | Map Medium | Map Large | Mar 1 | Mar 10 | Mar 100 | Mar Small | Mar Medium | Mar Large | Map Left | Mar 100 Left | Map Right | Mar 100 Right | Map Up | Mar 100 Up | Map Down | Mar 100 Down | Map ? | Mar 100 ? |
  |:-------------:|:-----:|:----:|:---------------:|:------:|:------:|:------:|:---------:|:----------:|:---------:|:------:|:------:|:-------:|:---------:|:----------:|:---------:|:--------:|:------------:|:---------:|:-------------:|:------:|:----------:|:--------:|:------------:|:------:|:---------:|
+ | No log | 1.0 | 8 | 22.5843 | 0.0006 | 0.0029 | 0.0 | 0.0008 | -1.0 | -1.0 | 0.0 | 0.0 | 0.05 | 0.05 | -1.0 | -1.0 | 0.0024 | 0.2 | -1.0 | -1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
+ | No log | 2.0 | 16 | 7.9952 | 0.0004 | 0.0015 | 0.0 | 0.0005 | -1.0 | -1.0 | 0.0 | 0.0 | 0.0375 | 0.0375 | -1.0 | -1.0 | 0.0018 | 0.15 | -1.0 | -1.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 | 0.0 |
+ | No log | 3.0 | 24 | 3.5489 | 0.0185 | 0.0727 | 0.0028 | 0.0212 | -1.0 | -1.0 | 0.05 | 0.1104 | 0.1917 | 0.1917 | -1.0 | -1.0 | 0.0246 | 0.4 | -1.0 | -1.0 | 0.0496 | 0.3667 | 0.0 | 0.0 | 0.0 | 0.0 |
+ | No log | 4.0 | 32 | 2.4717 | 0.027 | 0.1199 | 0.0 | 0.0275 | -1.0 | -1.0 | 0.0292 | 0.15 | 0.1688 | 0.1688 | -1.0 | -1.0 | 0.0515 | 0.375 | -1.0 | -1.0 | 0.0567 | 0.3 | 0.0 | 0.0 | 0.0 | 0.0 |
+ | No log | 5.0 | 40 | 1.8224 | 0.0917 | 0.2407 | 0.0369 | 0.0918 | -1.0 | -1.0 | 0.0083 | 0.4437 | 0.4563 | 0.4563 | -1.0 | -1.0 | 0.1954 | 0.7 | -1.0 | -1.0 | 0.071 | 0.6 | 0.075 | 0.3 | 0.0253 | 0.225 |
+ | No log | 6.0 | 48 | 1.7414 | 0.0807 | 0.2338 | 0.0645 | 0.0807 | -1.0 | -1.0 | 0.0562 | 0.4 | 0.4 | 0.4 | -1.0 | -1.0 | 0.1487 | 0.625 | -1.0 | -1.0 | 0.1012 | 0.6 | 0.0375 | 0.3 | 0.0353 | 0.075 |
+ | No log | 7.0 | 56 | 1.6537 | 0.0937 | 0.271 | 0.0217 | 0.0937 | -1.0 | -1.0 | 0.0521 | 0.3896 | 0.4521 | 0.4521 | -1.0 | -1.0 | 0.1276 | 0.6 | -1.0 | -1.0 | 0.0438 | 0.4333 | 0.1667 | 0.5 | 0.0367 | 0.275 |
+ | No log | 8.0 | 64 | 1.5604 | 0.1255 | 0.2203 | 0.1325 | 0.1255 | -1.0 | -1.0 | 0.1 | 0.4437 | 0.4563 | 0.4563 | -1.0 | -1.0 | 0.1517 | 0.65 | -1.0 | -1.0 | 0.2687 | 0.7 | 0.08 | 0.4 | 0.0017 | 0.075 |
+ | No log | 9.0 | 72 | 1.5820 | 0.1137 | 0.2273 | 0.0476 | 0.1137 | -1.0 | -1.0 | 0.0667 | 0.4042 | 0.4042 | 0.4042 | -1.0 | -1.0 | 0.1775 | 0.625 | -1.0 | -1.0 | 0.2082 | 0.5667 | 0.0667 | 0.4 | 0.0023 | 0.025 |
+ | No log | 10.0 | 80 | 1.7106 | 0.0698 | 0.2492 | 0.0092 | 0.0698 | -1.0 | -1.0 | 0.0208 | 0.2979 | 0.3042 | 0.3042 | -1.0 | -1.0 | 0.0692 | 0.35 | -1.0 | -1.0 | 0.0773 | 0.3667 | 0.1 | 0.4 | 0.0326 | 0.1 |
+ | No log | 11.0 | 88 | 1.5661 | 0.0972 | 0.3224 | 0.0265 | 0.0972 | -1.0 | -1.0 | 0.0333 | 0.2917 | 0.3625 | 0.3625 | -1.0 | -1.0 | 0.2155 | 0.5 | -1.0 | -1.0 | 0.1341 | 0.5 | 0.0233 | 0.3 | 0.0159 | 0.15 |
+ | No log | 12.0 | 96 | 1.3384 | 0.1733 | 0.3758 | 0.1295 | 0.1733 | -1.0 | -1.0 | 0.1813 | 0.4208 | 0.5375 | 0.5375 | -1.0 | -1.0 | 0.3161 | 0.7 | -1.0 | -1.0 | 0.2928 | 0.7 | 0.0497 | 0.5 | 0.0347 | 0.25 |
+ | No log | 13.0 | 104 | 1.4042 | 0.1442 | 0.3373 | 0.0748 | 0.1442 | -1.0 | -1.0 | 0.2021 | 0.3292 | 0.4292 | 0.4292 | -1.0 | -1.0 | 0.2102 | 0.625 | -1.0 | -1.0 | 0.3047 | 0.5667 | 0.0321 | 0.3 | 0.0297 | 0.225 |
+ | No log | 14.0 | 112 | 1.2224 | 0.2161 | 0.359 | 0.2552 | 0.2161 | -1.0 | -1.0 | 0.2021 | 0.5 | 0.5938 | 0.5938 | -1.0 | -1.0 | 0.258 | 0.825 | -1.0 | -1.0 | 0.4941 | 0.8 | 0.0819 | 0.5 | 0.0303 | 0.25 |
+ | No log | 15.0 | 120 | 1.2616 | 0.2083 | 0.3547 | 0.2527 | 0.2083 | -1.0 | -1.0 | 0.2062 | 0.4938 | 0.5688 | 0.5688 | -1.0 | -1.0 | 0.3888 | 0.75 | -1.0 | -1.0 | 0.3316 | 0.7 | 0.0861 | 0.6 | 0.0268 | 0.225 |
+ | No log | 16.0 | 128 | 1.2555 | 0.1365 | 0.3083 | 0.1347 | 0.1365 | -1.0 | -1.0 | 0.1167 | 0.4771 | 0.5083 | 0.5083 | -1.0 | -1.0 | 0.1695 | 0.625 | -1.0 | -1.0 | 0.276 | 0.6333 | 0.0571 | 0.4 | 0.0434 | 0.375 |
+ | No log | 17.0 | 136 | 1.3343 | 0.1443 | 0.3275 | 0.0747 | 0.1443 | -1.0 | -1.0 | 0.0938 | 0.4292 | 0.5042 | 0.5042 | -1.0 | -1.0 | 0.2401 | 0.65 | -1.0 | -1.0 | 0.2446 | 0.6667 | 0.0389 | 0.5 | 0.0535 | 0.2 |
+ | No log | 18.0 | 144 | 1.1292 | 0.2026 | 0.3685 | 0.2329 | 0.2026 | -1.0 | -1.0 | 0.1458 | 0.5688 | 0.625 | 0.625 | -1.0 | -1.0 | 0.2925 | 0.75 | -1.0 | -1.0 | 0.3259 | 0.8 | 0.0638 | 0.6 | 0.128 | 0.35 |
+ | No log | 19.0 | 152 | 1.1910 | 0.2172 | 0.4386 | 0.2382 | 0.2172 | -1.0 | -1.0 | 0.1312 | 0.4958 | 0.5583 | 0.5583 | -1.0 | -1.0 | 0.3371 | 0.75 | -1.0 | -1.0 | 0.3235 | 0.7333 | 0.05 | 0.5 | 0.1582 | 0.25 |
+ | No log | 20.0 | 160 | 1.1181 | 0.2308 | 0.4215 | 0.2283 | 0.2308 | -1.0 | -1.0 | 0.15 | 0.5521 | 0.6021 | 0.6021 | -1.0 | -1.0 | 0.2889 | 0.725 | -1.0 | -1.0 | 0.3201 | 0.7333 | 0.0643 | 0.6 | 0.2498 | 0.35 |
+ | No log | 21.0 | 168 | 1.1413 | 0.2461 | 0.4495 | 0.2525 | 0.2461 | -1.0 | -1.0 | 0.1437 | 0.5458 | 0.5958 | 0.5958 | -1.0 | -1.0 | 0.3734 | 0.75 | -1.0 | -1.0 | 0.3231 | 0.7333 | 0.0648 | 0.6 | 0.2232 | 0.3 |
+ | No log | 22.0 | 176 | 1.1065 | 0.2515 | 0.4523 | 0.2538 | 0.2515 | -1.0 | -1.0 | 0.1437 | 0.5583 | 0.6146 | 0.6146 | -1.0 | -1.0 | 0.3798 | 0.775 | -1.0 | -1.0 | 0.3221 | 0.7333 | 0.0754 | 0.6 | 0.2288 | 0.35 |
+ | No log | 23.0 | 184 | 1.1302 | 0.2227 | 0.46 | 0.2613 | 0.2227 | -1.0 | -1.0 | 0.1312 | 0.575 | 0.5813 | 0.5813 | -1.0 | -1.0 | 0.3648 | 0.75 | -1.0 | -1.0 | 0.2971 | 0.7 | 0.0625 | 0.6 | 0.1661 | 0.275 |
+ | No log | 24.0 | 192 | 1.0878 | 0.2496 | 0.4548 | 0.2559 | 0.2496 | -1.0 | -1.0 | 0.1437 | 0.6354 | 0.6625 | 0.6625 | -1.0 | -1.0 | 0.3698 | 0.775 | -1.0 | -1.0 | 0.3179 | 0.8 | 0.0797 | 0.7 | 0.2311 | 0.375 |
+ | No log | 25.0 | 200 | 1.0755 | 0.2593 | 0.4548 | 0.2566 | 0.2593 | -1.0 | -1.0 | 0.1437 | 0.6229 | 0.6292 | 0.6292 | -1.0 | -1.0 | 0.4009 | 0.775 | -1.0 | -1.0 | 0.3272 | 0.7667 | 0.0758 | 0.6 | 0.2335 | 0.375 |
+ | No log | 26.0 | 208 | 1.0898 | 0.2489 | 0.4548 | 0.257 | 0.2489 | -1.0 | -1.0 | 0.1521 | 0.6271 | 0.6396 | 0.6396 | -1.0 | -1.0 | 0.3803 | 0.75 | -1.0 | -1.0 | 0.3077 | 0.7333 | 0.0803 | 0.7 | 0.2275 | 0.375 |
+ | No log | 27.0 | 216 | 1.0802 | 0.2524 | 0.4632 | 0.257 | 0.2524 | -1.0 | -1.0 | 0.1521 | 0.6333 | 0.6396 | 0.6396 | -1.0 | -1.0 | 0.3818 | 0.75 | -1.0 | -1.0 | 0.3077 | 0.7333 | 0.0936 | 0.7 | 0.2264 | 0.375 |
+ | No log | 28.0 | 224 | 1.0753 | 0.2503 | 0.4526 | 0.2548 | 0.2503 | -1.0 | -1.0 | 0.1521 | 0.6417 | 0.6542 | 0.6542 | -1.0 | -1.0 | 0.3811 | 0.775 | -1.0 | -1.0 | 0.3093 | 0.7667 | 0.0803 | 0.7 | 0.2304 | 0.375 |
+ | No log | 29.0 | 232 | 1.0727 | 0.2517 | 0.4548 | 0.257 | 0.2517 | -1.0 | -1.0 | 0.1521 | 0.6417 | 0.6542 | 0.6542 | -1.0 | -1.0 | 0.3868 | 0.775 | -1.0 | -1.0 | 0.3093 | 0.7667 | 0.0803 | 0.7 | 0.2304 | 0.375 |
+ | No log | 30.0 | 240 | 1.0723 | 0.2517 | 0.4548 | 0.257 | 0.2517 | -1.0 | -1.0 | 0.1521 | 0.6417 | 0.6542 | 0.6542 | -1.0 | -1.0 | 0.3868 | 0.775 | -1.0 | -1.0 | 0.3093 | 0.7667 | 0.0803 | 0.7 | 0.2304 | 0.375 |
 
 
  ### Framework versions
 
+ - Transformers 4.48.0.dev0
  - Pytorch 2.5.1+cu124
  - Datasets 3.2.0
  - Tokenizers 0.21.0
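
A quick consistency check on the figures in this README diff: in both versions, the overall `Map` equals the unweighted mean of the per-class `Map` values once entries of `-1.0` are skipped. The sketch below assumes `-1.0` is a sentinel meaning "no data for this class or size bucket", a common convention in COCO-style evaluators; the source does not name the evaluator, and `mean_valid` is a hypothetical helper written for illustration.

```python
# Per-class Map values copied from the old (-) and new (+) sides of the diff.
# Assumption: -1.0 marks a class with nothing to evaluate, so it is skipped.
OLD = {"Left": 0.1364, "Right": 0.1038, "Up": 0.1453, "Down": 0.0569, "?": 0.2029}
NEW = {"Left": 0.3868, "Right": -1.0, "Up": 0.3093, "Down": 0.0803, "?": 0.2304}


def mean_valid(per_class, sentinel=-1.0):
    """Average per-class scores, ignoring sentinel entries."""
    valid = [v for v in per_class.values() if v != sentinel]
    if not valid:
        return sentinel  # nothing to average: propagate the sentinel
    return sum(valid) / len(valid)


old_map = mean_valid(OLD)  # compare with "Map: 0.1291" on the old side
new_map = mean_valid(NEW)  # compare with "Map: 0.2517" on the new side
print(f"old {old_map:.4f} | new {new_map:.4f} | delta {new_map - old_map:+.4f}")
```

Because `Right` is `-1.0` in the new results, the two evaluations may not cover the same classes, so the delta should be read with caution. The `Mar 100` values follow the same averaging pattern.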
model.safetensors CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:3b2fe8296e0e3add8feee703780ffb69d39bd8fd8d04b8cf86a407e80660e86b
  size 174079796
 
  version https://git-lfs.github.com/spec/v1
+ oid sha256:395b04b33f33c3c1e2e659c0b2a0555dc05f631f9ce696e5415387d02761c05c
  size 174079796
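
The `model.safetensors` entry above is a Git LFS pointer: only the `oid` changed, while `size` stayed at 174079796 bytes. A minimal sketch of how a downloaded file could be checked against such a pointer, assuming the three-line pointer layout shown in the diff; `parse_lfs_pointer` and `matches_pointer` are hypothetical helpers, not part of any Git LFS tooling.

```python
import hashlib

# Pointer text for the new side of the diff (diff markers removed).
POINTER = """version https://git-lfs.github.com/spec/v1
oid sha256:395b04b33f33c3c1e2e659c0b2a0555dc05f631f9ce696e5415387d02761c05c
size 174079796
"""


def parse_lfs_pointer(text):
    """Split a pointer file into version, hash algorithm, digest and size."""
    fields = {}
    for line in text.strip().splitlines():
        key, _, value = line.strip().partition(" ")
        fields[key] = value
    algo, _, digest = fields["oid"].partition(":")
    return {
        "version": fields["version"],
        "algo": algo,
        "digest": digest,
        "size": int(fields["size"]),
    }


def matches_pointer(data, pointer):
    """Return True if the bytes have the pointer's size and SHA-256 digest."""
    if pointer["algo"] != "sha256":
        raise ValueError(f"unsupported hash algorithm: {pointer['algo']}")
    return (
        len(data) == pointer["size"]
        and hashlib.sha256(data).hexdigest() == pointer["digest"]
    )


pointer = parse_lfs_pointer(POINTER)
print(pointer["algo"], pointer["size"])
```

For a file of this size, hashing in chunks from disk would be preferable to loading everything into memory; the in-memory version is kept here for brevity.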
runs/Dec27_15-12-19_ip-10-90-0-154/events.out.tfevents.1735312340.ip-10-90-0-154.54078.0 CHANGED
@@ -1,3 +1,3 @@
  version https://git-lfs.github.com/spec/v1
- oid sha256:fb1b38095698c57dc2e81e4d63ffca40cfe0b9ffd0ad9131aa41ec12d3bd0f6a
- size 46697
 
  version https://git-lfs.github.com/spec/v1
+ oid sha256:078e737ab77ebda6d4974895dbd9f50cefd5c44d2f9918fd9bd3c630e6308f3c
+ size 48471
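
The per-epoch rows in the README table earlier in this commit make it possible to check which epoch the reported results correspond to. A minimal sketch, assuming only the first five columns (Training Loss, Epoch, Step, Validation Loss, Map) of three rows copied from the new table; `parse_rows` is a hypothetical helper, not part of any library.

```python
# Three rows from the new (+) table, truncated to the first five columns.
ROWS = """
| No log | 25.0 | 200 | 1.0755 | 0.2593 |
| No log | 27.0 | 216 | 1.0802 | 0.2524 |
| No log | 30.0 | 240 | 1.0723 | 0.2517 |
"""


def parse_rows(markdown):
    """Turn pipe-delimited table rows into dicts of numeric fields."""
    rows = []
    for line in markdown.strip().splitlines():
        cells = [c.strip() for c in line.strip().strip("|").split("|")]
        rows.append(
            {
                "epoch": float(cells[1]),
                "step": int(cells[2]),
                "val_loss": float(cells[3]),
                "map": float(cells[4]),
            }
        )
    return rows


rows = parse_rows(ROWS)
best_map = max(rows, key=lambda r: r["map"])        # highest Map
best_loss = min(rows, key=lambda r: r["val_loss"])  # lowest validation loss
print("best Map at epoch", best_map["epoch"])
print("lowest validation loss at epoch", best_loss["epoch"])
```

Across the full new table, epoch 25 has the highest `Map` (0.2593), while the reported results match the final epoch 30 row, which has the lowest validation loss (1.0723).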