flant5-tuned-15-warmup

This model is a fine-tuned version of google/flan-t5-small on the None dataset. It achieves the following results on the evaluation set:

  • Loss: 1.2355

Model description

More information needed

Intended uses & limitations

More information needed

Training and evaluation data

More information needed

Training procedure

Training hyperparameters

The following hyperparameters were used during training:

  • learning_rate: 0.0001
  • train_batch_size: 8
  • eval_batch_size: 8
  • seed: 42
  • optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
  • lr_scheduler_type: linear
  • lr_scheduler_warmup_steps: 41
  • num_epochs: 15

Training results

Training Loss Epoch Step Validation Loss
1.9667 0.02 1 1.6464
2.4545 0.04 2 1.6443
2.4361 0.06 3 1.6400
2.1706 0.09 4 1.6340
1.8817 0.11 5 1.6263
1.7807 0.13 6 1.6177
1.948 0.15 7 1.6087
2.105 0.17 8 1.5986
2.3075 0.19 9 1.5886
2.2806 0.21 10 1.5780
1.7889 0.23 11 1.5675
2.0699 0.26 12 1.5571
1.6099 0.28 13 1.5468
1.739 0.3 14 1.5357
1.8916 0.32 15 1.5257
1.709 0.34 16 1.5157
1.6401 0.36 17 1.5061
2.1285 0.38 18 1.4964
1.9514 0.4 19 1.4866
1.8444 0.43 20 1.4766
1.5476 0.45 21 1.4668
1.6058 0.47 22 1.4559
1.4958 0.49 23 1.4449
1.6716 0.51 24 1.4328
1.3041 0.53 25 1.4220
2.3662 0.55 26 1.4107
2.0195 0.57 27 1.3997
1.5151 0.6 28 1.3897
2.0098 0.62 29 1.3796
2.232 0.64 30 1.3703
1.352 0.66 31 1.3615
1.7479 0.68 32 1.3532
1.756 0.7 33 1.3458
1.7509 0.72 34 1.3387
1.888 0.74 35 1.3317
1.5797 0.77 36 1.3256
1.74 0.79 37 1.3201
1.6288 0.81 38 1.3151
1.7669 0.83 39 1.3099
2.101 0.85 40 1.3045
1.322 0.87 41 1.3004
1.2121 0.89 42 1.2976
1.6025 0.91 43 1.2954
1.3643 0.94 44 1.2930
1.6701 0.96 45 1.2917
1.4029 0.98 46 1.2914
2.7487 1.0 47 1.2906
1.3429 1.02 48 1.2902
1.1366 1.04 49 1.2909
1.1554 1.06 50 1.2927
1.7526 1.09 51 1.2928
1.474 1.11 52 1.2910
1.863 1.13 53 1.2876
1.9122 1.15 54 1.2837
1.4084 1.17 55 1.2787
1.7607 1.19 56 1.2731
1.6029 1.21 57 1.2669
1.0863 1.23 58 1.2620
1.4262 1.26 59 1.2576
1.0527 1.28 60 1.2546
1.6006 1.3 61 1.2508
1.6631 1.32 62 1.2468
1.6574 1.34 63 1.2432
1.5513 1.36 64 1.2399
1.5335 1.38 65 1.2368
1.6789 1.4 66 1.2332
1.5144 1.43 67 1.2298
1.3471 1.45 68 1.2276
1.8718 1.47 69 1.2255
1.3126 1.49 70 1.2241
1.5729 1.51 71 1.2229
1.7027 1.53 72 1.2216
1.4672 1.55 73 1.2204
1.5982 1.57 74 1.2193
1.0943 1.6 75 1.2197
1.5396 1.62 76 1.2201
1.7971 1.64 77 1.2200
1.5562 1.66 78 1.2201
1.8446 1.68 79 1.2199
1.4402 1.7 80 1.2189
1.4202 1.72 81 1.2189
1.2645 1.74 82 1.2191
1.7016 1.77 83 1.2192
1.454 1.79 84 1.2198
1.229 1.81 85 1.2212
1.6632 1.83 86 1.2221
1.7068 1.85 87 1.2227
0.9776 1.87 88 1.2246
1.6608 1.89 89 1.2263
1.7359 1.91 90 1.2270
1.2871 1.94 91 1.2275
1.0407 1.96 92 1.2286
1.3798 1.98 93 1.2290
1.0639 2.0 94 1.2296
1.3378 2.02 95 1.2293
1.6735 2.04 96 1.2289
1.5583 2.06 97 1.2281
1.6152 2.09 98 1.2269
1.274 2.11 99 1.2246
1.6458 2.13 100 1.2219
1.0769 2.15 101 1.2204
1.3137 2.17 102 1.2181
1.2746 2.19 103 1.2165
1.3584 2.21 104 1.2153
1.3399 2.23 105 1.2144
1.1962 2.26 106 1.2134
1.4782 2.28 107 1.2118
1.0676 2.3 108 1.2109
1.3653 2.32 109 1.2104
1.4322 2.34 110 1.2092
1.3757 2.36 111 1.2085
0.8468 2.38 112 1.2088
1.4621 2.4 113 1.2084
1.2588 2.43 114 1.2086
1.0934 2.45 115 1.2089
1.1923 2.47 116 1.2100
1.2989 2.49 117 1.2105
1.4685 2.51 118 1.2099
1.5793 2.53 119 1.2094
1.5058 2.55 120 1.2087
1.3849 2.57 121 1.2085
1.7642 2.6 122 1.2074
1.2858 2.62 123 1.2062
1.0106 2.64 124 1.2057
1.5447 2.66 125 1.2048
1.3997 2.68 126 1.2045
1.2507 2.7 127 1.2048
1.0845 2.72 128 1.2050
1.3003 2.74 129 1.2057
1.5281 2.77 130 1.2073
1.0856 2.79 131 1.2091
1.5316 2.81 132 1.2104
1.5002 2.83 133 1.2113
1.2596 2.85 134 1.2119
1.3849 2.87 135 1.2123
1.6906 2.89 136 1.2117
1.5706 2.91 137 1.2100
1.623 2.94 138 1.2073
1.5116 2.96 139 1.2045
1.0376 2.98 140 1.2024
1.7914 3.0 141 1.2000
1.3306 3.02 142 1.1977
1.1926 3.04 143 1.1956
1.3087 3.06 144 1.1938
1.1353 3.09 145 1.1921
1.5161 3.11 146 1.1897
1.1842 3.13 147 1.1876
1.407 3.15 148 1.1856
1.213 3.17 149 1.1842
1.5212 3.19 150 1.1831
1.4898 3.21 151 1.1818
1.4041 3.23 152 1.1806
1.4567 3.26 153 1.1794
1.3061 3.28 154 1.1779
1.4033 3.3 155 1.1768
1.3228 3.32 156 1.1762
1.3397 3.34 157 1.1761
1.0637 3.36 158 1.1762
1.1256 3.38 159 1.1772
1.1232 3.4 160 1.1785
1.2295 3.43 161 1.1797
1.3916 3.45 162 1.1811
1.4738 3.47 163 1.1822
1.2466 3.49 164 1.1831
1.4013 3.51 165 1.1834
1.0512 3.53 166 1.1839
0.9767 3.55 167 1.1846
1.1432 3.57 168 1.1849
1.2935 3.6 169 1.1855
1.2865 3.62 170 1.1859
1.487 3.64 171 1.1859
1.3381 3.66 172 1.1856
1.2626 3.68 173 1.1855
1.2248 3.7 174 1.1850
0.9318 3.72 175 1.1859
1.3223 3.74 176 1.1867
1.3226 3.77 177 1.1880
1.3931 3.79 178 1.1891
1.2752 3.81 179 1.1904
1.047 3.83 180 1.1924
1.1886 3.85 181 1.1944
1.2285 3.87 182 1.1961
0.8821 3.89 183 1.1989
1.1705 3.91 184 1.2014
1.4242 3.94 185 1.2031
1.2853 3.96 186 1.2044
1.3986 3.98 187 1.2044
0.995 4.0 188 1.2040
1.4626 4.02 189 1.2036
1.2084 4.04 190 1.2029
1.2624 4.06 191 1.2020
1.0631 4.09 192 1.2006
1.0948 4.11 193 1.1984
1.1124 4.13 194 1.1961
1.493 4.15 195 1.1931
1.1308 4.17 196 1.1906
0.9889 4.19 197 1.1888
0.9871 4.21 198 1.1880
1.3625 4.23 199 1.1874
1.0979 4.26 200 1.1868
1.1923 4.28 201 1.1867
1.2575 4.3 202 1.1863
1.0115 4.32 203 1.1861
1.2956 4.34 204 1.1857
1.3653 4.36 205 1.1856
0.9581 4.38 206 1.1864
0.9203 4.4 207 1.1876
1.1975 4.43 208 1.1892
1.1205 4.45 209 1.1908
1.0527 4.47 210 1.1924
1.5056 4.49 211 1.1935
1.1822 4.51 212 1.1946
1.4337 4.53 213 1.1948
1.0364 4.55 214 1.1954
1.4977 4.57 215 1.1948
1.5055 4.6 216 1.1932
1.0352 4.62 217 1.1924
1.084 4.64 218 1.1922
1.1441 4.66 219 1.1911
1.4411 4.68 220 1.1894
1.6655 4.7 221 1.1873
1.1853 4.72 222 1.1860
1.3747 4.74 223 1.1841
1.1934 4.77 224 1.1831
1.2477 4.79 225 1.1824
1.3121 4.81 226 1.1817
1.1818 4.83 227 1.1809
0.9407 4.85 228 1.1804
1.303 4.87 229 1.1792
1.0454 4.89 230 1.1779
1.2141 4.91 231 1.1770
1.0324 4.94 232 1.1763
1.4047 4.96 233 1.1754
0.9807 4.98 234 1.1755
1.0778 5.0 235 1.1756
1.2506 5.02 236 1.1762
1.2868 5.04 237 1.1769
1.1988 5.06 238 1.1774
1.3789 5.09 239 1.1774
1.3524 5.11 240 1.1778
1.2539 5.13 241 1.1782
0.9897 5.15 242 1.1789
0.8569 5.17 243 1.1794
1.0817 5.19 244 1.1805
1.0787 5.21 245 1.1814
1.2513 5.23 246 1.1823
0.9568 5.26 247 1.1830
1.4582 5.28 248 1.1833
0.937 5.3 249 1.1840
1.0291 5.32 250 1.1855
1.0047 5.34 251 1.1867
1.2426 5.36 252 1.1879
1.1143 5.38 253 1.1895
0.9608 5.4 254 1.1905
0.8095 5.43 255 1.1909
1.3595 5.45 256 1.1911
1.0627 5.47 257 1.1911
0.9898 5.49 258 1.1909
0.9931 5.51 259 1.1906
1.1374 5.53 260 1.1898
1.3457 5.55 261 1.1885
1.2375 5.57 262 1.1875
1.4788 5.6 263 1.1865
1.2088 5.62 264 1.1857
1.266 5.64 265 1.1847
0.9349 5.66 266 1.1849
0.726 5.68 267 1.1858
1.002 5.7 268 1.1867
1.3502 5.72 269 1.1876
1.0362 5.74 270 1.1886
1.2812 5.77 271 1.1887
0.8987 5.79 272 1.1893
1.044 5.81 273 1.1903
1.0796 5.83 274 1.1909
1.1613 5.85 275 1.1912
0.9256 5.87 276 1.1915
1.0668 5.89 277 1.1916
0.8857 5.91 278 1.1913
1.2915 5.94 279 1.1905
1.1701 5.96 280 1.1897
0.8379 5.98 281 1.1894
0.8857 6.0 282 1.1893
1.0643 6.02 283 1.1892
0.6472 6.04 284 1.1893
0.9508 6.06 285 1.1893
1.2486 6.09 286 1.1895
1.0606 6.11 287 1.1901
1.2126 6.13 288 1.1905
1.0849 6.15 289 1.1911
0.9876 6.17 290 1.1922
1.0195 6.19 291 1.1932
1.1427 6.21 292 1.1937
0.975 6.23 293 1.1942
0.7155 6.26 294 1.1952
1.2893 6.28 295 1.1962
0.7985 6.3 296 1.1974
1.0161 6.32 297 1.1991
0.9194 6.34 298 1.2017
0.9292 6.36 299 1.2041
1.0484 6.38 300 1.2064
1.2873 6.4 301 1.2081
1.0052 6.43 302 1.2100
1.0546 6.45 303 1.2117
1.0157 6.47 304 1.2131
0.9218 6.49 305 1.2142
0.801 6.51 306 1.2155
1.0598 6.53 307 1.2165
0.9959 6.55 308 1.2176
1.087 6.57 309 1.2189
1.4722 6.6 310 1.2192
0.9014 6.62 311 1.2196
1.0947 6.64 312 1.2200
1.0382 6.66 313 1.2198
0.858 6.68 314 1.2192
0.9865 6.7 315 1.2184
1.2978 6.72 316 1.2169
1.0321 6.74 317 1.2157
1.0643 6.77 318 1.2143
1.2922 6.79 319 1.2123
0.9979 6.81 320 1.2108
0.9959 6.83 321 1.2092
0.8715 6.85 322 1.2084
0.9744 6.87 323 1.2079
1.2371 6.89 324 1.2073
1.1729 6.91 325 1.2062
1.3964 6.94 326 1.2044
0.9163 6.96 327 1.2029
1.5975 6.98 328 1.2010
1.0804 7.0 329 1.1995
1.0323 7.02 330 1.1991
0.9406 7.04 331 1.1989
1.0921 7.06 332 1.1985
1.0013 7.09 333 1.1985
0.9852 7.11 334 1.1986
1.1076 7.13 335 1.1984
0.9244 7.15 336 1.1986
1.1296 7.17 337 1.1988
1.0143 7.19 338 1.1990
1.1623 7.21 339 1.1994
0.9863 7.23 340 1.2004
1.1909 7.26 341 1.2011
0.6969 7.28 342 1.2022
1.2884 7.3 343 1.2028
1.0522 7.32 344 1.2033
1.0197 7.34 345 1.2038
1.0357 7.36 346 1.2047
1.3089 7.38 347 1.2053
0.8648 7.4 348 1.2064
1.2338 7.43 349 1.2067
1.3109 7.45 350 1.2066
0.9705 7.47 351 1.2067
0.9489 7.49 352 1.2068
0.8943 7.51 353 1.2064
0.8248 7.53 354 1.2062
1.0269 7.55 355 1.2058
1.0779 7.57 356 1.2052
1.0896 7.6 357 1.2048
0.8612 7.62 358 1.2042
1.1585 7.64 359 1.2039
1.2301 7.66 360 1.2029
1.0214 7.68 361 1.2020
0.8139 7.7 362 1.2011
0.7917 7.72 363 1.2007
1.1115 7.74 364 1.2003
0.6928 7.77 365 1.2002
1.102 7.79 366 1.1998
1.0183 7.81 367 1.2000
0.8946 7.83 368 1.2000
0.8605 7.85 369 1.2006
1.0517 7.87 370 1.2007
1.0663 7.89 371 1.2006
1.0523 7.91 372 1.2010
1.2151 7.94 373 1.2013
1.0897 7.96 374 1.2013
1.1518 7.98 375 1.2013
1.3891 8.0 376 1.2017
1.3617 8.02 377 1.2015
1.0901 8.04 378 1.2018
1.5281 8.06 379 1.2015
0.7301 8.09 380 1.2013
0.95 8.11 381 1.2011
1.0097 8.13 382 1.2008
1.0254 8.15 383 1.2004
1.0425 8.17 384 1.2000
0.7891 8.19 385 1.2006
0.5937 8.21 386 1.2016
1.2323 8.23 387 1.2026
0.7101 8.26 388 1.2038
0.9621 8.28 389 1.2046
1.0163 8.3 390 1.2052
0.9164 8.32 391 1.2057
1.0047 8.34 392 1.2056
1.206 8.36 393 1.2052
0.8669 8.38 394 1.2052
1.3794 8.4 395 1.2049
0.9991 8.43 396 1.2047
0.9352 8.45 397 1.2047
0.7621 8.47 398 1.2048
0.9343 8.49 399 1.2055
1.004 8.51 400 1.2058
0.8509 8.53 401 1.2063
1.038 8.55 402 1.2067
0.8357 8.57 403 1.2073
1.1329 8.6 404 1.2075
0.6185 8.62 405 1.2082
1.1308 8.64 406 1.2084
1.2186 8.66 407 1.2080
0.9866 8.68 408 1.2077
1.1335 8.7 409 1.2079
0.6568 8.72 410 1.2089
1.0745 8.74 411 1.2098
1.2385 8.77 412 1.2100
0.9554 8.79 413 1.2109
0.7232 8.81 414 1.2117
1.0016 8.83 415 1.2118
1.0813 8.85 416 1.2117
0.9908 8.87 417 1.2110
1.2274 8.89 418 1.2101
0.9004 8.91 419 1.2092
0.8995 8.94 420 1.2082
0.6937 8.96 421 1.2073
1.0519 8.98 422 1.2062
1.2531 9.0 423 1.2053
0.5019 9.02 424 1.2054
0.9939 9.04 425 1.2053
0.9925 9.06 426 1.2054
0.8055 9.09 427 1.2055
0.8666 9.11 428 1.2060
0.9951 9.13 429 1.2064
0.9366 9.15 430 1.2069
1.1513 9.17 431 1.2077
0.864 9.19 432 1.2086
1.0111 9.21 433 1.2089
0.8156 9.23 434 1.2095
0.6794 9.26 435 1.2104
1.024 9.28 436 1.2109
1.1593 9.3 437 1.2109
1.2502 9.32 438 1.2105
0.8281 9.34 439 1.2101
0.9008 9.36 440 1.2101
0.7181 9.38 441 1.2105
0.7282 9.4 442 1.2109
0.8809 9.43 443 1.2109
1.0362 9.45 444 1.2112
0.9121 9.47 445 1.2112
1.1075 9.49 446 1.2112
0.906 9.51 447 1.2114
1.0327 9.53 448 1.2119
1.2168 9.55 449 1.2120
0.6352 9.57 450 1.2126
0.8875 9.6 451 1.2135
0.8587 9.62 452 1.2142
0.9561 9.64 453 1.2150
1.0226 9.66 454 1.2158
1.3241 9.68 455 1.2163
1.0117 9.7 456 1.2171
0.7255 9.72 457 1.2177
1.1023 9.74 458 1.2180
1.2677 9.77 459 1.2178
0.8539 9.79 460 1.2173
0.9335 9.81 461 1.2166
0.7214 9.83 462 1.2160
1.0556 9.85 463 1.2153
1.0694 9.87 464 1.2140
1.1814 9.89 465 1.2127
1.1573 9.91 466 1.2115
0.9729 9.94 467 1.2103
0.8609 9.96 468 1.2096
0.9811 9.98 469 1.2092
0.8868 10.0 470 1.2086
0.8651 10.02 471 1.2088
0.8995 10.04 472 1.2088
1.1421 10.06 473 1.2093
0.944 10.09 474 1.2101
0.9218 10.11 475 1.2106
0.9569 10.13 476 1.2116
0.8649 10.15 477 1.2125
1.0783 10.17 478 1.2133
1.0356 10.19 479 1.2139
0.8099 10.21 480 1.2152
0.5889 10.23 481 1.2166
0.8428 10.26 482 1.2180
0.8419 10.28 483 1.2198
0.7744 10.3 484 1.2214
1.1157 10.32 485 1.2221
0.9793 10.34 486 1.2227
1.0079 10.36 487 1.2231
0.8489 10.38 488 1.2234
1.0579 10.4 489 1.2235
0.6212 10.43 490 1.2239
0.9997 10.45 491 1.2239
0.9023 10.47 492 1.2240
0.7276 10.49 493 1.2246
1.0537 10.51 494 1.2249
0.8226 10.53 495 1.2254
1.3604 10.55 496 1.2257
0.9817 10.57 497 1.2261
0.8109 10.6 498 1.2263
0.8237 10.62 499 1.2266
0.8102 10.64 500 1.2268
1.1175 10.66 501 1.2269
0.9518 10.68 502 1.2270
0.8294 10.7 503 1.2268
0.6542 10.72 504 1.2270
1.15 10.74 505 1.2268
0.847 10.77 506 1.2270
1.0546 10.79 507 1.2273
0.9229 10.81 508 1.2271
0.8002 10.83 509 1.2272
0.8821 10.85 510 1.2276
1.226 10.87 511 1.2273
0.703 10.89 512 1.2273
0.8306 10.91 513 1.2272
1.3012 10.94 514 1.2270
1.1099 10.96 515 1.2267
0.8719 10.98 516 1.2266
0.8791 11.0 517 1.2264
1.0088 11.02 518 1.2262
0.9005 11.04 519 1.2262
0.9958 11.06 520 1.2262
0.6769 11.09 521 1.2263
0.8875 11.11 522 1.2265
1.0043 11.13 523 1.2265
0.8179 11.15 524 1.2266
1.0228 11.17 525 1.2267
1.0861 11.19 526 1.2268
1.0958 11.21 527 1.2269
0.8243 11.23 528 1.2271
0.7919 11.26 529 1.2276
0.8028 11.28 530 1.2282
0.868 11.3 531 1.2289
1.0766 11.32 532 1.2295
0.9489 11.34 533 1.2296
1.2521 11.36 534 1.2293
0.8151 11.38 535 1.2292
0.8777 11.4 536 1.2293
0.9093 11.43 537 1.2294
0.8013 11.45 538 1.2295
0.8986 11.47 539 1.2297
0.8591 11.49 540 1.2299
0.9412 11.51 541 1.2298
1.0771 11.53 542 1.2297
0.9792 11.55 543 1.2293
0.8906 11.57 544 1.2289
0.9887 11.6 545 1.2284
1.0249 11.62 546 1.2276
1.0449 11.64 547 1.2269
1.0692 11.66 548 1.2262
1.1385 11.68 549 1.2256
0.9511 11.7 550 1.2250
0.9088 11.72 551 1.2244
0.5108 11.74 552 1.2242
0.8387 11.77 553 1.2243
0.5605 11.79 554 1.2247
0.8029 11.81 555 1.2250
0.9064 11.83 556 1.2252
1.0701 11.85 557 1.2255
0.7642 11.87 558 1.2260
0.9011 11.89 559 1.2265
0.6675 11.91 560 1.2271
0.7199 11.94 561 1.2278
0.8222 11.96 562 1.2282
1.2255 11.98 563 1.2286
0.6935 12.0 564 1.2292
0.9331 12.02 565 1.2296
0.9065 12.04 566 1.2297
1.2259 12.06 567 1.2293
0.7701 12.09 568 1.2289
0.785 12.11 569 1.2286
0.7013 12.13 570 1.2287
0.7481 12.15 571 1.2288
1.1147 12.17 572 1.2289
0.8022 12.19 573 1.2291
1.152 12.21 574 1.2292
0.7772 12.23 575 1.2293
1.1077 12.26 576 1.2293
0.9741 12.28 577 1.2293
0.6635 12.3 578 1.2296
0.7384 12.32 579 1.2299
0.7896 12.34 580 1.2301
0.8486 12.36 581 1.2303
0.7299 12.38 582 1.2306
1.2021 12.4 583 1.2307
0.8791 12.43 584 1.2309
0.9863 12.45 585 1.2311
0.5738 12.47 586 1.2314
0.8494 12.49 587 1.2318
0.8436 12.51 588 1.2320
0.8499 12.53 589 1.2322
0.8047 12.55 590 1.2323
0.6874 12.57 591 1.2324
0.9029 12.6 592 1.2326
1.0359 12.62 593 1.2325
0.8538 12.64 594 1.2323
0.9876 12.66 595 1.2322
0.8192 12.68 596 1.2321
1.1117 12.7 597 1.2321
1.0893 12.72 598 1.2323
1.0301 12.74 599 1.2323
0.8162 12.77 600 1.2325
0.8453 12.79 601 1.2326
0.4536 12.81 602 1.2330
0.9448 12.83 603 1.2331
1.1688 12.85 604 1.2331
0.802 12.87 605 1.2330
0.9937 12.89 606 1.2332
0.969 12.91 607 1.2330
0.817 12.94 608 1.2328
0.9496 12.96 609 1.2328
0.9989 12.98 610 1.2325
0.8977 13.0 611 1.2323
0.6483 13.02 612 1.2322
0.9165 13.04 613 1.2323
0.7105 13.06 614 1.2327
0.7978 13.09 615 1.2330
0.6971 13.11 616 1.2335
1.0446 13.13 617 1.2338
0.6923 13.15 618 1.2341
0.8159 13.17 619 1.2344
0.9534 13.19 620 1.2347
0.9893 13.21 621 1.2352
1.2735 13.23 622 1.2354
0.6668 13.26 623 1.2358
0.745 13.28 624 1.2361
0.6876 13.3 625 1.2366
0.6819 13.32 626 1.2369
1.0197 13.34 627 1.2370
0.5219 13.36 628 1.2373
0.7793 13.38 629 1.2376
0.8771 13.4 630 1.2378
0.779 13.43 631 1.2380
0.9322 13.45 632 1.2382
0.8594 13.47 633 1.2384
1.0583 13.49 634 1.2384
0.7861 13.51 635 1.2384
1.0326 13.53 636 1.2382
0.8159 13.55 637 1.2380
0.9854 13.57 638 1.2378
0.9686 13.6 639 1.2376
0.9824 13.62 640 1.2375
0.8299 13.64 641 1.2374
0.9476 13.66 642 1.2374
0.9941 13.68 643 1.2372
1.0541 13.7 644 1.2370
0.8441 13.72 645 1.2369
0.9255 13.74 646 1.2367
1.0689 13.77 647 1.2365
0.9323 13.79 648 1.2362
0.7558 13.81 649 1.2361
1.0473 13.83 650 1.2360
1.0349 13.85 651 1.2357
0.961 13.87 652 1.2355
0.9356 13.89 653 1.2353
0.8867 13.91 654 1.2353
0.5838 13.94 655 1.2353
0.9355 13.96 656 1.2353
1.015 13.98 657 1.2354
0.8472 14.0 658 1.2355
0.8122 14.02 659 1.2355
0.989 14.04 660 1.2355
0.6454 14.06 661 1.2356
1.0597 14.09 662 1.2356
1.0055 14.11 663 1.2357
0.6391 14.13 664 1.2356
1.0323 14.15 665 1.2355
0.891 14.17 666 1.2354
1.0657 14.19 667 1.2353
1.0912 14.21 668 1.2351
0.8337 14.23 669 1.2349
0.7615 14.26 670 1.2348
0.7749 14.28 671 1.2347
0.7914 14.3 672 1.2346
1.0347 14.32 673 1.2345
1.0402 14.34 674 1.2345
0.6305 14.36 675 1.2344
0.6122 14.38 676 1.2345
0.8261 14.4 677 1.2345
1.1074 14.43 678 1.2345
0.8629 14.45 679 1.2345
0.7628 14.47 680 1.2344
1.1052 14.49 681 1.2345
0.9245 14.51 682 1.2345
0.7653 14.53 683 1.2346
0.8255 14.55 684 1.2347
1.0361 14.57 685 1.2348
1.0867 14.6 686 1.2349
1.1457 14.62 687 1.2350
0.9392 14.64 688 1.2350
0.7818 14.66 689 1.2351
0.8908 14.68 690 1.2351
0.6478 14.7 691 1.2352
0.8556 14.72 692 1.2352
0.8901 14.74 693 1.2353
0.7234 14.77 694 1.2354
1.1695 14.79 695 1.2354
0.833 14.81 696 1.2354
0.596 14.83 697 1.2355
0.9776 14.85 698 1.2355
0.981 14.87 699 1.2355
0.8301 14.89 700 1.2355
0.6443 14.91 701 1.2355
0.8316 14.94 702 1.2355
1.0333 14.96 703 1.2355
0.9073 14.98 704 1.2355
0.9073 15.0 705 1.2355

Framework versions

  • Transformers 4.38.1
  • Pytorch 2.1.0+cu121
  • Datasets 2.17.0
  • Tokenizers 0.15.2
Downloads last month
17
Safetensors
Model size
77M params
Tensor type
F32
·
Inference Examples
This model does not have enough activity to be deployed to Inference API (serverless) yet. Increase its social visibility and check back later, or deploy to Inference Endpoints (dedicated) instead.

Model tree for mixtralyanis/flant5-tuned-15-warmup

Finetuned
(298)
this model