End of training

Browse files

Files changed (7) hide show

README.md +35 -43
model.safetensors +1 -1
runs/Apr04_09-19-12_76de971d69be/events.out.tfevents.1712222357.76de971d69be.181.0 +3 -0
runs/Apr04_09-19-12_76de971d69be/events.out.tfevents.1712232761.76de971d69be.181.1 +3 -0
runs/Apr04_12-24-34_76de971d69be/events.out.tfevents.1712233479.76de971d69be.181.2 +3 -0
runs/Apr04_12-24-34_76de971d69be/events.out.tfevents.1712240723.76de971d69be.181.3 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -20,11 +20,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert-base-cased](https://huggingface.co/distilbert-base-cased) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.0027
-- Precision: 0.8141
-- Recall: 0.8067
-- F1: 0.8073
-- Accuracy: 0.8067
 ## Model description
@@ -49,49 +49,41 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 5
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
-| 1.9463        | 0.14  | 30   | 1.8631          | 0.1245    | 0.1625 | 0.0819 | 0.1625   |
-| 1.7589        | 0.27  | 60   | 1.4567          | 0.4725    | 0.5098 | 0.4483 | 0.5098   |
-| 1.389         | 0.41  | 90   | 1.2228          | 0.6230    | 0.5714 | 0.5547 | 0.5714   |
-| 1.2009        | 0.54  | 120  | 1.0306          | 0.7264    | 0.6835 | 0.6666 | 0.6835   |
-| 1.0999        | 0.68  | 150  | 0.8052          | 0.7808    | 0.7647 | 0.7625 | 0.7647   |
-| 0.8848        | 0.81  | 180  | 0.7826          | 0.7499    | 0.7283 | 0.7191 | 0.7283   |
-| 0.685         | 0.95  | 210  | 0.7337          | 0.7765    | 0.7591 | 0.7587 | 0.7591   |
-| 0.5562        | 1.08  | 240  | 0.6653          | 0.7897    | 0.7871 | 0.7863 | 0.7871   |
-| 0.4662        | 1.22  | 270  | 0.7158          | 0.7895    | 0.7535 | 0.7539 | 0.7535   |
-| 0.3985        | 1.35  | 300  | 0.6552          | 0.8160    | 0.8011 | 0.8024 | 0.8011   |
-| 0.317         | 1.49  | 330  | 0.7378          | 0.7902    | 0.7843 | 0.7836 | 0.7843   |
-| 0.4177        | 1.62  | 360  | 0.6983          | 0.8085    | 0.8039 | 0.8028 | 0.8039   |
-| 0.383         | 1.76  | 390  | 0.7612          | 0.7979    | 0.7759 | 0.7640 | 0.7759   |
-| 0.2906        | 1.89  | 420  | 0.7369          | 0.7914    | 0.7759 | 0.7761 | 0.7759   |
-| 0.3305        | 2.03  | 450  | 0.7302          | 0.7904    | 0.7787 | 0.7791 | 0.7787   |
-| 0.1398        | 2.16  | 480  | 0.7798          | 0.8169    | 0.8095 | 0.8084 | 0.8095   |
-| 0.0988        | 2.3   | 510  | 0.9284          | 0.7902    | 0.7815 | 0.7799 | 0.7815   |
-| 0.1449        | 2.43  | 540  | 0.8863          | 0.8196    | 0.8123 | 0.8133 | 0.8123   |
-| 0.2552        | 2.57  | 570  | 0.8396          | 0.8227    | 0.8179 | 0.8177 | 0.8179   |
-| 0.1616        | 2.7   | 600  | 0.8182          | 0.8172    | 0.8123 | 0.8128 | 0.8123   |
-| 0.2163        | 2.84  | 630  | 0.8075          | 0.8031    | 0.7983 | 0.7994 | 0.7983   |
-| 0.2134        | 2.97  | 660  | 0.9430          | 0.8190    | 0.8067 | 0.8080 | 0.8067   |
-| 0.1255        | 3.11  | 690  | 0.8907          | 0.8166    | 0.8123 | 0.8116 | 0.8123   |
-| 0.0969        | 3.24  | 720  | 0.8805          | 0.8009    | 0.7983 | 0.7977 | 0.7983   |
-| 0.0649        | 3.38  | 750  | 0.9065          | 0.7957    | 0.7843 | 0.7846 | 0.7843   |
-| 0.0328        | 3.51  | 780  | 0.9083          | 0.8141    | 0.8095 | 0.8093 | 0.8095   |
-| 0.0274        | 3.65  | 810  | 0.8894          | 0.8096    | 0.8011 | 0.8011 | 0.8011   |
-| 0.0906        | 3.78  | 840  | 0.9425          | 0.8166    | 0.8095 | 0.8101 | 0.8095   |
-| 0.0906        | 3.92  | 870  | 0.9333          | 0.8066    | 0.8011 | 0.8011 | 0.8011   |
-| 0.0641        | 4.05  | 900  | 0.9052          | 0.8108    | 0.8067 | 0.8063 | 0.8067   |
-| 0.0246        | 4.19  | 930  | 0.9993          | 0.8017    | 0.7955 | 0.7946 | 0.7955   |
-| 0.0551        | 4.32  | 960  | 0.9899          | 0.8174    | 0.8123 | 0.8122 | 0.8123   |
-| 0.0084        | 4.46  | 990  | 0.9954          | 0.8127    | 0.8067 | 0.8066 | 0.8067   |
-| 0.0049        | 4.59  | 1020 | 0.9912          | 0.8145    | 0.8095 | 0.8093 | 0.8095   |
-| 0.0217        | 4.73  | 1050 | 0.9957          | 0.8128    | 0.8067 | 0.8067 | 0.8067   |
-| 0.0144        | 4.86  | 1080 | 1.0042          | 0.8164    | 0.8095 | 0.8100 | 0.8095   |
-| 0.0276        | 5.0   | 1110 | 1.0027          | 0.8141    | 0.8067 | 0.8073 | 0.8067   |
 ### Framework versions

 This model is a fine-tuned version of [distilbert-base-cased](https://huggingface.co/distilbert-base-cased) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.8043
+- Precision: 0.8432
+- Recall: 0.8375
+- F1: 0.8381
+- Accuracy: 0.8375
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 4
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Precision | Recall | F1     | Accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:---------:|:------:|:------:|:--------:|
+| 1.9232        | 0.14  | 30   | 1.8330          | 0.2755    | 0.2773 | 0.2340 | 0.2773   |
+| 1.7293        | 0.27  | 60   | 1.4729          | 0.3588    | 0.3613 | 0.2487 | 0.3613   |
+| 1.3897        | 0.41  | 90   | 1.2344          | 0.6697    | 0.5238 | 0.4653 | 0.5238   |
+| 1.2399        | 0.54  | 120  | 1.1505          | 0.6705    | 0.6106 | 0.5897 | 0.6106   |
+| 1.1299        | 0.68  | 150  | 0.8937          | 0.7178    | 0.7087 | 0.7062 | 0.7087   |
+| 0.9878        | 0.81  | 180  | 0.8656          | 0.7067    | 0.6583 | 0.6466 | 0.6583   |
+| 0.7844        | 0.95  | 210  | 0.7538          | 0.7501    | 0.7339 | 0.7321 | 0.7339   |
+| 0.5865        | 1.08  | 240  | 0.7162          | 0.7628    | 0.7563 | 0.7559 | 0.7563   |
+| 0.4725        | 1.22  | 270  | 0.7242          | 0.8196    | 0.7815 | 0.7836 | 0.7815   |
+| 0.4168        | 1.35  | 300  | 0.6477          | 0.8091    | 0.7983 | 0.8001 | 0.7983   |
+| 0.3725        | 1.49  | 330  | 0.5628          | 0.7972    | 0.7871 | 0.7872 | 0.7871   |
+| 0.3664        | 1.62  | 360  | 0.6316          | 0.8052    | 0.7955 | 0.7957 | 0.7955   |
+| 0.3654        | 1.76  | 390  | 0.6254          | 0.8246    | 0.8179 | 0.8177 | 0.8179   |
+| 0.2986        | 1.89  | 420  | 0.6129          | 0.8150    | 0.8095 | 0.8098 | 0.8095   |
+| 0.2652        | 2.03  | 450  | 0.6471          | 0.8190    | 0.8151 | 0.8151 | 0.8151   |
+| 0.1143        | 2.16  | 480  | 0.6956          | 0.8349    | 0.8291 | 0.8262 | 0.8291   |
+| 0.0961        | 2.3   | 510  | 0.7992          | 0.8205    | 0.8179 | 0.8170 | 0.8179   |
+| 0.1593        | 2.43  | 540  | 0.7508          | 0.8296    | 0.8207 | 0.8210 | 0.8207   |
+| 0.1486        | 2.57  | 570  | 0.7732          | 0.8262    | 0.8207 | 0.8203 | 0.8207   |
+| 0.1515        | 2.7   | 600  | 0.7413          | 0.8362    | 0.8319 | 0.8321 | 0.8319   |
+| 0.0922        | 2.84  | 630  | 0.7168          | 0.8416    | 0.8375 | 0.8375 | 0.8375   |
+| 0.1195        | 2.97  | 660  | 0.7461          | 0.8436    | 0.8347 | 0.8357 | 0.8347   |
+| 0.0882        | 3.11  | 690  | 0.7472          | 0.8404    | 0.8319 | 0.8321 | 0.8319   |
+| 0.0573        | 3.24  | 720  | 0.7631          | 0.8409    | 0.8347 | 0.8356 | 0.8347   |
+| 0.0284        | 3.38  | 750  | 0.7559          | 0.8346    | 0.8319 | 0.8321 | 0.8319   |
+| 0.0307        | 3.51  | 780  | 0.7669          | 0.8425    | 0.8375 | 0.8379 | 0.8375   |
+| 0.0225        | 3.65  | 810  | 0.7827          | 0.8428    | 0.8375 | 0.8380 | 0.8375   |
+| 0.0512        | 3.78  | 840  | 0.8073          | 0.8444    | 0.8375 | 0.8381 | 0.8375   |
+| 0.0261        | 3.92  | 870  | 0.8061          | 0.8412    | 0.8347 | 0.8354 | 0.8347   |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:098e20c330fc9c6fc9e29ac4e68724325b0ddab6f2c605c40292d0121e0fc192
 size 263160068

 version https://git-lfs.github.com/spec/v1
+oid sha256:15b9b92c2440804696580d734aa95264256ffcbe9b8a2cbbd3c05794c088356f
 size 263160068

runs/Apr04_09-19-12_76de971d69be/events.out.tfevents.1712222357.76de971d69be.181.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:7e1978cf3934538aeda87bb5e4b38728e9574d92c09e69aaf0dc1675a322d46d
+size 41372

runs/Apr04_09-19-12_76de971d69be/events.out.tfevents.1712232761.76de971d69be.181.1 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:da42418c5767d04c420e265e35219fc348f34478cddc46c05e1872d1864523fa
+size 6658

runs/Apr04_12-24-34_76de971d69be/events.out.tfevents.1712233479.76de971d69be.181.2 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:cac58620435d90dea9fccce64369cce72e8bc73f51f9483da6fd2cecf790a4ea
+size 24885

runs/Apr04_12-24-34_76de971d69be/events.out.tfevents.1712240723.76de971d69be.181.3 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:692d6819253e4506c2eeb7ab2ea57c1e500b8e0a7b2ccd3fda24701bdc4b7c28
+size 560

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c564b281f4d86a09a862aa9e6561baff85bf27abedcee7201f2b7210f26de2eb
 size 4920

 version https://git-lfs.github.com/spec/v1
+oid sha256:159cbf29cd5655c579352d1e69c14f7142e43d982408798542460577a13cd0ef
 size 4920