marieke93
/

MiniLM-evidence-types

@@ -16,11 +16,11 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [microsoft/MiniLM-L12-H384-uncased](https://huggingface.co/microsoft/MiniLM-L12-H384-uncased) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.3612
-- Macro f1: 0.1900
-- Weighted f1: 0.5901
-- Accuracy: 0.6499
-- Balanced accuracy: 0.2161
 ## Model description
@@ -39,20 +39,39 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 3e-05
-- train_batch_size: 32
-- eval_batch_size: 32
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 1
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Macro f1 | Weighted f1 | Accuracy | Balanced accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:-----------:|:--------:|:-----------------:|
-| 1.5           | 1.0   | 125  | 1.3612          | 0.1900   | 0.5901      | 0.6499   | 0.2161            |
 ### Framework versions

 This model is a fine-tuned version of [microsoft/MiniLM-L12-H384-uncased](https://huggingface.co/microsoft/MiniLM-L12-H384-uncased) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.8672
+- Macro f1: 0.3726
+- Weighted f1: 0.7030
+- Accuracy: 0.7161
+- Balanced accuracy: 0.3616
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 20
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Macro f1 | Weighted f1 | Accuracy | Balanced accuracy |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:-----------:|:--------:|:-----------------:|
+| 1.4106        | 1.0   | 250  | 1.2698          | 0.1966   | 0.6084      | 0.6735   | 0.2195            |
+| 1.1437        | 2.0   | 500  | 1.0985          | 0.3484   | 0.6914      | 0.7116   | 0.3536            |
+| 0.9714        | 3.0   | 750  | 1.0901          | 0.2606   | 0.6413      | 0.6446   | 0.2932            |
+| 0.8382        | 4.0   | 1000 | 1.0197          | 0.2764   | 0.7024      | 0.7237   | 0.2783            |
+| 0.7192        | 5.0   | 1250 | 1.0895          | 0.2847   | 0.6824      | 0.6963   | 0.2915            |
+| 0.6249        | 6.0   | 1500 | 1.1296          | 0.3487   | 0.6888      | 0.6948   | 0.3377            |
+| 0.5336        | 7.0   | 1750 | 1.1515          | 0.3591   | 0.6982      | 0.7024   | 0.3496            |
+| 0.4694        | 8.0   | 2000 | 1.1962          | 0.3626   | 0.7185      | 0.7314   | 0.3415            |
+| 0.4058        | 9.0   | 2250 | 1.3313          | 0.3121   | 0.6920      | 0.7085   | 0.3033            |
+| 0.3746        | 10.0  | 2500 | 1.3993          | 0.3628   | 0.6976      | 0.7047   | 0.3495            |
+| 0.3267        | 11.0  | 2750 | 1.5078          | 0.3560   | 0.6958      | 0.7055   | 0.3464            |
+| 0.2939        | 12.0  | 3000 | 1.5875          | 0.3685   | 0.6968      | 0.7062   | 0.3514            |
+| 0.2677        | 13.0  | 3250 | 1.6470          | 0.3606   | 0.6976      | 0.7070   | 0.3490            |
+| 0.2425        | 14.0  | 3500 | 1.7164          | 0.3714   | 0.7069      | 0.7207   | 0.3551            |
+| 0.2301        | 15.0  | 3750 | 1.8151          | 0.3597   | 0.6975      | 0.7123   | 0.3466            |
+| 0.2268        | 16.0  | 4000 | 1.7838          | 0.3940   | 0.7034      | 0.7123   | 0.3869            |
+| 0.201         | 17.0  | 4250 | 1.8328          | 0.3725   | 0.6964      | 0.7062   | 0.3704            |
+| 0.1923        | 18.0  | 4500 | 1.8788          | 0.3708   | 0.7019      | 0.7154   | 0.3591            |
+| 0.1795        | 19.0  | 4750 | 1.8574          | 0.3752   | 0.7031      | 0.7161   | 0.3619            |
+| 0.1713        | 20.0  | 5000 | 1.8672          | 0.3726   | 0.7030      | 0.7161   | 0.3616            |
 ### Framework versions