Model save
README.md CHANGED
@@ -4,8 +4,6 @@ license: mit
 base_model: xlm-roberta-base
 tags:
 - generated_from_trainer
-metrics:
-- f1
 model-index:
 - name: GPT_nyala
   results: []
@@ -18,9 +16,7 @@ should probably proofread and complete it, then remove this comment. -->
 
 This model is a fine-tuned version of [xlm-roberta-base](https://huggingface.co/xlm-roberta-base) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss:
-- Exact Match: 0.0
-- F1: 0.0
+- Loss: 0.0000
 
 ## Model description
 
@@ -39,28 +35,38 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate:
+- learning_rate: 3e-05
 - train_batch_size: 8
 - eval_batch_size: 8
 - seed: 42
 - optimizer: Use OptimizerNames.ADAMW_TORCH with betas=(0.9,0.999) and epsilon=1e-08 and optimizer_args=No additional optimizer arguments
 - lr_scheduler_type: linear
-- num_epochs:
+- num_epochs: 20
 
 ### Training results
 
-| Training Loss | Epoch | Step | Validation Loss |
-|:-------------:|:-----:|:----:|:---------------:|
-| No log | 1.0 |
-| No log | 2.0 |
-| No log | 3.0 |
-| No log | 4.0 |
+| Training Loss | Epoch | Step | Validation Loss |
+|:-------------:|:-----:|:----:|:---------------:|
+| No log        | 1.0   | 63   | 0.0981          |
+| No log        | 2.0   | 126  | 0.0007          |
+| No log        | 3.0   | 189  | 0.0002          |
+| No log        | 4.0   | 252  | 0.0006          |
+| No log        | 5.0   | 315  | 0.0169          |
+| No log        | 6.0   | 378  | 0.0002          |
+| No log        | 7.0   | 441  | 0.0001          |
+| 0.3539        | 8.0   | 504  | 0.0002          |
+| 0.3539        | 9.0   | 567  | 0.0001          |
+| 0.3539        | 10.0  | 630  | 0.0001          |
+| 0.3539        | 11.0  | 693  | 0.0001          |
+| 0.3539        | 12.0  | 756  | 0.0001          |
+| 0.3539        | 13.0  | 819  | 0.0001          |
+| 0.3539        | 14.0  | 882  | 0.0001          |
+| 0.3539        | 15.0  | 945  | 0.0001          |
+| 0.0012        | 16.0  | 1008 | 0.0000          |
+| 0.0012        | 17.0  | 1071 | 0.0001          |
+| 0.0012        | 18.0  | 1134 | 0.0000          |
+| 0.0012        | 19.0  | 1197 | 0.0000          |
+| 0.0012        | 20.0  | 1260 | 0.0000          |
 
 
 ### Framework versions
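
As a reproduction aid, the hyperparameter list in this commit maps roughly onto `transformers.TrainingArguments` as sketched below. This is a minimal sketch, not the author's actual training script: `output_dir` and the per-epoch evaluation strategy are assumptions, while the remaining values mirror the card.

```python
from transformers import TrainingArguments

# Minimal sketch of the configuration described in the card.
# output_dir and eval_strategy are assumptions; the rest mirrors
# the hyperparameter list above.
training_args = TrainingArguments(
    output_dir="GPT_nyala",   # assumption: not stated in the card
    learning_rate=3e-5,
    per_device_train_batch_size=8,
    per_device_eval_batch_size=8,
    seed=42,
    optim="adamw_torch",      # OptimizerNames.ADAMW_TORCH
    adam_beta1=0.9,
    adam_beta2=0.999,
    adam_epsilon=1e-8,
    lr_scheduler_type="linear",
    num_train_epochs=20,
    # assumption: the results table reports one eval per epoch
    # (named evaluation_strategy in older transformers releases)
    eval_strategy="epoch",
)
```

Two details in the results table are worth noting. The "No log" entries in the training-loss column are consistent with the Trainer's default `logging_steps=500`: the first logged value appears only once training passes step 500 (epoch 8, step 504). And at 63 steps per epoch with a batch size of 8, the training set holds roughly 63 × 8 ≈ 500 examples, which matches the 1260 total steps over 20 epochs.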
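
The Exact Match and F1 entries removed by this commit hint that the checkpoint was evaluated as an extractive question-answering model. Assuming that task, usage could look like the sketch below; the model id is hypothetical, since the card does not give the full hub path.

```python
from transformers import pipeline

# Hypothetical repo id; replace with the checkpoint's actual hub path.
qa = pipeline("question-answering", model="GPT_nyala")

result = qa(
    question="Who wrote the report?",
    context="The report was written by the field team in 2023.",
)
print(result["answer"], result["score"])
```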