Model save

Files changed (4) hide show

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on the None dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.5713
 ## Model description
@@ -34,25 +34,23 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 5e-05
-- train_batch_size: 64
-- eval_batch_size: 64
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
-- num_epochs: 5
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.6371        | 1.0   | 874  | 0.5714          |
-| 0.4772        | 2.0   | 1748 | 0.4770          |
-| 0.3336        | 3.0   | 2622 | 0.5198          |
-| 0.2174        | 4.0   | 3496 | 0.4755          |
-| 0.141         | 5.0   | 4370 | 0.5713          |
 ### Framework versions

 This model is a fine-tuned version of [roberta-base](https://huggingface.co/roberta-base) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.4183
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 2e-05
+- train_batch_size: 48
+- eval_batch_size: 48
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
 - lr_scheduler_warmup_steps: 500
+- num_epochs: 3
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 0.6532        | 1.0   | 586  | 0.5639          |
+| 0.4884        | 2.0   | 1172 | 0.4635          |
+| 0.3384        | 3.0   | 1758 | 0.4183          |
 ### Framework versions

logs/events.out.tfevents.1702229150.a86980be031b.215.7 CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:ecf02db000687ab51d2130ca4c2e71a5834264520cea363342b8326da86c9fe5
-size 5218

 version https://git-lfs.github.com/spec/v1
+oid sha256:8b6f4777a5a8c35972ea36a700567bca46d51b6353f533df2e3490c6d659146f
+size 6000

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:09b74d1723274bfc68012104c885f9d85434cf0b6a7809bcb8500f9426bbbb72
 size 498612824

 version https://git-lfs.github.com/spec/v1
+oid sha256:d8298daf926489dbdb9fbaf34dd288b8a10728c884be1b8a32359b0c05f6af42
 size 498612824

tokenizer.json CHANGED Viewed

@@ -2,7 +2,7 @@
   "version": "1.0",
   "truncation": {
     "direction": "Right",
-    "max_length": 136,
     "strategy": "LongestFirst",
     "stride": 0
   },

   "version": "1.0",
   "truncation": {
     "direction": "Right",
+    "max_length": 269,
     "strategy": "LongestFirst",
     "stride": 0
   },