CuATR-distilbert-LoRA

Files changed (4)

README.md CHANGED

@@ -18,9 +18,9 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 0.6865
-- Accuracy: 0.6087
-- F1: 0.7429
 ## Model description
@@ -47,16 +47,23 @@ The following hyperparameters were used during training:
 - total_train_batch_size: 64
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 4
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|
-| 0.6894        | 0.67  | 1    | 0.6867          | 0.6087   | 0.7429 |
-| 0.6931        | 2.0   | 3    | 0.6865          | 0.6087   | 0.7429 |
-| 0.6873        | 2.67  | 4    | 0.6865          | 0.6087   | 0.7429 |
 ### Framework versions

 This model is a fine-tuned version of [distilbert-base-uncased](https://huggingface.co/distilbert-base-uncased) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6999
+- Accuracy: 0.0870
+- F1: 0.0870
 ## Model description
 - total_train_batch_size: 64
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 14
 - mixed_precision_training: Native AMP
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss | Accuracy | F1     |
 |:-------------:|:-----:|:----:|:---------------:|:--------:|:------:|
+| 0.7063        | 0.67  | 1    | 0.7011          | 0.0870   | 0.0870 |
+| 0.6861        | 2.0   | 3    | 0.7008          | 0.0870   | 0.0870 |
+| 0.6976        | 2.67  | 4    | 0.7007          | 0.0870   | 0.0870 |
+| 0.7033        | 4.0   | 6    | 0.7004          | 0.0870   | 0.0870 |
+| 0.7011        | 4.67  | 7    | 0.7003          | 0.0870   | 0.0870 |
+| 0.698         | 6.0   | 9    | 0.7002          | 0.0870   | 0.0870 |
+| 0.7037        | 6.67  | 10   | 0.7001          | 0.0870   | 0.0870 |
+| 0.6977        | 8.0   | 12   | 0.7000          | 0.0870   | 0.0870 |
+| 0.7002        | 8.67  | 13   | 0.7000          | 0.0870   | 0.0870 |
+| 0.6995        | 9.33  | 14   | 0.6999          | 0.0870   | 0.0870 |
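
The Epoch and Step columns in the updated table move together at a fixed ratio. A minimal sketch, assuming 1.5 optimizer steps per epoch (a value inferred from the table rows only, not stated anywhere in the diff), checks each logged row against that ratio:

```python
# (step -> epoch) pairs copied from the updated results table.
logged = {
    1: 0.67, 3: 2.0, 4: 2.67, 6: 4.0, 7: 4.67,
    9: 6.0, 10: 6.67, 12: 8.0, 13: 8.67, 14: 9.33,
}

# Assumption: inferred from the table, not taken from the training config.
STEPS_PER_EPOCH = 1.5


def epoch_at(step: int) -> float:
    """Epoch value implied by a step count, rounded as in the table."""
    return round(step / STEPS_PER_EPOCH, 2)


# Collect rows where the implied epoch differs from the logged one.
mismatches = {s: (e, epoch_at(s)) for s, e in logged.items() if epoch_at(s) != e}
print(mismatches)
```

Under this assumption every row matches. The last logged row is at epoch 9.33, short of the configured `num_epochs: 14`; the diff does not say why.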
 ### Framework versions

adapter_config.json CHANGED

@@ -16,9 +16,9 @@
   "rank_pattern": {},
   "revision": null,
   "target_modules": [
-    "v_lin",
     "q_lin",
-    "k_lin"
   ],
   "task_type": "TOKEN_CLS"
 }

   "rank_pattern": {},
   "revision": null,
   "target_modules": [
     "q_lin",
+    "k_lin",
+    "v_lin"
   ],
   "task_type": "TOKEN_CLS"
 }
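
The `target_modules` hunk appears to change only the order of the entries. A small sketch that compares the two lists as they appear in this diff (the helper name `same_modules` is hypothetical, and the sketch makes no claim about whether the PEFT library treats the order as significant):

```python
import json

# Only the field touched by the hunk is reproduced here.
old_cfg = json.loads('{"target_modules": ["v_lin", "q_lin", "k_lin"]}')
new_cfg = json.loads('{"target_modules": ["q_lin", "k_lin", "v_lin"]}')


def same_modules(a: dict, b: dict) -> bool:
    """True when both configs name the same modules, ignoring order."""
    return sorted(a["target_modules"]) == sorted(b["target_modules"])


# The raw lists differ, so the hunk is a real textual change.
order_changed = old_cfg["target_modules"] != new_cfg["target_modules"]
print(same_modules(old_cfg, new_cfg), order_changed)
```

Both values print as `True`: the same three DistilBERT attention projections are targeted before and after, listed in a different order.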

adapter_model.safetensors CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:a2303adc67ddf09fc04b437f94e611540e087e9104083d01ac994cf027044b73
 size 447536

 version https://git-lfs.github.com/spec/v1
+oid sha256:b46de21be5f826a8bb505a096e7cce4fa2020dd39e306bcb90c95b3c4c78e1d5
 size 447536

training_args.bin CHANGED

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:28b83cee47f7fb3807d87d114642f6a24ecf08708f7e1deea6b3572f865871a8
 size 4600

 version https://git-lfs.github.com/spec/v1
+oid sha256:b498cdae053c0476cb868b3e16d621ff5d43bf34f20c923f0c2ab78edb09fadf
 size 4600