makhataei committed
Commit c29e421
1 Parent(s): 3081589

End of training
README.md CHANGED
@@ -1,6 +1,6 @@
 ---
 license: apache-2.0
-base_model: HooshvareLab/albert-fa-zwnj-base-v2
+base_model: makhataei/qa-persian-albert-fa-zwnj-base-v2
 tags:
 - generated_from_trainer
 datasets:
@@ -15,9 +15,9 @@ should probably proofread and complete it, then remove this comment. -->
 
 # qa-persian-albert-fa-zwnj-base-v2
 
-This model is a fine-tuned version of [HooshvareLab/albert-fa-zwnj-base-v2](https://huggingface.co/HooshvareLab/albert-fa-zwnj-base-v2) on the pquad dataset.
+This model is a fine-tuned version of [makhataei/qa-persian-albert-fa-zwnj-base-v2](https://huggingface.co/makhataei/qa-persian-albert-fa-zwnj-base-v2) on the pquad dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.0456
+- Loss: 1.4440
 
 ## Model description
 
@@ -36,25 +36,63 @@ More information needed
 ### Training hyperparameters
 
 The following hyperparameters were used during training:
-- learning_rate: 1e-05
-- train_batch_size: 128
-- eval_batch_size: 128
+- learning_rate: 0.0001
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 2
+- num_epochs: 5
 
 ### Training results
 
 | Training Loss | Epoch | Step  | Validation Loss |
 |:-------------:|:-----:|:-----:|:---------------:|
-| 1.0575        | 1.0   | 8000  | 1.1253          |
-| 0.8976        | 2.0   | 16000 | 1.0456          |
+| 1.5731        | 0.12  | 500   | 1.6899          |
+| 1.4759        | 0.25  | 1000  | 1.4951          |
+| 1.3844        | 0.38  | 1500  | 1.5161          |
+| 1.3116        | 0.5   | 2000  | 1.3618          |
+| 1.3055        | 0.62  | 2500  | 1.3795          |
+| 1.2364        | 0.75  | 3000  | 1.3386          |
+| 1.2189        | 0.88  | 3500  | 1.3131          |
+| 1.1737        | 1.0   | 4000  | 1.2202          |
+| 1.0047        | 1.12  | 4500  | 1.2268          |
+| 0.9573        | 1.25  | 5000  | 1.3119          |
+| 0.978         | 1.38  | 5500  | 1.1918          |
+| 0.9655        | 1.5   | 6000  | 1.1896          |
+| 0.9505        | 1.62  | 6500  | 1.1730          |
+| 0.9379        | 1.75  | 7000  | 1.1215          |
+| 0.9237        | 1.88  | 7500  | 1.0691          |
+| 0.8911        | 2.0   | 8000  | 1.0819          |
+| 0.6874        | 2.12  | 8500  | 1.1670          |
+| 0.6919        | 2.25  | 9000  | 1.1506          |
+| 0.7118        | 2.38  | 9500  | 1.1352          |
+| 0.7062        | 2.5   | 10000 | 1.1762          |
+| 0.7077        | 2.62  | 10500 | 1.1072          |
+| 0.7055        | 2.75  | 11000 | 1.0788          |
+| 0.6869        | 2.88  | 11500 | 1.0863          |
+| 0.6707        | 3.0   | 12000 | 1.0167          |
+| 0.4597        | 3.12  | 12500 | 1.2769          |
+| 0.4652        | 3.25  | 13000 | 1.1891          |
+| 0.4673        | 3.38  | 13500 | 1.1466          |
+| 0.4644        | 3.5   | 14000 | 1.1818          |
+| 0.4701        | 3.62  | 14500 | 1.1939          |
+| 0.4765        | 3.75  | 15000 | 1.1518          |
+| 0.4537        | 3.88  | 15500 | 1.1528          |
+| 0.4164        | 4.0   | 16000 | 1.2239          |
+| 0.2465        | 4.12  | 16500 | 1.4501          |
+| 0.2495        | 4.25  | 17000 | 1.3717          |
+| 0.263         | 4.38  | 17500 | 1.4030          |
+| 0.2423        | 4.5   | 18000 | 1.4249          |
+| 0.2297        | 4.62  | 18500 | 1.4387          |
+| 0.227         | 4.75  | 19000 | 1.4600          |
+| 0.239         | 4.88  | 19500 | 1.4452          |
+| 0.2307        | 5.0   | 20000 | 1.4440          |
 
 
 ### Framework versions
 
 - Transformers 4.35.2
-- Pytorch 2.1.0+cu118
+- Pytorch 2.0.1+cu117
 - Datasets 2.15.0
 - Tokenizers 0.15.0
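For readers of the updated card, the hyperparameters listed above map onto `transformers.TrainingArguments` roughly as follows. This is a minimal sketch, not the author's actual training script: the output directory is a placeholder, the Adam betas/epsilon match the library defaults, and the eval cadence is inferred from the 500-step intervals in the results table.

```python
from transformers import AutoModelForQuestionAnswering, TrainingArguments

# Starting checkpoint as declared in the card's base_model field.
model = AutoModelForQuestionAnswering.from_pretrained(
    "makhataei/qa-persian-albert-fa-zwnj-base-v2"
)

# Hyperparameters as reported in the card; output_dir is a placeholder.
args = TrainingArguments(
    output_dir="qa-persian-albert-fa-zwnj-base-v2",
    learning_rate=1e-4,              # learning_rate: 0.0001
    per_device_train_batch_size=16,  # train_batch_size: 16
    per_device_eval_batch_size=16,   # eval_batch_size: 16
    num_train_epochs=5,              # num_epochs: 5
    lr_scheduler_type="linear",      # lr_scheduler_type: linear
    seed=42,                         # seed: 42
    evaluation_strategy="steps",
    eval_steps=500,                  # inferred from the 500-step rows in the results table
)
```

A `Trainer` built from these arguments plus tokenized PQuAD train/validation features would give the reported schedule of one evaluation every 500 steps across 5 epochs.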
config.json CHANGED
@@ -1,5 +1,5 @@
 {
-  "_name_or_path": "HooshvareLab/albert-fa-zwnj-base-v2",
+  "_name_or_path": "makhataei/qa-persian-albert-fa-zwnj-base-v2",
   "architectures": [
     "AlbertForQuestionAnswering"
   ],
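Because the config declares the `AlbertForQuestionAnswering` architecture, the checkpoint named in `_name_or_path` can be queried with the standard question-answering pipeline. A minimal usage sketch; the Persian question and context are illustrative, not drawn from PQuAD:

```python
from transformers import pipeline

# Extractive QA with the fine-tuned ALBERT checkpoint named in config.json.
qa = pipeline("question-answering", model="makhataei/qa-persian-albert-fa-zwnj-base-v2")

# Illustrative example (not from the PQuAD dataset).
result = qa(
    question="پایتخت ایران کجاست؟",
    context="تهران پایتخت ایران و یکی از بزرگترین شهرهای خاورمیانه است.",
)
print(result["answer"], result["score"])
```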
model.safetensors CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:e0590edf20d46e1ef6f204a0ebef4f2318a1b0999f25fe3adbf65aca81354536
+oid sha256:6a7269e1c1a55a9beb75143243ed1ec44a0c1c02dcaa8f757d12df54bda3b231
 size 44381360
runs/Dec02_08-01-05_Software-AI/events.out.tfevents.1701491466.Software-AI.14591.0 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:4c0d21b4973fec1df7128e06cec31ff439dde6c02d99f888ab496a2708fb094b
+size 4485
runs/Dec02_08-01-53_Software-AI/events.out.tfevents.1701491513.Software-AI.15993.0 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:304666ba506c39358a1596bf4daa087235f757d7fa2c4e48ccd906481fabf057
+size 22068
runs/Dec02_08-02-54_Software-AI/events.out.tfevents.1701491575.Software-AI.17159.0 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:465c7a648761b7355b205b8381f8ad04aae0d8d95387fd27a772693392d5db6e
+size 4483
runs/Dec02_08-03-15_Software-AI/events.out.tfevents.1701491596.Software-AI.17599.0 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:892a21bbf0f2eb229ee14b0773a75b19390aeeb549961d69b49953f6e37631c1
+size 4483
runs/Dec02_08-03-31_Software-AI/events.out.tfevents.1701491611.Software-AI.17979.0 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:fc29370a780c81d08829eabe6dd47ffcdd6081de212fb880ebaf50e2db7711be
+size 4483
runs/Dec02_08-03-48_Software-AI/events.out.tfevents.1701491628.Software-AI.18340.0 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:7c3c4c589b19b63a91f8e7f254b6e41117a4810c383901c8b8a5bd67653d112f
+size 4483
runs/Dec02_08-04-02_Software-AI/events.out.tfevents.1701491643.Software-AI.18684.0 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:4eb50b91f5a512217b587c718c67d75c895f5f3a7669bc20318e0a4cc4a1b203
+size 4483
runs/Dec02_08-04-20_Software-AI/events.out.tfevents.1701491660.Software-AI.19058.0 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:e8075b5f282bdd3d4a75aae6aa6416c70281ec1e8503c12057498c1e9eac78ef
+size 4483
runs/Dec02_08-04-49_Software-AI/events.out.tfevents.1701491689.Software-AI.19688.0 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:ac378ec4e6ea6da66cd9665fc60495d38e72d78683fb9c41e12b9a2d1aed8cce
+size 4483
runs/Dec02_08-06-47_Software-AI/events.out.tfevents.1701491808.Software-AI.21828.0 ADDED
@@ -0,0 +1,3 @@
+version https://git-lfs.github.com/spec/v1
+oid sha256:22a58e4886bbc0c3fe805cf31e041e8695d63da13af7014c1a817bd6bb34727a
+size 22027
special_tokens_map.json CHANGED
@@ -1,7 +1,25 @@
 {
-  "bos_token": "[CLS]",
-  "cls_token": "[CLS]",
-  "eos_token": "[SEP]",
+  "bos_token": {
+    "content": "[CLS]",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "cls_token": {
+    "content": "[CLS]",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "eos_token": {
+    "content": "[SEP]",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
   "mask_token": {
     "content": "[MASK]",
     "lstrip": true,
@@ -9,7 +27,25 @@
     "rstrip": false,
     "single_word": false
   },
-  "pad_token": "<pad>",
-  "sep_token": "[SEP]",
-  "unk_token": "<unk>"
+  "pad_token": {
+    "content": "<pad>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "sep_token": {
+    "content": "[SEP]",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  },
+  "unk_token": {
+    "content": "<unk>",
+    "lstrip": false,
+    "normalized": false,
+    "rstrip": false,
+    "single_word": false
+  }
 }
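The change above expands each special token from a bare string into a full token object with explicit `lstrip`/`rstrip`/`normalized`/`single_word` flags, which is how newer tokenizers serialize special tokens. Loading the tokenizer still resolves them to the same strings; a quick check (sketch):

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("makhataei/qa-persian-albert-fa-zwnj-base-v2")

# The expanded entries still resolve to the same special-token strings.
print(tokenizer.special_tokens_map)
# Expected keys: bos_token, cls_token, eos_token, mask_token, pad_token, sep_token, unk_token
```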
tokenizer_config.json CHANGED
@@ -904,10 +904,14 @@
   "eos_token": "[SEP]",
   "keep_accents": false,
   "mask_token": "[MASK]",
+  "max_length": 512,
   "model_max_length": 512,
   "pad_token": "<pad>",
   "remove_space": true,
   "sep_token": "[SEP]",
+  "stride": 256,
   "tokenizer_class": "AlbertTokenizer",
+  "truncation_side": "right",
+  "truncation_strategy": "only_second",
   "unk_token": "<unk>"
 }
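The added `max_length`, `stride`, `truncation_side`, and `truncation_strategy` entries record the sliding-window settings commonly used when tokenizing question/context pairs for extractive QA. A minimal sketch of how such settings are typically applied at preprocessing time; the question and context strings are placeholders, and this is an illustration rather than the repository's actual preprocessing code:

```python
from transformers import AutoTokenizer

tokenizer = AutoTokenizer.from_pretrained("makhataei/qa-persian-albert-fa-zwnj-base-v2")

# Only the context (the second sequence) is truncated; long contexts are split
# into 512-token windows that overlap by 256 tokens.
encoded = tokenizer(
    "یک پرسش نمونه؟",                        # question (placeholder)
    "یک متن بسیار طولانی به عنوان زمینه ...",  # context (placeholder)
    truncation="only_second",
    max_length=512,
    stride=256,
    return_overflowing_tokens=True,
)
print(len(encoded["input_ids"]))  # number of windows produced for this question/context pair
```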
training_args.bin CHANGED
@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:3d7afd4fb7d2066a1346e34ece2c590ce39699a13563541493eefcc0ff847c4e
-size 4600
+oid sha256:96156d6ac9d294f2d0a5e8257ef1f40e2feedd2adb3fbd88285bcb89a752cb20
+size 4155