End of training

Browse files

Files changed (4) hide show

README.md +42 -42
model.safetensors +1 -1
runs/Dec02_22-53-02_Software-AI/events.out.tfevents.1701544983.Software-AI.21828.8 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [makhataei/qa-persian-albert-fa-zwnj-base-v2](https://huggingface.co/makhataei/qa-persian-albert-fa-zwnj-base-v2) on the pquad dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.0167
 ## Model description
@@ -36,7 +36,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 7.8125e-07
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
@@ -48,46 +48,46 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step  | Validation Loss |
 |:-------------:|:-----:|:-----:|:---------------:|
-| 0.0122        | 0.12  | 500   | 2.2415          |
-| 0.0401        | 0.25  | 1000  | 2.4681          |
-| 0.1308        | 0.38  | 1500  | 2.3118          |
-| 0.218         | 0.5   | 2000  | 2.0826          |
-| 0.2057        | 0.62  | 2500  | 1.9572          |
-| 0.2441        | 0.75  | 3000  | 1.8029          |
-| 0.2311        | 0.88  | 3500  | 1.7156          |
-| 0.2439        | 1.0   | 4000  | 1.6101          |
-| 0.1531        | 1.12  | 4500  | 1.7252          |
-| 0.1405        | 1.25  | 5000  | 1.7982          |
-| 0.1478        | 1.38  | 5500  | 1.8344          |
-| 0.141         | 1.5   | 6000  | 1.8482          |
-| 0.1426        | 1.62  | 6500  | 1.8463          |
-| 0.1376        | 1.75  | 7000  | 1.8652          |
-| 0.1419        | 1.88  | 7500  | 1.8663          |
-| 0.1361        | 2.0   | 8000  | 1.8739          |
-| 0.1321        | 2.12  | 8500  | 1.8946          |
-| 0.1337        | 2.25  | 9000  | 1.8995          |
-| 0.1411        | 2.38  | 9500  | 1.9112          |
-| 0.1258        | 2.5   | 10000 | 1.9391          |
-| 0.1373        | 2.62  | 10500 | 1.9296          |
-| 0.1241        | 2.75  | 11000 | 1.9384          |
-| 0.1328        | 2.88  | 11500 | 1.9444          |
-| 0.1266        | 3.0   | 12000 | 1.9511          |
-| 0.1176        | 3.12  | 12500 | 1.9737          |
-| 0.1231        | 3.25  | 13000 | 1.9778          |
-| 0.1246        | 3.38  | 13500 | 1.9836          |
-| 0.1166        | 3.5   | 14000 | 1.9863          |
-| 0.1326        | 3.62  | 14500 | 1.9850          |
-| 0.1282        | 3.75  | 15000 | 1.9833          |
-| 0.1275        | 3.88  | 15500 | 1.9858          |
-| 0.1228        | 4.0   | 16000 | 1.9928          |
-| 0.1169        | 4.12  | 16500 | 1.9970          |
-| 0.116         | 4.25  | 17000 | 2.0050          |
-| 0.1259        | 4.38  | 17500 | 2.0058          |
-| 0.1161        | 4.5   | 18000 | 2.0098          |
-| 0.1234        | 4.62  | 18500 | 2.0140          |
-| 0.1117        | 4.75  | 19000 | 2.0149          |
-| 0.1254        | 4.88  | 19500 | 2.0164          |
-| 0.1182        | 5.0   | 20000 | 2.0167          |
 ### Framework versions

 This model is a fine-tuned version of [makhataei/qa-persian-albert-fa-zwnj-base-v2](https://huggingface.co/makhataei/qa-persian-albert-fa-zwnj-base-v2) on the pquad dataset.
 It achieves the following results on the evaluation set:
+- Loss: 1.9991
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 3.90625e-07
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 | Training Loss | Epoch | Step  | Validation Loss |
 |:-------------:|:-----:|:-----:|:---------------:|
+| 0.0101        | 0.12  | 500   | 2.1486          |
+| 0.0342        | 0.25  | 1000  | 2.3710          |
+| 0.1133        | 0.38  | 1500  | 2.3393          |
+| 0.1976        | 0.5   | 2000  | 2.2309          |
+| 0.1944        | 0.62  | 2500  | 2.1436          |
+| 0.2336        | 0.75  | 3000  | 2.0430          |
+| 0.2277        | 0.88  | 3500  | 1.9529          |
+| 0.2443        | 1.0   | 4000  | 1.8560          |
+| 0.1506        | 1.12  | 4500  | 1.8765          |
+| 0.1369        | 1.25  | 5000  | 1.9003          |
+| 0.1435        | 1.38  | 5500  | 1.9158          |
+| 0.1365        | 1.5   | 6000  | 1.9251          |
+| 0.1387        | 1.62  | 6500  | 1.9243          |
+| 0.1323        | 1.75  | 7000  | 1.9363          |
+| 0.1369        | 1.88  | 7500  | 1.9341          |
+| 0.1312        | 2.0   | 8000  | 1.9389          |
+| 0.1339        | 2.12  | 8500  | 1.9453          |
+| 0.1349        | 2.25  | 9000  | 1.9435          |
+| 0.1418        | 2.38  | 9500  | 1.9479          |
+| 0.1256        | 2.5   | 10000 | 1.9647          |
+| 0.1375        | 2.62  | 10500 | 1.9592          |
+| 0.1237        | 2.75  | 11000 | 1.9647          |
+| 0.1326        | 2.88  | 11500 | 1.9689          |
+| 0.126         | 3.0   | 12000 | 1.9736          |
+| 0.1217        | 3.12  | 12500 | 1.9841          |
+| 0.1274        | 3.25  | 13000 | 1.9852          |
+| 0.1285        | 3.38  | 13500 | 1.9878          |
+| 0.1202        | 3.5   | 14000 | 1.9885          |
+| 0.1365        | 3.62  | 14500 | 1.9865          |
+| 0.1319        | 3.75  | 15000 | 1.9847          |
+| 0.131         | 3.88  | 15500 | 1.9854          |
+| 0.1258        | 4.0   | 16000 | 1.9887          |
+| 0.1233        | 4.12  | 16500 | 1.9899          |
+| 0.1211        | 4.25  | 17000 | 1.9936          |
+| 0.1326        | 4.38  | 17500 | 1.9935          |
+| 0.1216        | 4.5   | 18000 | 1.9955          |
+| 0.1292        | 4.62  | 18500 | 1.9976          |
+| 0.1166        | 4.75  | 19000 | 1.9982          |
+| 0.1313        | 4.88  | 19500 | 1.9990          |
+| 0.1242        | 5.0   | 20000 | 1.9991          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:122ffa9b984bac2111b2dd9df163ec73d0d0985fc24d74eac675bcf347b55623
 size 44381360

 version https://git-lfs.github.com/spec/v1
+oid sha256:568da479779fe88c03de16d4901635f8328f166736b6f5a818f6ae19d9e52cf6
 size 44381360

runs/Dec02_22-53-02_Software-AI/events.out.tfevents.1701544983.Software-AI.21828.8 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:3d90481bff8ef03c1603ace12a41768a6fe992d734f175a5e63d0e9cb1ea64c0
+size 22032

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:cd50438b1918fb5bc606ba42d8eaa76a940b24c0ebd4b178f1a1e1740c95cc5e
 size 4155

 version https://git-lfs.github.com/spec/v1
+oid sha256:285a6492f74cc05a51f33c05397fce7b17a22607a6c5f084280d765029f21997
 size 4155