End of training

Browse files

Files changed (4) hide show

README.md +42 -42
model.safetensors +1 -1
runs/Dec03_00-37-39_Software-AI/events.out.tfevents.1701551259.Software-AI.21828.9 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [makhataei/qa-persian-albert-fa-zwnj-base-v2](https://huggingface.co/makhataei/qa-persian-albert-fa-zwnj-base-v2) on the pquad dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.9991
 ## Model description
@@ -36,7 +36,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 3.90625e-07
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
@@ -48,46 +48,46 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step  | Validation Loss |
 |:-------------:|:-----:|:-----:|:---------------:|
-| 0.0101        | 0.12  | 500   | 2.1486          |
-| 0.0342        | 0.25  | 1000  | 2.3710          |
-| 0.1133        | 0.38  | 1500  | 2.3393          |
-| 0.1976        | 0.5   | 2000  | 2.2309          |
-| 0.1944        | 0.62  | 2500  | 2.1436          |
-| 0.2336        | 0.75  | 3000  | 2.0430          |
-| 0.2277        | 0.88  | 3500  | 1.9529          |
-| 0.2443        | 1.0   | 4000  | 1.8560          |
-| 0.1506        | 1.12  | 4500  | 1.8765          |
-| 0.1369        | 1.25  | 5000  | 1.9003          |
-| 0.1435        | 1.38  | 5500  | 1.9158          |
-| 0.1365        | 1.5   | 6000  | 1.9251          |
-| 0.1387        | 1.62  | 6500  | 1.9243          |
-| 0.1323        | 1.75  | 7000  | 1.9363          |
-| 0.1369        | 1.88  | 7500  | 1.9341          |
-| 0.1312        | 2.0   | 8000  | 1.9389          |
-| 0.1339        | 2.12  | 8500  | 1.9453          |
-| 0.1349        | 2.25  | 9000  | 1.9435          |
-| 0.1418        | 2.38  | 9500  | 1.9479          |
-| 0.1256        | 2.5   | 10000 | 1.9647          |
-| 0.1375        | 2.62  | 10500 | 1.9592          |
-| 0.1237        | 2.75  | 11000 | 1.9647          |
-| 0.1326        | 2.88  | 11500 | 1.9689          |
-| 0.126         | 3.0   | 12000 | 1.9736          |
-| 0.1217        | 3.12  | 12500 | 1.9841          |
-| 0.1274        | 3.25  | 13000 | 1.9852          |
-| 0.1285        | 3.38  | 13500 | 1.9878          |
-| 0.1202        | 3.5   | 14000 | 1.9885          |
-| 0.1365        | 3.62  | 14500 | 1.9865          |
-| 0.1319        | 3.75  | 15000 | 1.9847          |
-| 0.131         | 3.88  | 15500 | 1.9854          |
-| 0.1258        | 4.0   | 16000 | 1.9887          |
-| 0.1233        | 4.12  | 16500 | 1.9899          |
-| 0.1211        | 4.25  | 17000 | 1.9936          |
-| 0.1326        | 4.38  | 17500 | 1.9935          |
-| 0.1216        | 4.5   | 18000 | 1.9955          |
-| 0.1292        | 4.62  | 18500 | 1.9976          |
-| 0.1166        | 4.75  | 19000 | 1.9982          |
-| 0.1313        | 4.88  | 19500 | 1.9990          |
-| 0.1242        | 5.0   | 20000 | 1.9991          |
 ### Framework versions

 This model is a fine-tuned version of [makhataei/qa-persian-albert-fa-zwnj-base-v2](https://huggingface.co/makhataei/qa-persian-albert-fa-zwnj-base-v2) on the pquad dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.0165
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1.953125e-07
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 | Training Loss | Epoch | Step  | Validation Loss |
 |:-------------:|:-----:|:-----:|:---------------:|
+| 0.0081        | 0.12  | 500   | 2.1432          |
+| 0.0312        | 0.25  | 1000  | 2.3066          |
+| 0.1035        | 0.38  | 1500  | 2.3197          |
+| 0.1844        | 0.5   | 2000  | 2.2733          |
+| 0.187         | 0.62  | 2500  | 2.2288          |
+| 0.2282        | 0.75  | 3000  | 2.1744          |
+| 0.2288        | 0.88  | 3500  | 2.1178          |
+| 0.2505        | 1.0   | 4000  | 2.0551          |
+| 0.1537        | 1.12  | 4500  | 2.0434          |
+| 0.1387        | 1.25  | 5000  | 2.0391          |
+| 0.1445        | 1.38  | 5500  | 2.0356          |
+| 0.1368        | 1.5   | 6000  | 2.0332          |
+| 0.1394        | 1.62  | 6500  | 2.0261          |
+| 0.1319        | 1.75  | 7000  | 2.0272          |
+| 0.1365        | 1.88  | 7500  | 2.0207          |
+| 0.1305        | 2.0   | 8000  | 2.0193          |
+| 0.1369        | 2.12  | 8500  | 2.0181          |
+| 0.1376        | 2.25  | 9000  | 2.0122          |
+| 0.1445        | 2.38  | 9500  | 2.0105          |
+| 0.1274        | 2.5   | 10000 | 2.0166          |
+| 0.1396        | 2.62  | 10500 | 2.0115          |
+| 0.1252        | 2.75  | 11000 | 2.0124          |
+| 0.1341        | 2.88  | 11500 | 2.0132          |
+| 0.1271        | 3.0   | 12000 | 2.0146          |
+| 0.125         | 3.12  | 12500 | 2.0185          |
+| 0.1312        | 3.25  | 13000 | 2.0179          |
+| 0.1319        | 3.38  | 13500 | 2.0184          |
+| 0.1235        | 3.5   | 14000 | 2.0178          |
+| 0.1403        | 3.62  | 14500 | 2.0159          |
+| 0.1356        | 3.75  | 15000 | 2.0139          |
+| 0.1344        | 3.88  | 15500 | 2.0132          |
+| 0.1288        | 4.0   | 16000 | 2.0142          |
+| 0.1282        | 4.12  | 16500 | 2.0139          |
+| 0.1251        | 4.25  | 17000 | 2.0152          |
+| 0.1377        | 4.38  | 17500 | 2.0146          |
+| 0.1261        | 4.5   | 18000 | 2.0153          |
+| 0.134         | 4.62  | 18500 | 2.0160          |
+| 0.1205        | 4.75  | 19000 | 2.0162          |
+| 0.1359        | 4.88  | 19500 | 2.0165          |
+| 0.1289        | 5.0   | 20000 | 2.0165          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:568da479779fe88c03de16d4901635f8328f166736b6f5a818f6ae19d9e52cf6
 size 44381360

 version https://git-lfs.github.com/spec/v1
+oid sha256:80c8a6cb7c6ded80241b1ab7b2a484c9e564c7d0ae51bef92c5883e4d725e338
 size 44381360

runs/Dec03_00-37-39_Software-AI/events.out.tfevents.1701551259.Software-AI.21828.9 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:42a32b2071d301a3d594e45327d10a19feaa177f31f0c5dd765cfb779e76d539
+size 22033

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:285a6492f74cc05a51f33c05397fce7b17a22607a6c5f084280d765029f21997
 size 4155

 version https://git-lfs.github.com/spec/v1
+oid sha256:e9fb77bbf24522594e348dad4b39a1fece33f1e0c32171ed77249759f7add128
 size 4155