End of training

Browse files

Files changed (4) hide show

README.md +42 -42
model.safetensors +1 -1
runs/Dec02_19-18-50_Software-AI/events.out.tfevents.1701532130.Software-AI.21828.6 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -17,7 +17,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [makhataei/qa-persian-albert-fa-zwnj-base-v2](https://huggingface.co/makhataei/qa-persian-albert-fa-zwnj-base-v2) on the pquad dataset.
 It achieves the following results on the evaluation set:
-- Loss: 2.3917
 ## Model description
@@ -36,7 +36,7 @@ More information needed
 ### Training hyperparameters
 The following hyperparameters were used during training:
-- learning_rate: 3.125e-06
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
@@ -48,46 +48,46 @@ The following hyperparameters were used during training:
 | Training Loss | Epoch | Step  | Validation Loss |
 |:-------------:|:-----:|:-----:|:---------------:|
-| 0.0242        | 0.12  | 500   | 2.3895          |
-| 0.0834        | 0.25  | 1000  | 2.4284          |
-| 0.2221        | 0.38  | 1500  | 1.9002          |
-| 0.3264        | 0.5   | 2000  | 1.4454          |
-| 0.286         | 0.62  | 2500  | 1.4277          |
-| 0.2499        | 0.75  | 3000  | 1.4548          |
-| 0.2359        | 0.88  | 3500  | 1.4717          |
-| 0.2149        | 1.0   | 4000  | 1.4957          |
-| 0.1637        | 1.12  | 4500  | 1.6380          |
-| 0.1513        | 1.25  | 5000  | 1.7106          |
-| 0.157         | 1.38  | 5500  | 1.7514          |
-| 0.1501        | 1.5   | 6000  | 1.7322          |
-| 0.1511        | 1.62  | 6500  | 1.7126          |
-| 0.1464        | 1.75  | 7000  | 1.7646          |
-| 0.1552        | 1.88  | 7500  | 1.7580          |
-| 0.1468        | 2.0   | 8000  | 1.7727          |
-| 0.1136        | 2.12  | 8500  | 1.9067          |
-| 0.1181        | 2.25  | 9000  | 1.9514          |
-| 0.1243        | 2.38  | 9500  | 1.9811          |
-| 0.1137        | 2.5   | 10000 | 2.0183          |
-| 0.1229        | 2.62  | 10500 | 2.0141          |
-| 0.1144        | 2.75  | 11000 | 2.0105          |
-| 0.1212        | 2.88  | 11500 | 2.0363          |
-| 0.1185        | 3.0   | 12000 | 2.0345          |
-| 0.0892        | 3.12  | 12500 | 2.1508          |
-| 0.0937        | 3.25  | 13000 | 2.1875          |
-| 0.0956        | 3.38  | 13500 | 2.2057          |
-| 0.0876        | 3.5   | 14000 | 2.2200          |
-| 0.1032        | 3.62  | 14500 | 2.2411          |
-| 0.0997        | 3.75  | 15000 | 2.2433          |
-| 0.0978        | 3.88  | 15500 | 2.2572          |
-| 0.0974        | 4.0   | 16000 | 2.2873          |
-| 0.0773        | 4.12  | 16500 | 2.3254          |
-| 0.0795        | 4.25  | 17000 | 2.3504          |
-| 0.0885        | 4.38  | 17500 | 2.3555          |
-| 0.0784        | 4.5   | 18000 | 2.3785          |
-| 0.0865        | 4.62  | 18500 | 2.3804          |
-| 0.0768        | 4.75  | 19000 | 2.3799          |
-| 0.084         | 4.88  | 19500 | 2.3895          |
-| 0.0807        | 5.0   | 20000 | 2.3917          |
 ### Framework versions

 This model is a fine-tuned version of [makhataei/qa-persian-albert-fa-zwnj-base-v2](https://huggingface.co/makhataei/qa-persian-albert-fa-zwnj-base-v2) on the pquad dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.1007
 ## Model description
 ### Training hyperparameters
 The following hyperparameters were used during training:
+- learning_rate: 1.5625e-06
 - train_batch_size: 16
 - eval_batch_size: 16
 - seed: 42
 | Training Loss | Epoch | Step  | Validation Loss |
 |:-------------:|:-----:|:-----:|:---------------:|
+| 0.0157        | 0.12  | 500   | 2.3489          |
+| 0.0549        | 0.25  | 1000  | 2.5113          |
+| 0.1605        | 0.38  | 1500  | 2.1880          |
+| 0.256         | 0.5   | 2000  | 1.8072          |
+| 0.2326        | 0.62  | 2500  | 1.7167          |
+| 0.2736        | 0.75  | 3000  | 1.5287          |
+| 0.2506        | 0.88  | 3500  | 1.5003          |
+| 0.2254        | 1.0   | 4000  | 1.5040          |
+| 0.1609        | 1.12  | 4500  | 1.6583          |
+| 0.1488        | 1.25  | 5000  | 1.7421          |
+| 0.1542        | 1.38  | 5500  | 1.7805          |
+| 0.1478        | 1.5   | 6000  | 1.7781          |
+| 0.1501        | 1.62  | 6500  | 1.7627          |
+| 0.1448        | 1.75  | 7000  | 1.7906          |
+| 0.1499        | 1.88  | 7500  | 1.7947          |
+| 0.1435        | 2.0   | 8000  | 1.8035          |
+| 0.1279        | 2.12  | 8500  | 1.8594          |
+| 0.1303        | 2.25  | 9000  | 1.8789          |
+| 0.1384        | 2.38  | 9500  | 1.8980          |
+| 0.124         | 2.5   | 10000 | 1.9347          |
+| 0.1345        | 2.62  | 10500 | 1.9211          |
+| 0.1237        | 2.75  | 11000 | 1.9283          |
+| 0.1304        | 2.88  | 11500 | 1.9416          |
+| 0.1258        | 3.0   | 12000 | 1.9489          |
+| 0.1092        | 3.12  | 12500 | 1.9973          |
+| 0.1146        | 3.25  | 13000 | 2.0114          |
+| 0.1156        | 3.38  | 13500 | 2.0210          |
+| 0.1076        | 3.5   | 14000 | 2.0272          |
+| 0.1235        | 3.62  | 14500 | 2.0330          |
+| 0.1195        | 3.75  | 15000 | 2.0305          |
+| 0.1196        | 3.88  | 15500 | 2.0372          |
+| 0.1159        | 4.0   | 16000 | 2.0504          |
+| 0.1042        | 4.12  | 16500 | 2.0632          |
+| 0.1047        | 4.25  | 17000 | 2.0779          |
+| 0.1132        | 4.38  | 17500 | 2.0810          |
+| 0.1051        | 4.5   | 18000 | 2.0899          |
+| 0.1113        | 4.62  | 18500 | 2.0969          |
+| 0.1011        | 4.75  | 19000 | 2.0972          |
+| 0.1125        | 4.88  | 19500 | 2.1002          |
+| 0.1053        | 5.0   | 20000 | 2.1007          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0263a53aef24222e6d304c621dc0cfcf03d945f74d565bf30281e139f2697faf
 size 44381360

 version https://git-lfs.github.com/spec/v1
+oid sha256:48ca923b3abc2c31c973cea13dd25c268e5948b65c68237bc1646fdf0b13d604
 size 44381360

runs/Dec02_19-18-50_Software-AI/events.out.tfevents.1701532130.Software-AI.21828.6 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:339da275b12201378968a36a40fa43d6ca520246bdcb81abd70508d445296779
+size 22031

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2239acbee13f0ab317d1615413df300209b09fd9bb294ec808ae45cf1b3495ac
 size 4155

 version https://git-lfs.github.com/spec/v1
+oid sha256:d2099ff6be5b9f11509b9535ad9eadf1dd52ab7d6f64dcd0b9e7ccc4f6b57c74
 size 4155