oddadmix/masrawy-english-arabic-translator-clauda-opus-v1

Browse files

Files changed (4) hide show

README.md +15 -20
model.safetensors +1 -1
runs/Dec16_19-14-59_n2b1f8fcs7/events.out.tfevents.1734376502.n2b1f8fcs7.364.0 +3 -0
training_args.bin +1 -1

README.md CHANGED Viewed

@@ -15,7 +15,7 @@ should probably proofread and complete it, then remove this comment. -->
 This model is a fine-tuned version of [Helsinki-NLP/opus-mt-en-ar](https://huggingface.co/Helsinki-NLP/opus-mt-en-ar) on an unknown dataset.
 It achieves the following results on the evaluation set:
-- Loss: 1.9350
 ## Model description
@@ -35,34 +35,29 @@ More information needed
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
-- train_batch_size: 8
-- eval_batch_size: 8
 - seed: 42
 - gradient_accumulation_steps: 8
-- total_train_batch_size: 64
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 15
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss |
 |:-------------:|:-----:|:-----:|:---------------:|
-| 2.6915        | 1.0   | 1870  | 2.4310          |
-| 2.3824        | 2.0   | 3740  | 2.2902          |
-| 2.2275        | 3.0   | 5611  | 2.2189          |
-| 2.1461        | 4.0   | 7481  | 2.1871          |
-| 2.1294        | 5.0   | 9720  | 1.9911          |
-| 2.0879        | 6.0   | 11665 | 1.9805          |
-| 2.011         | 7.0   | 13609 | 1.9687          |
-| 1.9463        | 8.0   | 15554 | 1.9595          |
-| 1.8914        | 9.0   | 17499 | 1.9506          |
-| 1.8446        | 10.0  | 19443 | 1.9429          |
-| 1.8157        | 11.0  | 21388 | 1.9388          |
-| 1.7932        | 12.0  | 23333 | 1.9402          |
-| 1.7528        | 13.0  | 25277 | 1.9360          |
-| 1.741         | 14.0  | 27222 | 1.9378          |
-| 1.7257        | 15.0  | 29160 | 1.9350          |
 ### Framework versions

 This model is a fine-tuned version of [Helsinki-NLP/opus-mt-en-ar](https://huggingface.co/Helsinki-NLP/opus-mt-en-ar) on an unknown dataset.
 It achieves the following results on the evaluation set:
+- Loss: 2.1734
 ## Model description
 The following hyperparameters were used during training:
 - learning_rate: 5e-05
+- train_batch_size: 16
+- eval_batch_size: 16
 - seed: 42
 - gradient_accumulation_steps: 8
+- total_train_batch_size: 128
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step  | Validation Loss |
 |:-------------:|:-----:|:-----:|:---------------:|
+| 2.8321        | 1.0   | 1120  | 2.5553          |
+| 2.5351        | 2.0   | 2241  | 2.3913          |
+| 2.3732        | 3.0   | 3362  | 2.3113          |
+| 2.2808        | 4.0   | 4483  | 2.2630          |
+| 2.191         | 5.0   | 5604  | 2.2263          |
+| 2.1224        | 6.0   | 6725  | 2.2071          |
+| 2.0825        | 7.0   | 7846  | 2.1917          |
+| 2.0464        | 8.0   | 8967  | 2.1824          |
+| 2.027         | 9.0   | 10087 | 2.1775          |
+| 2.0087        | 9.99  | 11200 | 2.1734          |
 ### Framework versions

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:2fc5fe9aea24438e66b4bb5244c753c840a236857a5b117bc47e7385da9ba6cc
 size 305452744

 version https://git-lfs.github.com/spec/v1
+oid sha256:4e9b1e776a5bc56e740a927a5552f78ec1949e5249700ff70713cd9a926d068e
 size 305452744

runs/Dec16_19-14-59_n2b1f8fcs7/events.out.tfevents.1734376502.n2b1f8fcs7.364.0 ADDED Viewed

	@@ -0,0 +1,3 @@

+version https://git-lfs.github.com/spec/v1
+oid sha256:cc24a302d52a2b129136174fd529f573d219ae7e4535d30126dd4a19039b65a1
+size 11736

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:436d3a74630ee698945d82a903e285f90b2103f6b664340a9d7f1176e4d9f9c2
 size 4792

 version https://git-lfs.github.com/spec/v1
+oid sha256:116c331335b4dcc0060a391509f2a72ca9497d6a99252e89a20729fa7f5842a9
 size 4792