asimokby/nllb-finetuned-ar-en

Browse files

Files changed (4) hide show

README.md +20 -15
generation_config.json +1 -1
model.safetensors +1 -1
training_args.bin +2 -2

README.md CHANGED Viewed

@@ -5,18 +5,18 @@ base_model: facebook/nllb-200-distilled-600M
 tags:
 - generated_from_trainer
 model-index:
-- name: qcri_test
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
-# qcri_test
-This model is a fine-tuned version of [facebook/nllb-200-distilled-600M](https://huggingface.co/facebook/nllb-200-distilled-600M) on the [TEDx dataset](https://huggingface.co/datasets/IWSLT/ted_talks_iwslt).
 It achieves the following results on the evaluation set:
-- Loss: 0.5696
 ## Model description
@@ -41,22 +41,27 @@ The following hyperparameters were used during training:
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
-- num_epochs: 10
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
-| 0.8109        | 1.0   | 537  | 0.8636          |
-| 0.7152        | 2.0   | 1074 | 0.7924          |
-| 0.6361        | 3.0   | 1611 | 0.7339          |
-| 0.6227        | 4.0   | 2148 | 0.6863          |
-| 0.599         | 5.0   | 2685 | 0.6502          |
-| 0.551         | 6.0   | 3222 | 0.6199          |
-| 0.5277        | 7.0   | 3759 | 0.5982          |
-| 0.4916        | 8.0   | 4296 | 0.5823          |
-| 0.4766        | 9.0   | 4833 | 0.5734          |
-| 0.4581        | 10.0  | 5370 | 0.5696          |
 ### Framework versions

 tags:
 - generated_from_trainer
 model-index:
+- name: nllb-finetuned-ar-en
   results: []
 ---
 <!-- This model card has been generated automatically according to the information the Trainer had access to. You
 should probably proofread and complete it, then remove this comment. -->
+# nllb-finetuned-ar-en
+This model is a fine-tuned version of [facebook/nllb-200-distilled-600M](https://huggingface.co/facebook/nllb-200-distilled-600M) on the None dataset.
 It achieves the following results on the evaluation set:
+- Loss: 0.6562
 ## Model description
 - seed: 42
 - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
 - lr_scheduler_type: linear
+- num_epochs: 15
 ### Training results
 | Training Loss | Epoch | Step | Validation Loss |
 |:-------------:|:-----:|:----:|:---------------:|
+| 1.5222        | 1.0   | 515  | 1.2649          |
+| 1.2832        | 2.0   | 1030 | 1.1357          |
+| 1.1181        | 3.0   | 1545 | 1.0412          |
+| 1.0074        | 4.0   | 2060 | 0.9647          |
+| 0.8996        | 5.0   | 2575 | 0.8987          |
+| 0.8206        | 6.0   | 3090 | 0.8478          |
+| 0.7525        | 7.0   | 3605 | 0.8047          |
+| 0.6886        | 8.0   | 4120 | 0.7683          |
+| 0.6443        | 9.0   | 4635 | 0.7336          |
+| 0.5983        | 10.0  | 5150 | 0.7076          |
+| 0.5675        | 11.0  | 5665 | 0.6910          |
+| 0.5415        | 12.0  | 6180 | 0.6744          |
+| 0.524         | 13.0  | 6695 | 0.6652          |
+| 0.5028        | 14.0  | 7210 | 0.6578          |
+| 0.5028        | 15.0  | 7725 | 0.6562          |
 ### Framework versions

generation_config.json CHANGED Viewed

@@ -3,7 +3,7 @@
   "bos_token_id": 0,
   "decoder_start_token_id": 2,
   "eos_token_id": 2,
-  "max_length": 256,
   "pad_token_id": 1,
   "transformers_version": "4.44.2"
 }

   "bos_token_id": 0,
   "decoder_start_token_id": 2,
   "eos_token_id": 2,
+  "max_length": 200,
   "pad_token_id": 1,
   "transformers_version": "4.44.2"
 }

model.safetensors CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:0674405d32bd1f16e25e42b098df31518de67dc7058d70bd47557d2b859da7b7
 size 2460354912

 version https://git-lfs.github.com/spec/v1
+oid sha256:766fbf0019a038f4e7f0882c41cce256453dcb2372ed9ce04cfffe4e1a01d803
 size 2460354912

training_args.bin CHANGED Viewed

@@ -1,3 +1,3 @@
 version https://git-lfs.github.com/spec/v1
-oid sha256:c49895d2871c87367958b55d4da854337692e6c70b604aeb2ac200d229e89b11
-size 5368

 version https://git-lfs.github.com/spec/v1
+oid sha256:91b2d08caaa41c6e5bebe90105cb29fcb454129c02617686ebfeee592ba72a06
+size 5432