asimokby commited on
Commit
a1a9e37
1 Parent(s): d58a284

asimokby/nllb-finetuned-ar-en

Browse files
Files changed (4) hide show
  1. README.md +20 -15
  2. generation_config.json +1 -1
  3. model.safetensors +1 -1
  4. training_args.bin +2 -2
README.md CHANGED
@@ -5,18 +5,18 @@ base_model: facebook/nllb-200-distilled-600M
5
  tags:
6
  - generated_from_trainer
7
  model-index:
8
- - name: qcri_test
9
  results: []
10
  ---
11
 
12
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
  should probably proofread and complete it, then remove this comment. -->
14
 
15
- # qcri_test
16
 
17
- This model is a fine-tuned version of [facebook/nllb-200-distilled-600M](https://huggingface.co/facebook/nllb-200-distilled-600M) on the [TEDx dataset](https://huggingface.co/datasets/IWSLT/ted_talks_iwslt).
18
  It achieves the following results on the evaluation set:
19
- - Loss: 0.5696
20
 
21
  ## Model description
22
 
@@ -41,22 +41,27 @@ The following hyperparameters were used during training:
41
  - seed: 42
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: linear
44
- - num_epochs: 10
45
 
46
  ### Training results
47
 
48
  | Training Loss | Epoch | Step | Validation Loss |
49
  |:-------------:|:-----:|:----:|:---------------:|
50
- | 0.8109 | 1.0 | 537 | 0.8636 |
51
- | 0.7152 | 2.0 | 1074 | 0.7924 |
52
- | 0.6361 | 3.0 | 1611 | 0.7339 |
53
- | 0.6227 | 4.0 | 2148 | 0.6863 |
54
- | 0.599 | 5.0 | 2685 | 0.6502 |
55
- | 0.551 | 6.0 | 3222 | 0.6199 |
56
- | 0.5277 | 7.0 | 3759 | 0.5982 |
57
- | 0.4916 | 8.0 | 4296 | 0.5823 |
58
- | 0.4766 | 9.0 | 4833 | 0.5734 |
59
- | 0.4581 | 10.0 | 5370 | 0.5696 |
 
 
 
 
 
60
 
61
 
62
  ### Framework versions
 
5
  tags:
6
  - generated_from_trainer
7
  model-index:
8
+ - name: nllb-finetuned-ar-en
9
  results: []
10
  ---
11
 
12
  <!-- This model card has been generated automatically according to the information the Trainer had access to. You
13
  should probably proofread and complete it, then remove this comment. -->
14
 
15
+ # nllb-finetuned-ar-en
16
 
17
+ This model is a fine-tuned version of [facebook/nllb-200-distilled-600M](https://huggingface.co/facebook/nllb-200-distilled-600M) on the None dataset.
18
  It achieves the following results on the evaluation set:
19
+ - Loss: 0.6562
20
 
21
  ## Model description
22
 
 
41
  - seed: 42
42
  - optimizer: Adam with betas=(0.9,0.999) and epsilon=1e-08
43
  - lr_scheduler_type: linear
44
+ - num_epochs: 15
45
 
46
  ### Training results
47
 
48
  | Training Loss | Epoch | Step | Validation Loss |
49
  |:-------------:|:-----:|:----:|:---------------:|
50
+ | 1.5222 | 1.0 | 515 | 1.2649 |
51
+ | 1.2832 | 2.0 | 1030 | 1.1357 |
52
+ | 1.1181 | 3.0 | 1545 | 1.0412 |
53
+ | 1.0074 | 4.0 | 2060 | 0.9647 |
54
+ | 0.8996 | 5.0 | 2575 | 0.8987 |
55
+ | 0.8206 | 6.0 | 3090 | 0.8478 |
56
+ | 0.7525 | 7.0 | 3605 | 0.8047 |
57
+ | 0.6886 | 8.0 | 4120 | 0.7683 |
58
+ | 0.6443 | 9.0 | 4635 | 0.7336 |
59
+ | 0.5983 | 10.0 | 5150 | 0.7076 |
60
+ | 0.5675 | 11.0 | 5665 | 0.6910 |
61
+ | 0.5415 | 12.0 | 6180 | 0.6744 |
62
+ | 0.524 | 13.0 | 6695 | 0.6652 |
63
+ | 0.5028 | 14.0 | 7210 | 0.6578 |
64
+ | 0.5028 | 15.0 | 7725 | 0.6562 |
65
 
66
 
67
  ### Framework versions
generation_config.json CHANGED
@@ -3,7 +3,7 @@
3
  "bos_token_id": 0,
4
  "decoder_start_token_id": 2,
5
  "eos_token_id": 2,
6
- "max_length": 256,
7
  "pad_token_id": 1,
8
  "transformers_version": "4.44.2"
9
  }
 
3
  "bos_token_id": 0,
4
  "decoder_start_token_id": 2,
5
  "eos_token_id": 2,
6
+ "max_length": 200,
7
  "pad_token_id": 1,
8
  "transformers_version": "4.44.2"
9
  }
model.safetensors CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:0674405d32bd1f16e25e42b098df31518de67dc7058d70bd47557d2b859da7b7
3
  size 2460354912
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:766fbf0019a038f4e7f0882c41cce256453dcb2372ed9ce04cfffe4e1a01d803
3
  size 2460354912
training_args.bin CHANGED
@@ -1,3 +1,3 @@
1
  version https://git-lfs.github.com/spec/v1
2
- oid sha256:c49895d2871c87367958b55d4da854337692e6c70b604aeb2ac200d229e89b11
3
- size 5368
 
1
  version https://git-lfs.github.com/spec/v1
2
+ oid sha256:91b2d08caaa41c6e5bebe90105cb29fcb454129c02617686ebfeee592ba72a06
3
+ size 5432