|
--- |
|
license: apache-2.0 |
|
base_model: Helsinki-NLP/opus-mt-ar-en |
|
language: |
|
- ar |
|
- en |
|
pipeline_tag: translation |
|
widget: |
|
- text: 'salam ,labas ?' |
|
- text: ' kanbghik bzaf' |
|
--- |
|
--- |
|
license: apache-2.0 |
|
base_model: Helsinki-NLP/opus-mt-ar-en |
|
|
|
|
|
|
|
# This model's role is to translate Daraija with Latin words or Arabizi into English. It was trained on 60,000 rows of translation examples. |
|
|
|
This model is a fine-tuned version of [Helsinki-NLP/opus-mt-ar-en](https://huggingface.co/Helsinki-NLP/opus-mt-ar-en) on anDarija Open Dataset (DODa), an ambitious open-source project dedicated to the Moroccan dialect. With about 150,000 entries, DODa is arguably the largest open-source collaborative project for Darija <=> English translation built for Natural Language Processing purposes. |
|
|
|
|
|
|
|
### Training hyperparameters |
|
|
|
The following hyperparameters were used during training: |
|
- GPU : A100 |
|
- train_batch_size: 32 |
|
- eval_batch_size: 32 |
|
- num_epochs: 5 |
|
- mixed_precision_training: True FP16 enabled |