Update README.md
README.md
CHANGED
@@ -17,14 +17,13 @@ widget:
 <!-- Provide a quick summary of what the model is/does. -->
 
 This is the first model for Named Entity Recognition (NER) in the Moroccan dialect (Darija). The model was trained on the very first NER dataset in Darija, DarNERcorp, that can be found on Mendeley https://data.mendeley.com/datasets/286sss4k9v/4.
-The model was trained by finetuning AraBERT on a mixed dataset composed of DarNERcorp and ANERcorp.
 
 ### Model Description
 
 <!-- Provide a longer summary of what this model is. -->
 
 - **Developed by:** Hanane Nour Moussa
-- **Model type:**
+- **Model type:** Token classification
 - **Language(s) (NLP):** Arabic, Darija
 
 ### Model Sources
@@ -34,10 +33,6 @@ The model was trained by finetuning AraBERT on a mixed dataset composed of DarNE
 - **Repository:** https://github.com/HananeNourMoussa/darija-ner (private for the time being)
 - **Paper (dataset):** Hanane Nour Moussa, Asmaa Mourhir, DarNERcorp: An annotated named entity recognition dataset in the Moroccan dialect, Data in Brief
 
-#### Training Hyperparameters
-
-The model was trained for 5 epochs using a learning rate of 3e-5 for BERT and 1e-4 for the classifier. BERT hidden dropout was set to 0.3.
-
 #### Metrics
 
 <!-- These are the evaluation metrics being used, ideally with a description of why. -->
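Note on the new "Token classification" model type: a token-classification model emits one label per token, so NER output usually has to be merged into entity spans afterwards. The sketch below shows one way that merging could look. It assumes a BIO tagging scheme with labels such as `B-PER` and `I-LOC`; the README does not state the label set, and `group_entities` and the sample tokens are hypothetical names used only for illustration, not part of this repository.

```python
from typing import List, Optional, Tuple


def group_entities(tokens: List[str], tags: List[str]) -> List[Tuple[str, str]]:
    """Merge per-token BIO tags into (entity_text, entity_type) pairs.

    Assumes BIO labels ("B-X", "I-X", "O"); the actual label set of the
    model is not given in the README, so this is illustrative only.
    """
    entities: List[Tuple[str, str]] = []
    current_tokens: List[str] = []
    current_type: Optional[str] = None

    def close_entity() -> None:
        # Emit the entity collected so far, if any, and reset the buffer.
        nonlocal current_tokens, current_type
        if current_tokens and current_type is not None:
            entities.append((" ".join(current_tokens), current_type))
        current_tokens, current_type = [], None

    for token, tag in zip(tokens, tags):
        if tag.startswith("B-"):
            # A B- tag always starts a new entity.
            close_entity()
            current_tokens, current_type = [token], tag[2:]
        elif tag.startswith("I-") and current_type == tag[2:]:
            # An I- tag of the same type extends the open entity.
            current_tokens.append(token)
        else:
            # "O", or an I- tag that does not continue the open entity,
            # closes it; the stray token itself is not kept.
            close_entity()

    close_entity()
    return entities


# Hypothetical transliterated example; tokens and labels are made up.
sample_tokens = ["Ahmed", "mcha", "l", "Dar", "Beida"]
sample_tags = ["B-PER", "O", "O", "B-LOC", "I-LOC"]
result = group_entities(sample_tokens, sample_tags)
```

Treating a stray `I-` tag as a boundary rather than as the start of a new entity is a design choice in this sketch; some decoders promote it to `B-` instead.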