Update README.md
## Downstream Performance

### Daily News Dikgang

Learn more about the dataset in the [Dataset Folder](daily-news-dikgang).

| **Model** | **5-fold Cross Validation F1** | **Test F1** |
|-----------------------------|--------------------------------------|-------------------|
| Logistic Regression + TFIDF | 60.1 | 56.2 |
| NCHLT TSN RoBERTa | 64.7 | 60.3 |
| PuoBERTa | **63.8** | **62.9** |
| PuoBERTaJW300 | 66.2 | 65.4 |

Downstream News Categorisation model 🤗 [https://huggingface.co/dsfsi/PuoBERTa-News](https://huggingface.co/dsfsi/PuoBERTa-News)
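The news categorisation checkpoint above can be exercised through the standard `transformers` pipeline API. A minimal sketch, assuming `dsfsi/PuoBERTa-News` is a standard sequence-classification export (the example sentence is illustrative; consult the model card for the exact label set):

```python
from transformers import pipeline

# Assumption: dsfsi/PuoBERTa-News is a standard sequence-classification
# export, so the generic text-classification pipeline applies.
classifier = pipeline("text-classification", model="dsfsi/PuoBERTa-News")

# Illustrative Setswana snippet; labels come from the model's own config.
preds = classifier("Dikgang tsa metshameko tsa gompieno")
print(preds)
```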
### MasakhaPOS

Performance of models on the MasakhaPOS downstream task.

| AfroXLMR-large | 83.0 |
| **Monolingual Models** | |
| NCHLT TSN RoBERTa | 82.3 |
| PuoBERTa | **83.4** |
| PuoBERTa+JW300 | 84.1 |

Downstream POS model 🤗 [https://huggingface.co/dsfsi/PuoBERTa-POS](https://huggingface.co/dsfsi/PuoBERTa-POS)
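As a sketch of how the POS checkpoint might be used, assuming it is a standard token-classification export trained on the MasakhaPOS tag set (the example sentence is illustrative):

```python
from transformers import pipeline

# Assumption: dsfsi/PuoBERTa-POS is a standard token-classification
# export trained on the MasakhaPOS tag set.
tagger = pipeline("token-classification", model="dsfsi/PuoBERTa-POS")

# Each sub-token receives a POS tag drawn from the model's own config.
tokens = tagger("Ke rata go bala dikgang")
for t in tokens:
    print(t["word"], t["entity"], round(t["score"], 3))
```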
### MasakhaNER

Performance of models on the MasakhaNER downstream task.

| AfroXLMR-large | 89.4 |
| **Monolingual Models** | |
| NCHLT TSN RoBERTa | 74.2 |
| PuoBERTa | **78.2** |
| PuoBERTa+JW300 | 80.2 |

Downstream NER model 🤗 [https://huggingface.co/dsfsi/PuoBERTa-NER](https://huggingface.co/dsfsi/PuoBERTa-NER)
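A similar sketch for the NER checkpoint, assuming `dsfsi/PuoBERTa-NER` is a token-classification export on the MasakhaNER scheme; `aggregation_strategy` merges sub-token predictions into whole entity spans (the example sentence is illustrative):

```python
from transformers import pipeline

# Assumption: dsfsi/PuoBERTa-NER is a token-classification export on the
# MasakhaNER scheme; aggregation_strategy merges sub-tokens into spans.
ner = pipeline(
    "token-classification",
    model="dsfsi/PuoBERTa-NER",
    aggregation_strategy="simple",
)

# Illustrative sentence containing a place name.
entities = ner("Ke nna kwa Gaborone")
print(entities)
```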
## Pre-Training Dataset

We used the PuoData dataset, a rich source of Setswana text, ensuring that our model is well-trained and culturally attuned.

[GitHub](https://github.com/dsfsi/PuoData), 🤗 [https://huggingface.co/datasets/dsfsi/PuoData](https://huggingface.co/datasets/dsfsi/PuoData)
## Citation Information

BibTeX reference:

```
@inproceedings{marivate2023puoberta,