UD-Filipino
/

tl_roberta_tgl_transition

Token Classification

Model card Files Files and versions Community

ljvmiranda921 commited on 4 days ago

Commit

2166a5e

•

1 Parent(s): f276300

Update README.md

Files changed (1) hide show

README.md +17 -1

README.md CHANGED Viewed

@@ -57,8 +57,24 @@ model-index:
     - name: Sentences F-Score
       type: f_score
       value: 0.9783715013
 ---
-Parsers for UD-NewsCrawl
 | Feature | Description |
 | --- | --- |

     - name: Sentences F-Score
       type: f_score
       value: 0.9783715013
+datasets:
+- UD-Filipino/UD_Tagalog-NewsCrawl
+base_model:
+- jcblaise/roberta-tagalog-large
+pipeline_tag: token-classification
+library_name: spacy
 ---
+<img src="https://cdn-avatars.huggingface.co/v1/production/uploads/634e20a0c1ce28f1de920cc4/k7SJny1M3lDa5CH_T1bp3.png" width="130" height="130" align="right" />
+# UD Parser (Monolingual context-sensitive vectors + transition-based parser)
+This is the spaCy pipeline trained on [UD-NewsCrawl](https://huggingface.co/datasets/UD-Filipino/UD_Tagalog-NewsCrawl).
+It uses context-sensitive vectors from [jcbalise/roberta-tagalog-large](https://huggingface.co/jcblaise/roberta-tagalog-large).
+It is trained using a transition-based parser based on [Honnibal and Johnson (2015)](https://aclanthology.org/D15-1162/) and can perform dependency parsing, lemmatization, and morphological annotation.
+The trainable lemmatizer is based on [Muller et al. (2015)](https://aclanthology.org/D15-1272/).
+More information can be found [in this blog post](https://explosion.ai/blog/edit-tree-lemmatizer).
 | Feature | Description |
 | --- | --- |