ljvmiranda921
commited on
Commit
•
2166a5e
1
Parent(s):
f276300
Update README.md
Browse files
README.md
CHANGED
@@ -57,8 +57,24 @@ model-index:
|
|
57 |
- name: Sentences F-Score
|
58 |
type: f_score
|
59 |
value: 0.9783715013
|
|
|
|
|
|
|
|
|
|
|
|
|
60 |
---
|
61 |
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
62 |
|
63 |
| Feature | Description |
|
64 |
| --- | --- |
|
|
|
57 |
- name: Sentences F-Score
|
58 |
type: f_score
|
59 |
value: 0.9783715013
|
60 |
+
datasets:
|
61 |
+
- UD-Filipino/UD_Tagalog-NewsCrawl
|
62 |
+
base_model:
|
63 |
+
- jcblaise/roberta-tagalog-large
|
64 |
+
pipeline_tag: token-classification
|
65 |
+
library_name: spacy
|
66 |
---
|
67 |
+
|
68 |
+
<img src="https://cdn-avatars.huggingface.co/v1/production/uploads/634e20a0c1ce28f1de920cc4/k7SJny1M3lDa5CH_T1bp3.png" width="130" height="130" align="right" />
|
69 |
+
|
70 |
+
# UD Parser (Monolingual context-sensitive vectors + transition-based parser)
|
71 |
+
|
72 |
+
This is the spaCy pipeline trained on [UD-NewsCrawl](https://huggingface.co/datasets/UD-Filipino/UD_Tagalog-NewsCrawl).
|
73 |
+
It uses context-sensitive vectors from [jcbalise/roberta-tagalog-large](https://huggingface.co/jcblaise/roberta-tagalog-large).
|
74 |
+
It is trained using a transition-based parser based on [Honnibal and Johnson (2015)](https://aclanthology.org/D15-1162/) and can perform dependency parsing, lemmatization, and morphological annotation.
|
75 |
+
The trainable lemmatizer is based on [Muller et al. (2015)](https://aclanthology.org/D15-1272/).
|
76 |
+
More information can be found [in this blog post](https://explosion.ai/blog/edit-tree-lemmatizer).
|
77 |
+
|
78 |
|
79 |
| Feature | Description |
|
80 |
| --- | --- |
|