Adriane Boyd
Add vi_udv25_vietnamesevtb_trf-0.0.1
7506945
metadata
tags:
  - spacy
  - token-classification
language:
  - vi
license: cc-by-sa-4.0
model-index:
  - name: vi_udv25_vietnamesevtb_trf
    results:
      - task:
          name: TAG
          type: token-classification
        metrics:
          - name: TAG (XPOS) Accuracy
            type: accuracy
            value: 0.8805048216
      - task:
          name: POS
          type: token-classification
        metrics:
          - name: POS (UPOS) Accuracy
            type: accuracy
            value: 0.9018631331
      - task:
          name: MORPH
          type: token-classification
        metrics:
          - name: Morph (UFeats) Accuracy
            type: accuracy
            value: 0.9695345305
      - task:
          name: LEMMA
          type: token-classification
        metrics:
          - name: Lemma Accuracy
            type: accuracy
            value: 0.8934519139
      - task:
          name: UNLABELED_DEPENDENCIES
          type: token-classification
        metrics:
          - name: Unlabeled Attachment Score (UAS)
            type: f_score
            value: 0.6807696182
      - task:
          name: LABELED_DEPENDENCIES
          type: token-classification
        metrics:
          - name: Labeled Attachment Score (LAS)
            type: f_score
            value: 0.6063552526
      - task:
          name: SENTS
          type: token-classification
        metrics:
          - name: Sentences F-Score
            type: f_score
            value: 0.943275972

UD v2.5 benchmarking pipeline for UD_Vietnamese-VTB

Feature Description
Name vi_udv25_vietnamesevtb_trf
Version 0.0.1
spaCy >=3.2.1,<3.3.0
Default Pipeline experimental_char_ner_tokenizer, transformer, tagger, morphologizer, parser, experimental_edit_tree_lemmatizer
Components experimental_char_ner_tokenizer, transformer, senter, tagger, morphologizer, parser, experimental_edit_tree_lemmatizer
Vectors 0 keys, 0 unique vectors (0 dimensions)
Sources Universal Dependencies v2.5 (Zeman, Daniel; et al.)
License CC BY-SA 4.0
Author Explosion

Label Scheme

View label scheme (81 labels for 6 components)
Component Labels
experimental_char_ner_tokenizer TOKEN
senter I, S
tagger !, ", ,, -, ., ..., :, ;, ?, @, A, C, CC, E, I, L, LBKT, M, N, NP, Nb, Nc, Np, Nu, Ny, P, R, RBKT, T, V, VP, X, Y, Z
morphologizer POS=NOUN, POS=ADP, POS=X|Polarity=Neg, POS=VERB, POS=ADJ, POS=PUNCT, POS=X, POS=SCONJ, NumType=Card|POS=NUM, POS=DET, POS=CCONJ, POS=PROPN, POS=AUX, POS=PART, POS=INTJ
parser ROOT, advcl, advmod, amod, appos, aux, aux:pass, case, cc, ccomp, compound, conj, cop, csubj, dep, det, discourse, iobj, list, mark, nmod, nsubj, nummod, obj, obl, parataxis, punct, xcomp
experimental_edit_tree_lemmatizer 0

Accuracy

Type Score
TOKEN_F 87.90
TOKEN_P 86.84
TOKEN_R 89.00
TOKEN_ACC 98.42
SENTS_F 94.33
SENTS_P 96.23
SENTS_R 92.50
TAG_ACC 88.05
POS_ACC 90.19
MORPH_ACC 96.95
DEP_UAS 68.08
DEP_LAS 60.64
LEMMA_ACC 89.35