Model Specification

  • Model: RoBERTa Tagalog Base (Jan Christian Blaise Cruz)
  • Training Data:
    • Naija / Nigerian Pidgin corpora (Top 2 Language)
  • Training Details:
    • Base configurations
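
A minimal sketch of how a tagger like this might be loaded with the Hugging Face `transformers` pipeline. It assumes the checkpoint is published as a token-classification model under the repository ID that appears on this page; `load_tagger` and `tag` are hypothetical helper names, not part of the model release. With the default (non-aggregated) pipeline output, tags are returned per subword token rather than per word.

```python
# Repository ID as it appears on this page (assumed to host a token-classification head).
MODEL_ID = "iceman2434/roberta-tagalog-base-ft-udpos213-pcm"


def load_tagger(model_id: str = MODEL_ID):
    """Build a token-classification pipeline; downloads weights on first use."""
    # Imported lazily so this module can be imported without transformers installed.
    from transformers import pipeline

    return pipeline("token-classification", model=model_id)


def tag(sentence, tagger):
    """Return (subword, tag) pairs from the pipeline's non-aggregated output."""
    return [(item["word"], item["entity"]) for item in tagger(sentence)]
```

Usage would look like `tag("Kumain ako ng mangga.", load_tagger())`. Mapping subword predictions back to whole words (for example, keeping the tag of each word's first subword) is left out of this sketch.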

Evaluation

  • Evaluation Dataset: Universal Dependencies Tagalog Ugnayan (test set)
  • Setting: zero-shot cross-lingual evaluation on this test set
  • Result: 45.94% accuracy
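
The card does not say how the accuracy figure is computed. A common convention for POS tagging is token-level accuracy: the fraction of tokens whose predicted tag equals the gold tag. The sketch below assumes that convention; `token_accuracy` is a hypothetical helper and the tag sequences are toy data, not Ugnayan results.

```python
def token_accuracy(gold, predicted):
    """Fraction of tokens whose predicted tag matches the gold tag.

    Both arguments are lists of sentences; each sentence is a list of tags.
    """
    if len(gold) != len(predicted):
        raise ValueError("gold and predicted must contain the same number of sentences")
    total = 0
    correct = 0
    for gold_tags, pred_tags in zip(gold, predicted):
        if len(gold_tags) != len(pred_tags):
            raise ValueError("each sentence must have one predicted tag per gold tag")
        for g, p in zip(gold_tags, pred_tags):
            total += 1
            correct += int(g == p)
    return correct / total if total else 0.0


# Toy example: 4 of 5 tags match.
gold = [["NOUN", "VERB"], ["PRON", "VERB", "PUNCT"]]
predicted = [["NOUN", "ADV"], ["PRON", "VERB", "PUNCT"]]
score = token_accuracy(gold, predicted)  # 0.8
```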

POS Tags

  • ADJ – ADP – ADV – CCONJ – DET – INTJ – NOUN – NUM – PART – PRON – PROPN – PUNCT – SCONJ – VERB
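
The tag list above can be expressed as a label mapping of the kind used by token-classification models. This is a sketch only: the index order below simply follows the listing order and is an assumption, so the model's own `config.json` should be treated as authoritative. The list covers 14 of the 17 UD universal POS tags; AUX, SYM, and X do not appear in it.

```python
# Tags as listed on this card; the index order is an assumption.
POS_TAGS = [
    "ADJ", "ADP", "ADV", "CCONJ", "DET", "INTJ", "NOUN",
    "NUM", "PART", "PRON", "PROPN", "PUNCT", "SCONJ", "VERB",
]

label2id = {tag: idx for idx, tag in enumerate(POS_TAGS)}
id2label = {idx: tag for tag, idx in label2id.items()}

# UD universal POS tags that are absent from the list above.
UD_UPOS = set(POS_TAGS) | {"AUX", "SYM", "X"}
missing_tags = sorted(UD_UPOS - set(POS_TAGS))
```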

Model Size

  • 109M params (Safetensors, tensor type F32)
