# Update README.md
---
language:
- es
- qu

tags:
- quechua
- translation
- spanish

license: apache-2.0

metrics:
- bleu
- sacrebleu

widget:
- text: "Dios ama a los hombres"
- text: "A pesar de todo, soy feliz"
- text: "¿Qué harán allí?"
- text: "Debes aprender a respetar"
---
# t5-small-finetuned-spanish-to-quechua

This model is a fine-tuned version of [t5-small](https://huggingface.co/t5-small) for Spanish-to-Quechua translation.

## Model description

t5-small-finetuned-spanish-to-quechua was trained for 46 epochs on 102,747 sentences; 12,844 sentences were used for validation and 12,843 for the test set.

## Intended uses & limitations

A large part of the dataset has been extracted from biblical texts, which makes the model perform better with certain types of sentences.

### How to use

You can import this model as follows:

```python
>>> from transformers import AutoModelForSeq2SeqLM, AutoTokenizer
>>> model_name = 'hackathon-pln-es/t5-small-finetuned-spanish-to-quechua'
>>> model = AutoModelForSeq2SeqLM.from_pretrained(model_name)
>>> tokenizer = AutoTokenizer.from_pretrained(model_name)
```

To translate, you can do the following.
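The snippet below is a minimal sketch that continues from the code above; the input sentence is taken from the widget examples, and the generation settings (for example `max_length`) are illustrative choices rather than the card's original values:

```python
>>> # Sketch: translate one of the widget sentences (settings are illustrative)
>>> sentence = "Dios ama a los hombres"
>>> inputs = tokenizer(sentence, return_tensors="pt")
>>> outputs = model.generate(inputs["input_ids"], max_length=40)
>>> print(tokenizer.decode(outputs[0], skip_special_tokens=True))
```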
### Limitations and bias

Currently, this model can only translate to Quechua of Ayacucho.

## Training data

To train this model, we used the [Spanish to Quechua dataset](https://huggingface.co/datasets/hackathon-pln-es/spanish-to-quechua).
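As a minimal sketch (not part of the original card), the corpus can be loaded for inspection with the `datasets` library; the split and column names are whatever the dataset repository defines:

```python
>>> # Sketch: load the training corpus from the Hub and inspect its structure
>>> from datasets import load_dataset
>>> dataset = load_dataset('hackathon-pln-es/spanish-to-quechua')
>>> print(dataset)  # shows the available splits and column names
```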
## Evaluation results
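As an illustrative sketch (not the card's own evaluation code), the BLEU and sacrebleu metrics declared in the metadata above can be computed with the `evaluate` library; the prediction and reference strings below are placeholders:

```python
>>> # Sketch: placeholder strings stand in for real model outputs and references
>>> import evaluate
>>> sacrebleu = evaluate.load('sacrebleu')
>>> sacrebleu.compute(predictions=['<model translation>'],
...                   references=[['<reference translation>']])
```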