### Language adaptation and training

The language adaptation technique used to train FLOR-1.3B-GL is based on the one used to train FLOR-1.3B, which is explained by its authors in this [Medium post](https://medium.com/@mpamies247/flor-6-3b-a-chinchilla-compliant-model-for-catalan-spanish-and-english-7cdb389a9aac). In summary, we proceeded as follows:
1) We trained our own BPE tokenizer for Galician and replaced the original FLOR-1.3B tokenizer and vocabulary with it.
2) The embeddings corresponding to tokens present in both the original and the target vocabulary (matching tokens) were used for initialization.
3) The embeddings of tokens not present in the original FLOR-1.3B vocabulary were initialized as the average of all embeddings.
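The embedding-initialization steps above can be sketched as follows. This is a minimal, self-contained illustration, not the actual training code: the function name, the dictionary-based embedding representation, and the toy vocabularies are all assumptions made for clarity.

```python
# Hypothetical sketch of steps 2) and 3): build target-vocabulary embeddings
# from a source model's embedding table. Matching tokens copy their source
# vector; tokens new to the target vocabulary get the mean of all source
# embeddings.

def init_target_embeddings(source_emb, target_vocab):
    """source_emb: dict token -> embedding vector (list of floats).
    target_vocab: iterable of tokens in the new (target) vocabulary."""
    dim = len(next(iter(source_emb.values())))
    # Average of all source embeddings, used to initialize unseen tokens.
    mean_vec = [
        sum(vec[i] for vec in source_emb.values()) / len(source_emb)
        for i in range(dim)
    ]
    target_emb = {}
    for tok in target_vocab:
        if tok in source_emb:
            target_emb[tok] = source_emb[tok]  # matching token: reuse vector
        else:
            target_emb[tok] = mean_vec  # new token: average-of-all init
    return target_emb
```

In practice, with HuggingFace Transformers one would perform the analogous operation on the model's embedding matrix after swapping in the new tokenizer, but the copy-or-average logic is the same as above.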