Update README.md
README.md CHANGED

@@ -2,11 +2,13 @@
 language:
 - gl
 licence:
--
+- MIT
 tags:
 - galician
 - FLOR
 - bloom
+license: mit
+pipeline_tag: text-generation
 ---
 
 # FLOR-1.3B-GL

@@ -33,7 +35,6 @@ tags:
 - [Copyright](#copyright)
 - [License](#license)
 - [Funding](#funding)
-- [Disclaimer](#disclaimer)
 
 </details>
 

@@ -91,24 +92,7 @@ The language adaptation technique used to train FLOR-1.3B-GL is based in the use
 
 ### Training data
 
-
-It consists of 26B tokens of several corpora gathered from web crawlings and public domain data.
-
-| Dataset             | Language | Words (per-epoch) | Epochs       |
-|---------------------|----------|-------------------|--------------|
-| Wikipedia           | en       | 2169.97M          | 1.428144485  |
-| C4_es               | es       | 53709.80M         | 0.1049686196 |
-| Biomedical          | es       | 455.03M           | 0.7140722425 |
-| Legal               | es       | 995.70M           | 0.7140722425 |
-| Wikipedia           | es       | 693.60M           | 1.428144485  |
-| Gutenberg           | es       | 53.18M            | 0.7140722425 |
-| C4_ca               | ca       | 2826.00M          | 2.142216727  |
-| Biomedical          | ca       | 11.80M            | 1.428144485  |
-| RacoCatalà Noticias | ca       | 17.16M            | 2.142216727  |
-| RacoCatalà Forums   | ca       | 333.73M           | 2.142216727  |
-| CaWaC               | ca       | 57.79M            | 2.142216727  |
-| Wikipedia           | ca       | 228.01M           | 3.570361212  |
-| Vilaweb             | ca       | 50.34M            | 2.142216727  |
+
 
 
 ### Training hyperparameters
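The added `pipeline_tag: text-generation` metadata tells the Hub which inference pipeline the model card targets. A minimal usage sketch follows, assuming the repository id `proxectonos/FLOR-1.3B-GL` (illustrative; substitute the actual repo path):

```python
# Minimal sketch of the text-generation pipeline advertised by the new metadata.
# The model id below is an assumption for illustration, not confirmed by this diff.
from transformers import pipeline

generator = pipeline("text-generation", model="proxectonos/FLOR-1.3B-GL")

# Generate a short Galician continuation from a prompt.
output = generator("Santiago de Compostela é", max_new_tokens=30)
print(output[0]["generated_text"])
```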