jonabur committed
Commit c251a8a · 1 parent: a0340bb
update for 500B release

README.md CHANGED
@@ -14,7 +14,7 @@ datasets:
 
 _**NOTE:** This is a **research checkpoint** of a model for which **training has not been completed.** It is being provided in its current state for research and testing purposes. **Care should be taken when using the outputs of the model.** Once pretraining has completed we intend to release additional instruction-tuned and chat-tuned varieties._
 
-Poro is a 34B parameter decoder-only transformer pretrained on Finnish, English and code. It is being trained on 1 trillion tokens (
+Poro is a 34B parameter decoder-only transformer pretrained on Finnish, English and code. It is being trained on 1 trillion tokens (500 billion as of this release). Poro is a fully open source model and is made available under the Apache 2.0 License.
 
 Poro was created in a collaboration between [SiloGen](https://www.silo.ai/silogen) from [Silo AI](https://www.silo.ai/), the [TurkuNLP group](https://turkunlp.org/) of the University of Turku, and [High Performance Language Technologies](https://hplt-project.org/) (HPLT). Training was conducted on the [LUMI supercomputer](https://www.lumi-supercomputer.eu/), using compute resources generously provided by [CSC](https://csc.fi/) - IT Center for Science, Finland.
 
@@ -45,6 +45,8 @@ Checkpoints are available as branches in the repository. Checkpoints will be re
 * [100B](https://huggingface.co/LumiOpen/Poro-34B/tree/100B)
 * [200B](https://huggingface.co/LumiOpen/Poro-34B/tree/200B)
 * [300B](https://huggingface.co/LumiOpen/Poro-34B/tree/300B)
+* [400B](https://huggingface.co/LumiOpen/Poro-34B/tree/400B)
+* [500B](https://huggingface.co/LumiOpen/Poro-34B/tree/500B)
 
 The transformers library allows you to load a checkpoint from a branch as follows:
 
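The README's own code example for loading a branch checkpoint is not captured in this diff excerpt. As a minimal sketch, selecting a checkpoint branch with the transformers library uses the `revision` argument of `from_pretrained`; the model id and branch names below are taken from the checkpoint list above, and the generation prompt is illustrative only:

```python
# Minimal sketch (assumed usage, not the README's verbatim example):
# load a specific checkpoint by pointing `revision` at its branch name.
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer

branch = "500B"  # any branch from the checkpoint list, e.g. "100B" ... "500B"

tokenizer = AutoTokenizer.from_pretrained("LumiOpen/Poro-34B")
model = AutoModelForCausalLM.from_pretrained(
    "LumiOpen/Poro-34B",
    revision=branch,            # selects the checkpoint branch in the repo
    torch_dtype=torch.bfloat16,
    device_map="auto",          # requires `accelerate`; omit to load on CPU
)

inputs = tokenizer("Poro on", return_tensors="pt").to(model.device)
outputs = model.generate(**inputs, max_new_tokens=20)
print(tokenizer.decode(outputs[0]))
```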