hyunwoongko committed · Commit f7a396b
Parent(s): eae0ac9
Update README.md
README.md CHANGED
@@ -58,7 +58,7 @@ Furthermore, in order to avoid the model memorizing and generating personally id
 * `<|tell|>` : phone number
 
 ## Training procedure
-Polyglot-Ko-5.8B was trained for
+Polyglot-Ko-5.8B was trained for 172 billion tokens over 320,000 steps on 256 A100 GPUs with the [GPT-NeoX framework](https://github.com/EleutherAI/gpt-neox). It was trained as an autoregressive language model, using cross-entropy loss to maximize the likelihood of predicting the next token.
 
 ## How to use
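The added line describes the training objective: at each position the model's logits are scored against the next token with cross-entropy, averaged over the sequence. This is not the GPT-NeoX code itself, just a minimal pure-Python sketch of that shift-by-one objective; the function name and the toy logits are illustrative assumptions.

```python
import math

def next_token_loss(logits, token_ids):
    """Average cross-entropy of predicting each next token.

    logits: one list of vocabulary scores per input position
    token_ids: the input token sequence (ints indexing the vocab)
    Position t's logits are scored against token t+1 (shift by one),
    which is the standard autoregressive language-modeling objective.
    """
    total = 0.0
    steps = 0
    for t in range(len(token_ids) - 1):
        row = logits[t]
        target = token_ids[t + 1]
        # log-softmax via the log-sum-exp trick for numerical stability
        m = max(row)
        log_z = m + math.log(sum(math.exp(x - m) for x in row))
        total += log_z - row[target]  # equals -log p(target | context)
        steps += 1
    return total / steps

# Toy example: vocab of size 3, sequence [0, 2, 1], two prediction steps.
loss = next_token_loss([[2.0, 0.0, 1.0],   # scores after seeing token 0
                        [0.0, 3.0, 0.0]],  # scores after tokens 0, 2
                       [0, 2, 1])
```

Training drives this average negative log-likelihood down, which is equivalent to maximizing the likelihood of the observed next tokens, as the README sentence states.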