Commit 364393a, committed by khalidsaifullaah
1 Parent(s): dacff11
Update README.md
README.md
CHANGED
@@ -9,7 +9,7 @@ Bengali GPT-2 demo. Part of the [Huggingface JAX/Flax event](https://discuss.hug
 
 # Model Description
 
-OpenAI GPT-2 model was proposed in [Language Models are Unsupervised Multitask Learners](https://paperswithcode.com/paper/language-models-are-unsupervised-multitask) paper .Original GPT2 model was a causal (unidirectional) transformer pretrained using language modeling on a very large corpus of ~40 GB of text data. This model has same configuration but has been pretrained on bengali corpus of mC4(multilingual C4) dataset. The code for training the model has all been open-sourced [here](https://huggingface.co/flax-community/gpt2-bengali/tree/main.
+OpenAI GPT-2 model was proposed in [Language Models are Unsupervised Multitask Learners](https://paperswithcode.com/paper/language-models-are-unsupervised-multitask) paper .Original GPT2 model was a causal (unidirectional) transformer pretrained using language modeling on a very large corpus of ~40 GB of text data. This model has same configuration but has been pretrained on bengali corpus of mC4(multilingual C4) dataset. The code for training the model has all been open-sourced [here](https://huggingface.co/flax-community/gpt2-bengali/tree/main).
 
 # Training Details
 