Update README.md
Browse files
README.md
CHANGED
@@ -21,6 +21,8 @@ I undertook the following preprocessing steps:
|
|
21 |
|
22 |
The result is a corpus of ~390 million tokens.
|
23 |
|
|
|
|
|
24 |
### Contact
|
25 |
|
26 |
For contact, reach out to Phillip Ströbel [via mail](mailto:[email protected]) or [via Twitter](https://twitter.com/CLingophil).
|
|
|
21 |
|
22 |
The result is a corpus of ~390 million tokens.
|
23 |
|
24 |
+
The dataset used to train this model is available [HERE](https://huggingface.co/datasets/pstroe/cc100-latin).
|
25 |
+
|
26 |
### Contact
|
27 |
|
28 |
For contact, reach out to Phillip Ströbel [via mail](mailto:[email protected]) or [via Twitter](https://twitter.com/CLingophil).
|