license: mit | |
datasets: | |
- wikipedia | |
- oscar | |
language: | |
- ja | |
- ko | |
tags: | |
- kenlm | |
- perplexity | |
- n-gram | |
- kneser-ney | |
- bigscience | |
# KenLM models | |
This repo is a copy of [edugp/kenlm](https://huggingface.co/edugp/kenlm) but for the Japanese and Korean languages. | |
The Wikipedia models were trained using the `20231106` dump. |