license: mit
language:
- ja
- ko
pipeline_tag: translation
Japanese to Korean translator
This model was trained on datasets from 'The Open AI Dataset Project (AI-Hub, South Korea)'.
Details of all the datasets are available at 'AI-Hub (aihub.or.kr)'.
(A corporation, organization, or individual located outside of Korea must conclude a separate agreement with the performing organization and the National Information Society Agency (NIA) in order to use the AI data, etc. Exporting the AI data, etc. outside of Korea likewise requires a separate agreement with the performing organization and the NIA. Link)
This model is the result of research conducted using datasets built with funding from the Ministry of Science and ICT and with support from the National Information Society Agency (NIA).
Dataset list
The dataset used to train the model was created by merging the following sub-datasets:
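- 일상생활 및 구어체 한-중, 한-일 번역 병렬 말뭉치 데이터 (Daily-life and colloquial Korean-Chinese and Korean-Japanese parallel translation corpus) [Link]
- 한국어-다국어(영어 제외) 번역 말뭉치(기술과학) (Korean-multilingual translation corpus, excluding English: technical science) [Link]
- 한국어-다국어 번역 말뭉치(기초과학) (Korean-multilingual translation corpus: basic science) [Link]
- 한국어-다국어 번역 말뭉치 (인문학) (Korean-multilingual translation corpus: humanities) [Link]
- 한국어-일본어 번역 말뭉치 (Korean-Japanese translation corpus) [Link]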
To reproduce the merged dataset, you can use the code at the link below:
https://github.com/sappho192/aihub-translation-dataset
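The repository above contains the actual merging procedure, which may differ from what follows. Purely as an illustration of what merging several parallel sub-datasets can involve, here is a minimal sketch that concatenates Japanese-Korean sentence pairs, trims whitespace, and drops empty or exactly duplicated pairs. The function name `merge_parallel_corpora`, the `(ja, ko)` tuple layout, and the toy sentences are all hypothetical and are not taken from the AI-Hub data or from the repository.

```python
from typing import Iterable, List, Tuple

Pair = Tuple[str, str]  # hypothetical layout: (Japanese sentence, Korean sentence)


def merge_parallel_corpora(corpora: Iterable[Iterable[Pair]]) -> List[Pair]:
    """Concatenate sub-datasets, dropping empty and exactly duplicated pairs."""
    merged: List[Pair] = []
    seen = set()
    for corpus in corpora:
        for ja, ko in corpus:
            pair = (ja.strip(), ko.strip())
            # Skip pairs where either side is empty after trimming.
            if not pair[0] or not pair[1]:
                continue
            # Keep only the first occurrence of each pair.
            if pair in seen:
                continue
            seen.add(pair)
            merged.append(pair)
    return merged


# Toy data standing in for two sub-datasets (not real AI-Hub records).
daily_life = [("こんにちは。", "안녕하세요."), ("ありがとう。", "고마워요.")]
science = [
    ("こんにちは。 ", "안녕하세요."),       # duplicate once whitespace is trimmed
    ("水は液体です。", "물은 액체입니다."),
    ("", "빈 문장"),                        # empty source side, dropped
]

merged = merge_parallel_corpora([daily_life, science])
print(len(merged))
```

Exact-match deduplication is the simplest possible choice here; a real pipeline might also normalize punctuation or filter by sentence length, which this sketch does not attempt.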