Model Description
OpenAI์ whisper-base ๋ชจ๋ธ์ ์๋ ์ธ๊ฐ์ง ๋ฐ์ดํฐ์ ์ผ๋ก ํ์ตํ ๋ชจ๋ธ์ ๋๋ค.
- ํ๊ตญ์ด ์์ฑ (https://www.aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&aihubDataSe=realm&dataSetSn=123)
- ์ฃผ์ ์์ฑ ๋ฐ์ดํฐ (https://www.aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&aihubDataSe=data&dataSetSn=71556)
- ์ฃผ์ ์์ญ๋ณ ํ์ ์์ฑ์ธ์ ๋ฐ์ดํฐ (https://www.aihub.or.kr/aihubdata/data/view.do?currMenu=115&topMenu=100&aihubDataSe=data&dataSetSn=464)
train_steps: 20000
warmup_steps: 2000
lr scheduler: linear warmup cosine decay
max learning rate: 1e-4
batch size: 256
max_grad_norm: 1.0
adamw_beta1: 0.9
adamw_beta2: 0.98
Evaluation
https://github.com/rtzr/Awesome-Korean-Speech-Recognition
์ ๋ ํฌ์งํ ๋ฆฌ์์ ์ฃผ์ ์์ญ๋ณ ํ์ ์์ฑ์ ์ ์ธํ ํ ์คํธ์ ๊ฒฐ๊ณผ์ ๋๋ค. ์๋ ํ ์ด๋ธ์์ whisper_base_komix๊ฐ ๋ณธ ๋ชจ๋ธ ์ฑ๋ฅ์ ๋๋ค.
+--------------------------+----------+-----------+---------------+------------+---------------+--------------+-------------+-------------+
| Model | cv_15_ko | fleurs_ko | kcall_testset | kconf_test | kcounsel_test | klec_testset | kspon_clean | kspon_other |
+--------------------------+----------+-----------+---------------+------------+---------------+--------------+-------------+-------------+
| whisper_base | 21.16 | 11.89 | 42.56 | 27.62 | 22.24 | 28.65 | 30.41 | 27.02 |
| whisper_base_kspon | 26.63 | 13.95 | 42.05 | 29.61 | 26.21 | 28.72 | 12.58 | 13.48 |
| whisper_base_komix | 15.42 | 7.16 | 20.86 | 14.24 | 12.64 | 13.44 | 12.26 | 12.12 |
| whisper_turbo | 5.38 | 3.95 | 5.89 | 9.77 | 4.21 | 9.27 | 16.49 | 13.54 |
+--------------------------+----------+-----------+---------------+------------+---------------+--------------+-------------+-------------+
- Downloads last month
- 9
Inference Providers
NEW
This model is not currently available via any of the supported Inference Providers.