metadata
license: cc
datasets:
- farabi-lab/kazakh-stt
language:
- kk
base_model:
- openai/whisper-small
tags:
- asr
- kazakh
AitASR
AitASR is a fine-tuned version of OpenAI's Whisper Small model for Automatic Speech Recognition (ASR) in the Kazakh language. It was trained on the farabi-lab/kazakh-stt
dataset to improve transcription quality for Kazakh audio.
🔧 Intended Use
The model is designed for ASR tasks involving Kazakh-language audio.
It is suitable for:
- Transcription of Kazakh speech
- Voice command recognition
- Speech-driven applications in Kazakh
⚠️ Limitations
- May perform poorly on:
- Low-quality or noisy audio
- Audio from domains significantly different from the training data
- Not optimized for real-time use without further engineering
5. Citation
If you use this model, please cite it as follows:
@article{kadyrbek2023ksd,
author = {Kadyrbek, N.; Mansurova, M.; Shomanov, A.; Makharova, G.},
title = {The Development of a Kazakh Speech Recognition Model Using a Convolutional Neural Network with Fixed Character Level Filters},
journal = {Big Data and Cognitive Computing},
year = {2023},
volume = {7},
number = {3},
pages = {132},
doi = {https://doi.org/10.3390/bdcc7030132}
}```
---
Commercial Use
For commercial use, please contact the author directly to discuss licensing terms and permissions.