|
--- |
|
license: cc |
|
datasets: |
|
- farabi-lab/kazakh-stt |
|
language: |
|
- kk |
|
base_model: |
|
- openai/whisper-small |
|
tags: |
|
- asr |
|
- kazakh |
|
--- |
|
# AitASR |
|
**AitASR** is a fine-tuned version of OpenAI's Whisper Small model for Automatic Speech Recognition (ASR) in the Kazakh language. It was trained on the [`farabi-lab/kazakh-stt`](https://huggingface.co/datasets/farabi-lab/kazakh-stt) dataset to improve transcription quality for Kazakh audio. |
|
|
|
--- |
|
|
|
## 🔧 Intended Use |
|
The model is designed for ASR tasks involving Kazakh-language audio. |
|
It is suitable for: |
|
- Transcription of Kazakh speech |
|
- Voice command recognition |
|
- Speech-driven applications in Kazakh |
|
|
|
--- |
|
|
|
## ⚠️ Limitations |
|
- May perform poorly on: |
|
- Low-quality or noisy audio |
|
- Audio from domains significantly different from the training data |
|
- Not optimized for real-time use without further engineering |
|
|
|
## 5. Citation |
|
If you use this model, please cite it as follows: |
|
|
|
```bibtex |
|
@article{kadyrbek2023ksd, |
|
author = {Kadyrbek, N.; Mansurova, M.; Shomanov, A.; Makharova, G.}, |
|
title = {The Development of a Kazakh Speech Recognition Model Using a Convolutional Neural Network with Fixed Character Level Filters}, |
|
journal = {Big Data and Cognitive Computing}, |
|
year = {2023}, |
|
volume = {7}, |
|
number = {3}, |
|
pages = {132}, |
|
doi = {https://doi.org/10.3390/bdcc7030132} |
|
}``` |
|
|
|
--- |
|
Commercial Use |
|
For commercial use, please contact the author directly to discuss licensing terms and permissions. |