ait-asr / README.md
nur-dev's picture
Create README.md
e148b6e verified
metadata
license: cc
datasets:
  - farabi-lab/kazakh-stt
language:
  - kk
base_model:
  - openai/whisper-small
tags:
  - asr
  - kazakh

AitASR

AitASR is a fine-tuned version of OpenAI's Whisper Small model for Automatic Speech Recognition (ASR) in the Kazakh language. It was trained on the farabi-lab/kazakh-stt dataset to improve transcription quality for Kazakh audio.


🔧 Intended Use

The model is designed for ASR tasks involving Kazakh-language audio.
It is suitable for:

  • Transcription of Kazakh speech
  • Voice command recognition
  • Speech-driven applications in Kazakh

⚠️ Limitations

  • May perform poorly on:
    • Low-quality or noisy audio
    • Audio from domains significantly different from the training data
  • Not optimized for real-time use without further engineering

5. Citation

If you use this model, please cite it as follows:

@article{kadyrbek2023ksd,
  author = {Kadyrbek, N.; Mansurova, M.; Shomanov, A.; Makharova, G.},
  title = {The Development of a Kazakh Speech Recognition Model Using a Convolutional Neural Network with Fixed Character Level Filters},
  journal = {Big Data and Cognitive Computing},
  year = {2023},
  volume = {7},
  number = {3},
  pages = {132},
  doi = {https://doi.org/10.3390/bdcc7030132}
}```

---
Commercial Use
For commercial use, please contact the author directly to discuss licensing terms and permissions.