nur-dev
/

ait-asr

Model card Files Files and versions Community

ait-asr / README.md

nur-dev's picture

Create README.md

e148b6e verified 28 days ago

|

history blame contribute delete

1.46 kB

	---
	license: cc
	datasets:
	- farabi-lab/kazakh-stt
	language:
	- kk
	base_model:
	- openai/whisper-small
	tags:
	- asr
	- kazakh
	---
	# AitASR
	AitASR is a fine-tuned version of OpenAI's Whisper Small model for Automatic Speech Recognition (ASR) in the Kazakh language. It was trained on the [`farabi-lab/kazakh-stt`](https://huggingface.co/datasets/farabi-lab/kazakh-stt) dataset to improve transcription quality for Kazakh audio.

	---

	## 🔧 Intended Use
	The model is designed for ASR tasks involving Kazakh-language audio.
	It is suitable for:
	- Transcription of Kazakh speech
	- Voice command recognition
	- Speech-driven applications in Kazakh

	---

	## ⚠️ Limitations
	- May perform poorly on:
	- Low-quality or noisy audio
	- Audio from domains significantly different from the training data
	- Not optimized for real-time use without further engineering

	## 5. Citation
	If you use this model, please cite it as follows:

	```bibtex
	@article{kadyrbek2023ksd,
	author = {Kadyrbek, N.; Mansurova, M.; Shomanov, A.; Makharova, G.},
	title = {The Development of a Kazakh Speech Recognition Model Using a Convolutional Neural Network with Fixed Character Level Filters},
	journal = {Big Data and Cognitive Computing},
	year = {2023},
	volume = {7},
	number = {3},
	pages = {132},
	doi = {https://doi.org/10.3390/bdcc7030132}
	}```

	---
	Commercial Use
	For commercial use, please contact the author directly to discuss licensing terms and permissions.