kzs2t / README.md
bektim's picture
Update README.md
c153579 verified
---
license: apache-2.0
title: Speech 2 Text Kazakh
sdk: gradio
emoji: πŸš€
colorFrom: yellow
colorTo: yellow
sdk_version: 5.12.0
---
This is a Gradio web application that uses Meta's SeamlessM4T model to transcribe speech to text. The app supports both file uploads and microphone input.
## Features
- Real-time speech-to-text transcription
- Support for microphone input and file uploads
- GPU acceleration when available
- Simple and intuitive interface
## Usage
1. Open the app in your browser
2. Either upload an audio file or record using your microphone
3. Wait for the transcription to appear
## Technical Details
- Model: facebook/seamless-m4t-v2-large
- Framework: Gradio
- Backend: PyTorch, Transformers