File size: 719 Bytes
70fbf37 c153579 70fbf37 6ff77d3 1c64252 6ff77d3 |
1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 28 |
---
license: apache-2.0
title: Speech 2 Text Kazakh
sdk: gradio
emoji: π
colorFrom: yellow
colorTo: yellow
sdk_version: 5.12.0
---
This is a Gradio web application that uses Meta's SeamlessM4T model to transcribe speech to text. The app supports both file uploads and microphone input.
## Features
- Real-time speech-to-text transcription
- Support for microphone input and file uploads
- GPU acceleration when available
- Simple and intuitive interface
## Usage
1. Open the app in your browser
2. Either upload an audio file or record using your microphone
3. Wait for the transcription to appear
## Technical Details
- Model: facebook/seamless-m4t-v2-large
- Framework: Gradio
- Backend: PyTorch, Transformers |