---
license: apache-2.0
title: Speech 2 Text Kazakh
sdk: gradio
emoji: π
colorFrom: yellow
colorTo: yellow
sdk_version: 5.12.0
---

This is a Gradio web application that uses Meta's SeamlessM4T model to transcribe speech to text. It accepts both uploaded audio files and microphone recordings.

## Features

- Real-time speech-to-text transcription
- Support for microphone input and file uploads
- GPU acceleration when available
- Simple and intuitive interface

## Usage

1. Open the app in your browser
2. Either upload an audio file or record using your microphone
3. Wait for the transcription to appear

## Technical Details

- Model: facebook/seamless-m4t-v2-large
- Framework: Gradio
- Backend: PyTorch, Transformers
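
The pieces listed above could be wired together roughly as follows. This is a sketch under stated assumptions, not the app's actual code: the target language code `kaz` is inferred from the Space title, `librosa` is assumed for audio loading, and the function names `load_model`, `transcribe`, and `build_demo` are hypothetical. The processor call uses the `audios=` keyword shown in the Transformers documentation for SeamlessM4T; check the keyword against your installed version.

```python
from functools import lru_cache

MODEL_ID = "facebook/seamless-m4t-v2-large"  # from the Technical Details list
TARGET_LANG = "kaz"   # assumption: Kazakh text output, inferred from the title
SAMPLE_RATE = 16_000  # SeamlessM4T models expect 16 kHz audio


@lru_cache(maxsize=1)
def load_model():
    """Load processor and model once; heavy imports stay inside the function."""
    import torch
    from transformers import AutoProcessor, SeamlessM4Tv2Model

    device = "cuda" if torch.cuda.is_available() else "cpu"
    processor = AutoProcessor.from_pretrained(MODEL_ID)
    model = SeamlessM4Tv2Model.from_pretrained(MODEL_ID).to(device).eval()
    return processor, model, device


def transcribe(audio_path):
    """Transcribe one audio file; Gradio passes None when nothing was recorded."""
    if not audio_path:
        return ""
    import librosa  # assumption: any loader that resamples to 16 kHz mono works
    import torch

    processor, model, device = load_model()
    waveform, _ = librosa.load(audio_path, sr=SAMPLE_RATE, mono=True)
    inputs = processor(
        audios=waveform, sampling_rate=SAMPLE_RATE, return_tensors="pt"
    ).to(device)
    with torch.no_grad():
        # generate_speech=False asks the model for text tokens only
        output = model.generate(**inputs, tgt_lang=TARGET_LANG, generate_speech=False)
    return processor.decode(output[0].tolist()[0], skip_special_tokens=True)


def build_demo():
    """Build a Gradio interface that accepts uploads and microphone input."""
    import gradio as gr

    return gr.Interface(
        fn=transcribe,
        inputs=gr.Audio(sources=["upload", "microphone"], type="filepath"),
        outputs=gr.Textbox(label="Transcription"),
        title="Speech 2 Text Kazakh",
    )
```

Calling `build_demo().launch()` would start the app. Caching the model with `lru_cache` means the first request pays the loading cost and later requests reuse the same instance.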