--- license: apache-2.0 title: Speech 2 Text Kazakh sdk: gradio emoji: 🚀 colorFrom: yellow colorTo: yellow sdk_version: 5.12.0 --- This is a Gradio web application that uses Meta's SeamlessM4T model to transcribe speech to text. The app supports both file uploads and microphone input. ## Features - Real-time speech-to-text transcription - Support for microphone input and file uploads - GPU acceleration when available - Simple and intuitive interface ## Usage 1. Open the app in your browser 2. Either upload an audio file or record using your microphone 3. Wait for the transcription to appear ## Technical Details - Model: facebook/seamless-m4t-v2-large - Framework: Gradio - Backend: PyTorch, Transformers