---
license: apache-2.0
title: Speech 2 Text Kazakh
sdk: gradio
emoji: 🚀
colorFrom: yellow
colorTo: yellow
sdk_version: 5.12.0
---


This Gradio web application uses Meta's SeamlessM4T v2 model to transcribe Kazakh speech to text. It accepts both uploaded audio files and microphone recordings.

## Features
- Real-time speech-to-text transcription
- Support for microphone input and file uploads
- GPU acceleration when available
- Simple and intuitive interface
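
The GPU fallback listed above might look like the following minimal sketch. It assumes the app picks its device through PyTorch's `torch.cuda.is_available()`; the helper name `pick_device` is hypothetical and not taken from the app's source.

```python
def pick_device() -> str:
    """Return "cuda" when PyTorch can see a GPU, otherwise "cpu"."""
    try:
        import torch  # imported lazily so the helper also works without PyTorch
    except ImportError:
        # Assumption for this sketch: no PyTorch means we report CPU.
        return "cpu"
    return "cuda" if torch.cuda.is_available() else "cpu"


device = pick_device()
print(f"Running on: {device}")
```

The returned string can be passed to `model.to(device)` so the same code runs on CPU-only hardware.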

## Usage
1. Open the app in your browser
2. Either upload an audio file or record using your microphone
3. Wait for the transcription to appear
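
The two input paths in the steps above can be sketched with a single Gradio audio component. This is an illustrative layout, not the app's actual code: `build_demo` and `transcribe_stub` are hypothetical names, and the stub only echoes the file path instead of running the model.

```python
def transcribe_stub(audio_path):
    """Placeholder callback; a real app would run the speech model here."""
    if audio_path is None:
        # Gradio passes None when the user submits without any audio.
        return "No audio provided."
    return f"Received audio file: {audio_path}"


def build_demo(transcribe=transcribe_stub):
    """Build an interface that accepts an uploaded file or a microphone recording."""
    import gradio as gr  # imported lazily so the stub can be used without Gradio

    return gr.Interface(
        fn=transcribe,
        # One component covers both sources; the callback receives a file path.
        inputs=gr.Audio(sources=["upload", "microphone"], type="filepath"),
        outputs=gr.Textbox(label="Transcription"),
        title="Speech 2 Text Kazakh",
    )
```

Calling `build_demo().launch()` would start the local server and print the URL to open in a browser.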

## Technical Details
- Model: facebook/seamless-m4t-v2-large
- Framework: Gradio
- Backend: PyTorch, Transformers
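
As a rough illustration of how the listed model could be called through Transformers, the sketch below loads the checkpoint and decodes one waveform. Several points are assumptions rather than details from this app: the function names are hypothetical, the target language code `"kaz"` is inferred from the Space title, the input is assumed to be a 16 kHz mono float array, and the processor keyword `audios` may differ between Transformers releases.

```python
MODEL_ID = "facebook/seamless-m4t-v2-large"
TARGET_LANG = "kaz"  # assumption: Kazakh, inferred from the Space title
SAMPLE_RATE = 16_000  # assumption: audio is resampled to 16 kHz beforehand


def load_model(device="cpu"):
    """Load the processor and speech-to-text model (downloads weights on first use)."""
    from transformers import AutoProcessor, SeamlessM4Tv2ForSpeechToText

    processor = AutoProcessor.from_pretrained(MODEL_ID)
    model = SeamlessM4Tv2ForSpeechToText.from_pretrained(MODEL_ID).to(device)
    return processor, model


def transcribe(waveform, processor, model, device="cpu"):
    """Transcribe a mono float waveform sampled at SAMPLE_RATE."""
    import torch

    # The `audios` keyword is an assumption; check the installed Transformers version.
    inputs = processor(
        audios=waveform, sampling_rate=SAMPLE_RATE, return_tensors="pt"
    ).to(device)
    with torch.no_grad():
        tokens = model.generate(**inputs, tgt_lang=TARGET_LANG)
    return processor.batch_decode(tokens, skip_special_tokens=True)[0]
```

Loading the model once at startup and reusing it across requests avoids repeating the download and initialization cost for every transcription.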