medspeechrec / README.md
Kabir259's picture
Update README.md
8377c77 verified

A newer version of the Gradio SDK is available: 5.15.0

Upgrade
metadata
title: Medical Speech Recognition
emoji: ⚕️
colorFrom: blue
colorTo: pink
sdk: gradio
sdk_version: 5.6.0
app_file: app.py
pinned: true
license: mit
short_description: ASR for medical audio snippets! Speak away your symptoms :)

Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference

Medical Audio Consultation System

This is a Gradio application that processes audio input through ASR models and provides medical consultation using an LLM.

Models Used

  • ASR Models:
    • wav2vec2-base
    • wav2vec2-BERT
    • whisper-small

Features

  • Audio recording and processing
  • Multiple ASR model options
  • Audio filtering pipeline

Usage

  1. Select an ASR model from the dropdown
  2. Record your audio