Any efficient way to do diarization while keeping this model accuracy at transcribing multi-language audio?

#26

by raresmose - opened Oct 12, 2024

Oct 12, 2024

I am looking to detect speakers efficiently but haven't found a way.

I've tried the most popular solutions like AssemblyAI out there but they only work for English and I need a multi-language solution.

Do you know any?

psimm

Nov 27, 2024

https://github.com/Vaibhavs10/insanely-fast-whisper works with this model and combines it with pyannote for diarization.
Diarization quality isn't great though.

Upload images, audio, and videos by dragging in the text input, pasting, or clicking here.

Tap or paste here to upload images

· Sign up or log in to comment