microsoft/Phi-4-multimodal-instruct Automatic Speech Recognition โข Updated 12 days ago โข 620k โข 1.31k
view post Post 13135 We did it. Kokoro TTS (v1.0) can now run 100% locally in your browser w/ WebGPU acceleration. Real-time text-to-speech without a server. โก๏ธGenerate 10 seconds of speech in ~1 second for $0.What will you build? ๐ฅ webml-community/kokoro-webgpuThe most difficult part was getting the model running in the first place, but the next steps are simple:โ๏ธ Implement sentence splitting, allowing for streamed responses๐ Multilingual support (only phonemization left)Who wants to help? See translation 11 replies ยท ๐ฅ 32 32 ๐ 14 14 ๐ 8 8 ๐ค 5 5 ๐ 2 2 + Reply
Running 306 306 Kokoro Text-to-Speech (WebGPU) ๐ฃ High-quality speech synthesis powered by Kokoro TTS