2 30 260

david lee

dwidlee

fritzprix

AI & ML interests

None yet

Recent Activity

liked a model 2 days ago

SparkAudio/Spark-TTS-0.5B

liked a model 4 days ago

calcuis/wan-gguf

liked a model 6 days ago

onnx-community/Qwen2.5-0.5B-Instruct

View all activity

Organizations

dwidlee's activity

liked a model 2 days ago

SparkAudio/Spark-TTS-0.5B

Text-to-Speech • Updated 6 days ago • 6.81k • 351

liked a model 4 days ago

calcuis/wan-gguf

Text-to-Video • Updated 12 days ago • 61k • 43

liked 2 models 6 days ago

onnx-community/Qwen2.5-0.5B-Instruct

Text Generation • Updated Oct 8, 2024 • 690 • 6

HuggingFaceTB/SmolLM2-1.7B-Instruct

Text Generation • Updated 7 days ago • 420k • • 573

liked a model 7 days ago

Comfy-Org/Wan_2.1_ComfyUI_repackaged

Updated 6 days ago • 278

updated a collection 9 days ago

Disruptive

Collection

2 items • Updated 9 days ago

liked a model 12 days ago

facebook/mms-tts-fra

Text-to-Speech • Updated Sep 1, 2023 • 3.37k • • 8

liked a Space 13 days ago

180

Kokoro Text-to-Speech

🗣

High-quality speech synthesis powered by Kokoro TTS

reacted to Xenova's post with 😎 13 days ago

Post

6436

Introducing Kokoro.js, a new JavaScript library for running Kokoro TTS, an 82 million parameter text-to-speech model, 100% locally in the browser w/ WASM. Powered by 🤗 Transformers.js. WebGPU support coming soon!
👉 npm i kokoro-js 👈

Try it out yourself: webml-community/kokoro-web
Link to models/samples: onnx-community/Kokoro-82M-ONNX

You can get started in just a few lines of code!

import { KokoroTTS } from "kokoro-js";

const tts = await KokoroTTS.from_pretrained(
  "onnx-community/Kokoro-82M-ONNX",
  { dtype: "q8" }, // fp32, fp16, q8, q4, q4f16
);

const text = "Life is like a box of chocolates. You never know what you're gonna get.";
const audio = await tts.generate(text,
  { voice: "af_sky" }, // See `tts.list_voices()`
);
audio.save("audio.wav");

Huge kudos to the Kokoro TTS community, especially taylorchu for the ONNX exports and Hexgrad for the amazing project! None of this would be possible without you all! 🤗

The model is also extremely resilient to quantization. The smallest variant is only 86 MB in size (down from the original 326 MB), with no noticeable difference in audio quality! 🤯