Text-to-Speech
English
Kokoro-82M / VOICES.md
hexgrad's picture
Upload 2 files
c03b18b verified
|
raw
history blame
3.88 kB

Voices

For each voice, the given grades are intended to be estimates of the quality and quantity of its associated training data, both of which impact overall inference quality.

Subjectively, voices will sound better or worse to different people.

Target Quality

  • How high quality is the reference voice? This grade may be impacted by audio quality, artifacts, compression, & sample rate.
  • How well do the text labels match the audio? Text/audio misalignment (e.g. from hallucinations) will lower this grade.

Training Duration

  • How much audio was seen during training? Smaller durations result in a lower overall grade.

American ๐Ÿ‡บ๐Ÿ‡ธ

American G2P: misaki[en] with en-us espeak-ng fallback

Name Traits Target Quality Training Duration Overall Grade
af_alloy ๐Ÿšบ B MM minutes C
af_aoede ๐Ÿšบ B H hours C+
af_bella ๐Ÿšบ๐Ÿ”ฅ A HH hours A-
af_jessica ๐Ÿšบ C MM minutes D
af_kore ๐Ÿšบ B H hours C+
af_nicole ๐Ÿšบ๐ŸŽง B HH hours B-
af_nova ๐Ÿšบ B MM minutes C
af_river ๐Ÿšบ C MM minutes D
af_sarah ๐Ÿšบ B H hours C+
af_sky ๐Ÿšบ B M minutes C-
am_adam ๐Ÿšน D H hours F+
am_echo ๐Ÿšน C MM minutes D
am_eric ๐Ÿšน C MM minutes D
am_fenrir ๐Ÿšน B H hours C+
am_liam ๐Ÿšน C MM minutes D
am_michael ๐Ÿšน B H hours C+
am_onyx ๐Ÿšน C MM minutes D
am_puck ๐Ÿšน B H hours C+

British ๐Ÿ‡ฌ๐Ÿ‡ง

British G2P: misaki[en] with en-gb espeak-ng fallback

Name Traits Target Quality Training Duration Overall Grade
bf_alice ๐Ÿšบ C MM minutes D
bf_emma ๐Ÿšบ B HH hours B-
bf_isabella ๐Ÿšบ B MM minutes C
bf_lily ๐Ÿšบ C MM minutes D
bm_daniel ๐Ÿšน C MM minutes D
bm_fable ๐Ÿšน B MM minutes C
bm_george ๐Ÿšน B MM minutes C
bm_lewis ๐Ÿšน C H hours D+

French ๐Ÿ‡ซ๐Ÿ‡ท

French G2P: espeak-ng fr-fr

Name Traits Target Quality Training Duration Overall Grade
ff_siwis ๐Ÿšบ B <11 hours B-

This table lists all French training data seen by Kokoro.

Hindi ๐Ÿ‡ฎ๐Ÿ‡ณ

Hindi G2P: espeak-ng hi

Name Traits Target Quality Training Duration Overall Grade
hf_alpha ๐Ÿšบ B MM minutes C
hf_beta ๐Ÿšบ B MM minutes C
hm_omega ๐Ÿšน B MM minutes C
hm_psi ๐Ÿšน B MM minutes C

This table lists all Hindi training data seen by Kokoro, which totals about 6 hours.

Japanese ๐Ÿ‡ฏ๐Ÿ‡ต

Japanese G2P: misaki[ja]

Name Traits Target Quality Training Duration Overall Grade
jf_alpha ๐Ÿšบ B H hours C+

Mandarin Chinese ๐Ÿ‡จ๐Ÿ‡ณ

Mandarin Chinese G2P: misaki[zh]

Name Traits Target Quality Training Duration Overall Grade
zf_xiaobei ๐Ÿšบ C MM minutes D
zf_xiaoni ๐Ÿšบ C MM minutes D
zf_xiaoxiao ๐Ÿšบ C MM minutes D
zf_xiaoyi ๐Ÿšบ C MM minutes D
zm_yunjian ๐Ÿšน C MM minutes D
zm_yunxi ๐Ÿšน C MM minutes D
zm_yunxia ๐Ÿšน C MM minutes D
zm_yunyang ๐Ÿšน C MM minutes D