DRAFT

#1
by dkounadis - opened

Soundscapes via AudioGen

Prompt

Generation

Sea level clashing on the iceberg of lighthouse

Loud scene of restaurant in medieval city

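Nikolskoe bei der Wasser an der Havel in Berlin-Zehlendorf, nahe der Pfaueninsel. Beliebtes Berliner Ausflugsziel (English: Nikolskoe by the water on the Havel in Berlin-Zehlendorf, near the Pfaueninsel. A popular Berlin excursion destination)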

Draft after this line

The following examples are Harvard sentences synthesized with StyleTTS2. The style reference is one of: Mimic-3 output, Mimic-3 output accelerated to 4x speed, or LibriSpeech segments.

Trial 3

New table - tts_harvard.py

Prompt - (Sweet dreams are made of this..!)

StyleTTS2 - (First 20 Harvard Sentences)

Mimic-3 English

StyleTTS2 - (Mimic-3 English)

From Above

Mimic-3 English 4x

StyleTTS2 - (Mimic-3 English 4x)

Human

StyleTTS2 - (Human)

Mimic-3 Foreign

StyleTTS2 - (Mimic-3 Foreign)

Mimic-3 Foreign 4x

StyleTTS2 - (Mimic-3 Foreign 4x)


Please edit for MOS annotation

Samples 0 to 11, each in two versions: A and B.

4 4
1 4
5 5
5 4
3 4
1 4

2 3
1 4
5 4
5 4
2 3
1 2

A B

4* 5
3* 5
5 5
5 4
2 5
1 4


*ignoring the breath-noise between sentences
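
The ratings above can be turned into mean opinion scores (MOS) with a few lines of Python. This is only a sketch: it assumes the three groups of scores are independent rating blocks of (A, B) pairs, and it enters the asterisked scores (4*, 3*) as plain 4 and 3. The names `BLOCKS`, `mos` and `pooled_mos` are made up for illustration.

```python
from statistics import mean

# Ratings copied from the three score blocks above as (A, B) pairs.
# Assumption: each block is a separate set of ratings; 4* and 3* are
# treated as 4 and 3 (breath noise ignored, as the footnote says).
BLOCKS = [
    [(4, 4), (1, 4), (5, 5), (5, 4), (3, 4), (1, 4)],
    [(2, 3), (1, 4), (5, 4), (5, 4), (2, 3), (1, 2)],
    [(4, 5), (3, 5), (5, 5), (5, 4), (2, 5), (1, 4)],
]


def mos(pairs):
    """Return (mean of A scores, mean of B scores) for a list of (A, B) pairs."""
    a_scores = [a for a, _ in pairs]
    b_scores = [b for _, b in pairs]
    return mean(a_scores), mean(b_scores)


def pooled_mos(blocks):
    """Pool all blocks into one list of pairs and compute the MOS once."""
    pairs = [pair for block in blocks for pair in block]
    return mos(pairs)


if __name__ == "__main__":
    for i, block in enumerate(BLOCKS):
        a, b = mos(block)
        print(f"block {i}: A={a:.2f} B={b:.2f}")
    a, b = pooled_mos(BLOCKS)
    print(f"pooled:  A={a:.2f} B={b:.2f}")
```

Pooled over all 18 pairs, this gives roughly 3.06 for A and 4.06 for B. With so few ratings the difference is indicative only.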

dkounadis changed discussion status to closed

DEBUG Multi-sentence Foreign Languages

Language & Call

Text

python tts.py --text d.txt --voice grc --out ancient_grc_file

δέομαι ὑμῶν, ὦ ἄνδρες Ἀθηναῖοι, ἐθελῆσαί μου μετ᾽ εὐνοίας ἀκοῦσαι λέγοντος, ὑπολογιζομένους τό τε μέγεθος τοῦ κινδύνου καὶ τὸ πλῆθος τῶν αἰτιῶν πρὸς ἂς ἀπολογήσασθαί με δεῖ, καὶ τὰς τέχνας καὶ τὰς κατασκευὰς τοῦ κατηγόρου καὶ τὴν ὠμότητα, ὃς ἐτόλμησε παρακελεύσασθαι
πρὸς ἄνδρας ὀμωμοκότας τῶν ἀντιδίκων ὁμοίως ἀμφοτέρων ἀκούσεσθαι τοῦ
κινδυνεύοντος φωνὴν μὴ ὑπομένειν.

Native English Voices TTS

Diffusion steps = 7 / artificially generated styles from Mimic-3 English at 4x speed - python

Non-native English TTS

# first start Flask api.py
python tts.py --text a.txt --voice "af_ZA_google-nwu_0184"
index  voice (with TTS sample)
0  af_ZA_google-nwu_0184
1  af_ZA_google-nwu_1919
2  af_ZA_google-nwu_2418
3  af_ZA_google-nwu_6590
4  af_ZA_google-nwu_7130
5  af_ZA_google-nwu_7214
6  af_ZA_google-nwu_8148
7  af_ZA_google-nwu_8924
8  af_ZA_google-nwu_8963
9  bn_multi_00737
10  bn_multi_00779
11  bn_multi_01232
12  bn_multi_01701
13  bn_multi_03042
14  bn_multi_0834
15  bn_multi_1010
16  bn_multi_3108
17  bn_multi_3713
18  bn_multi_3958
19  bn_multi_4046
20  bn_multi_4811
21  bn_multi_5958
22  bn_multi_9169
23  bn_multi_rm
24  de_DE_m-ailabs_angela_merkel
25  de_DE_m-ailabs_eva_k
26  de_DE_m-ailabs_karlsson
27  de_DE_m-ailabs_ramona_deininger
28  de_DE_m-ailabs_rebecca_braunert_plunkett
29  de_DE_thorsten-emotion_amused
30  de_DE_thorsten-emotion_angry
31  de_DE_thorsten-emotion_disgusted
32  de_DE_thorsten-emotion_drunk
33  de_DE_thorsten-emotion_neutral
34  de_DE_thorsten-emotion_sleepy
35  de_DE_thorsten-emotion_surprised
36  de_DE_thorsten-emotion_whisper
37  de_DE_thorsten
38  el_GR_rapunzelina
39  es_ES_carlfm
40  es_ES_m-ailabs_karen_savage
41  es_ES_m-ailabs_tux
42  es_ES_m-ailabs_victor_villarraza
43  fa_haaniye
44  fi_FI_harri-tapani-ylilammi
45  fr_FR_m-ailabs_bernard
46  fr_FR_m-ailabs_ezwa
47  fr_FR_m-ailabs_gilles_g_le_blanc
48  fr_FR_m-ailabs_nadine_eckert_boulet
49  fr_FR_m-ailabs_zeckou
50  fr_FR_siwis
51  fr_FR_tom
52  gu_IN_cmu-indic_cmu_indic_guj_ad
53  gu_IN_cmu-indic_cmu_indic_guj_dp
54  gu_IN_cmu-indic_cmu_indic_guj_kt
55  ha_NE_openbible
56  hu_HU_diana-majlinger
57  it_IT_mls_10446
58  it_IT_mls_1157
59  it_IT_mls_12428
60  it_IT_mls_12804
61  it_IT_mls_1595
62  it_IT_mls_1725
63  it_IT_mls_1989
64  it_IT_mls_2019
65  it_IT_mls_2033
66  it_IT_mls_277
67  it_IT_mls_4649
68  it_IT_mls_4705
69  it_IT_mls_4971
70  it_IT_mls_4974
71  it_IT_mls_4975
72  it_IT_mls_4998
73  it_IT_mls_5010
74  it_IT_mls_5421
75  it_IT_mls_6001
76  it_IT_mls_6299
77  it_IT_mls_6348
78  it_IT_mls_643
79  it_IT_mls_644
80  it_IT_mls_659
81  it_IT_mls_6744
82  it_IT_mls_6807
83  it_IT_mls_7405
84  it_IT_mls_7440
85  it_IT_mls_7444
86  it_IT_mls_7936
87  it_IT_mls_8181
88  it_IT_mls_8207
89  it_IT_mls_8384
90  it_IT_mls_844
91  it_IT_mls_8461
92  it_IT_mls_8828
93  it_IT_mls_8842
94  it_IT_mls_9185
95  it_IT_mls_9772
96  it_IT_riccardo-fasol
97  jv_ID_google-gmu_00027
98  jv_ID_google-gmu_00264
99  jv_ID_google-gmu_00658
100  jv_ID_google-gmu_01392
101  jv_ID_google-gmu_01519
102  jv_ID_google-gmu_01932
103  jv_ID_google-gmu_02059
104  jv_ID_google-gmu_02326
105  jv_ID_google-gmu_02884
106  jv_ID_google-gmu_03187
107  jv_ID_google-gmu_03314
108  jv_ID_google-gmu_03424
109  jv_ID_google-gmu_03727
110  jv_ID_google-gmu_04175
111  jv_ID_google-gmu_04285
112  jv_ID_google-gmu_04588
113  jv_ID_google-gmu_04679
114  jv_ID_google-gmu_04715
115  jv_ID_google-gmu_04982
116  jv_ID_google-gmu_05219
117  jv_ID_google-gmu_05522
118  jv_ID_google-gmu_05540
119  jv_ID_google-gmu_05667
120  jv_ID_google-gmu_05970
121  jv_ID_google-gmu_06080
122  jv_ID_google-gmu_06207
123  jv_ID_google-gmu_06383
124  jv_ID_google-gmu_06510
125  jv_ID_google-gmu_06941
126  jv_ID_google-gmu_07335
127  jv_ID_google-gmu_07638
128  jv_ID_google-gmu_07765
129  jv_ID_google-gmu_07875
130  jv_ID_google-gmu_08002
131  jv_ID_google-gmu_08178
132  jv_ID_google-gmu_08305
133  jv_ID_google-gmu_08736
134  jv_ID_google-gmu_09039
135  jv_ID_google-gmu_09724
136  ko_KO_kss
137  ne_NP_ne-google_0258
138  ne_NP_ne-google_0283
139  ne_NP_ne-google_0546
140  ne_NP_ne-google_0649
141  ne_NP_ne-google_0883
142  ne_NP_ne-google_2027
143  ne_NP_ne-google_2099
144  ne_NP_ne-google_2139
145  ne_NP_ne-google_3154
146  ne_NP_ne-google_3614
147  ne_NP_ne-google_3960
148  ne_NP_ne-google_3997
149  ne_NP_ne-google_5687
150  ne_NP_ne-google_6329
151  ne_NP_ne-google_6587
152  ne_NP_ne-google_6834
153  ne_NP_ne-google_7957
154  ne_NP_ne-google_9407
155  nl_bart-de-leeuw
156  nl_flemishguy
157  nl_nathalie
158  nl_pmk
159  nl_rdh
160  pl_PL_m-ailabs_nina_brown
161  pl_PL_m-ailabs_piotr_nater
162  ru_RU_multi_hajdurova
163  ru_RU_multi_minaev
164  ru_RU_multi_nikolaev
165  sw_lanfrica
166  te_IN_cmu-indic_kpn
167  te_IN_cmu-indic_sk
168  te_IN_cmu-indic_ss
169  tn_ZA_google-nwu_0045
170  tn_ZA_google-nwu_0378
171  tn_ZA_google-nwu_0441
172  tn_ZA_google-nwu_1483
173  tn_ZA_google-nwu_1498
174  tn_ZA_google-nwu_1932
175  tn_ZA_google-nwu_2839
176  tn_ZA_google-nwu_3342
177  tn_ZA_google-nwu_3629
178  tn_ZA_google-nwu_4506
179  tn_ZA_google-nwu_4850
180  tn_ZA_google-nwu_5628
181  tn_ZA_google-nwu_6116
182  tn_ZA_google-nwu_6206
183  tn_ZA_google-nwu_6234
184  tn_ZA_google-nwu_6459
185  tn_ZA_google-nwu_7674
186  tn_ZA_google-nwu_7693
187  tn_ZA_google-nwu_7866
188  tn_ZA_google-nwu_7896
189  tn_ZA_google-nwu_8333
190  tn_ZA_google-nwu_8512
191  tn_ZA_google-nwu_8532
192  tn_ZA_google-nwu_8914
193  tn_ZA_google-nwu_9061
194  tn_ZA_google-nwu_9365
195  vi_VN_vais1000
196  yo_openbible
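
To pick voices for one language out of a list like this, the names can be grouped by their leading language code. The sketch below is only a guess at the naming scheme, inferred from the names themselves: a lowercase language code, an optional two-letter uppercase region, then the rest. `VOICE_RE`, `split_voice` and `group_by_language` are hypothetical helpers, not part of `tts.py`.

```python
import re
from collections import defaultdict

# Assumed naming scheme (inferred from the voice list, not documented here):
#   <lang>_[<REGION>_]<dataset and speaker>
VOICE_RE = re.compile(
    r"^(?P<lang>[a-z]{2,3})_(?:(?P<region>[A-Z]{2})_)?(?P<rest>.+)$"
)


def split_voice(voice):
    """Split a voice name into (language, region or None, remainder)."""
    match = VOICE_RE.match(voice)
    if match is None:
        raise ValueError(f"unexpected voice name: {voice!r}")
    return match["lang"], match["region"], match["rest"]


def group_by_language(voices):
    """Map each language code to the voice names that start with it."""
    groups = defaultdict(list)
    for voice in voices:
        lang, _, _ = split_voice(voice)
        groups[lang].append(voice)
    return dict(groups)


if __name__ == "__main__":
    sample = [
        "af_ZA_google-nwu_0184",
        "bn_multi_00737",
        "de_DE_thorsten-emotion_amused",
        "de_DE_thorsten",
        "nl_pmk",
    ]
    for lang, names in group_by_language(sample).items():
        print(lang, names)
```

Names without a region part, such as `nl_pmk` or `bn_multi_00737`, come back with `None` as the region.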

Native English Voices

# first start Flask api.py
python tts.py --text a.txt --voice "en_US_cmu_arctic_aew"
index  voice (with TTS sample)
0  en_US_cmu_arctic_aew
1  en_US_cmu_arctic_ahw
2  en_US_cmu_arctic_aup
3  en_US_cmu_arctic_awbrms
4  en_US_cmu_arctic_axb
5  en_US_cmu_arctic_bdl
6  en_US_cmu_arctic_clb
7  en_US_cmu_arctic_eey
8  en_US_cmu_arctic_fem
9  en_US_cmu_arctic_gka
10  en_US_cmu_arctic_jmk
11  en_US_cmu_arctic_ksp
12  en_US_cmu_arctic_ljm
13  en_US_cmu_arctic_lnh
14  en_US_cmu_arctic_rxr
15  en_US_cmu_arctic_slp
16  en_US_cmu_arctic_slt
17  en_US_hifi-tts_6097
18  en_US_hifi-tts_9017
19  en_US_hifi-tts_92
20  en_US_ljspeech
21  en_US_m-ailabs_elliot_miller
22  en_US_m-ailabs_judy_bieber
23  en_US_m-ailabs_mary_ann
24  en_US_vctk_p225
25  en_US_vctk_p226
26  en_US_vctk_p227
27  en_US_vctk_p228
28  en_US_vctk_p229
29  en_US_vctk_p230
30  en_US_vctk_p231
31  en_US_vctk_p232
32  en_US_vctk_p233
33  en_US_vctk_p234
34  en_US_vctk_p236
35  en_US_vctk_p237
36  en_US_vctk_p238
37  en_US_vctk_p239
38  en_US_vctk_p240
39  en_US_vctk_p241
40  en_US_vctk_p243
41  en_US_vctk_p244
42  en_US_vctk_p245
43  en_US_vctk_p246
44  en_US_vctk_p247
45  en_US_vctk_p248
46  en_US_vctk_p249
47  en_US_vctk_p250
48  en_US_vctk_p251
49  en_US_vctk_p252
50  en_US_vctk_p253
51  en_US_vctk_p254
52  en_US_vctk_p255
53  en_US_vctk_p256
54  en_US_vctk_p257
55  en_US_vctk_p258
56  en_US_vctk_p259
57  en_US_vctk_p260
58  en_US_vctk_p261
59  en_US_vctk_p262
60  en_US_vctk_p263
61  en_US_vctk_p264
62  en_US_vctk_p265
63  en_US_vctk_p266
64  en_US_vctk_p267
65  en_US_vctk_p268
66  en_US_vctk_p269
67  en_US_vctk_p270
68  en_US_vctk_p271
69  en_US_vctk_p272
70  en_US_vctk_p273
71  en_US_vctk_p274
72  en_US_vctk_p275
73  en_US_vctk_p276
74  en_US_vctk_p277
75  en_US_vctk_p278
76  en_US_vctk_p279
77  en_US_vctk_p280
78  en_US_vctk_p281
79  en_US_vctk_p282
80  en_US_vctk_p283
81  en_US_vctk_p284
82  en_US_vctk_p285
83  en_US_vctk_p286
84  en_US_vctk_p287
85  en_US_vctk_p288
86  en_US_vctk_p292
87  en_US_vctk_p293
88  en_US_vctk_p294
89  en_US_vctk_p295
90  en_US_vctk_p297
91  en_US_vctk_p298
92  en_US_vctk_p299
93  en_US_vctk_p300
94  en_US_vctk_p301
95  en_US_vctk_p302
96  en_US_vctk_p303
97  en_US_vctk_p304
98  en_US_vctk_p305
99  en_US_vctk_p306
100  en_US_vctk_p307
101  en_US_vctk_p308
102  en_US_vctk_p310
103  en_US_vctk_p311
104  en_US_vctk_p312
105  en_US_vctk_p313
106  en_US_vctk_p314
107  en_US_vctk_p316
108  en_US_vctk_p317
109  en_US_vctk_p318
110  en_US_vctk_p323
111  en_US_vctk_p326
112  en_US_vctk_p329
113  en_US_vctk_p336
114  en_US_vctk_p340
115  en_US_vctk_p341
116  en_US_vctk_p343
117  en_US_vctk_p345
118  en_US_vctk_p347
119  en_US_vctk_p351
120  en_US_vctk_p360
121  en_US_vctk_p361
122  en_US_vctk_p362
123  en_US_vctk_p363
124  en_US_vctk_p364
125  en_US_vctk_p374
126  en_US_vctk_p376
127  en_US_vctk_s5
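
To synthesize the same text with several of the voices listed above, the `tts.py` call can be generated per voice. The sketch below only builds the command lines, using the `--text`, `--voice` and `--out` flags that appear in this thread. `build_command` is a hypothetical helper, and how `tts.py` behaves without `--out` is not assumed here. As the comments above say, the Flask `api.py` has to be running first.

```python
import shlex


def build_command(voice, text_file="a.txt", out=None):
    """Build the argument list for one tts.py call (does not run it)."""
    # strip() guards against stray whitespace around a copied voice name.
    cmd = ["python", "tts.py", "--text", text_file, "--voice", voice.strip()]
    if out is not None:
        cmd += ["--out", out]
    return cmd


if __name__ == "__main__":
    voices = ["en_US_cmu_arctic_aew", "en_US_ljspeech", "en_US_vctk_p225"]
    for voice in voices:
        # Print a shell-safe command; to execute it instead, one could use
        # subprocess.run(build_command(voice), check=True).
        print(shlex.join(build_command(voice)))
```

Building the argument list separately from running it makes it easy to inspect the commands before starting a long batch.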
