--- language: - grn tags: - guarani - text-to-speech - speech - audio - vits license: mit --- # MMS-TTS Guarani Model This is a VITS-based text-to-speech model for the Guarani language, based on the MMS-TTS architecture. ## Model Description This model can generate speech from Guarani text input using the VITS architecture. ## Usage ```python from transformers import VitsModel, AutoTokenizer import torch model = VitsModel.from_pretrained("joselobenitezg/mms-grn-tts") tokenizer = AutoTokenizer.from_pretrained("joselobenitezg/mms-grn-tts") text = "some example text in the Guarani language" inputs = tokenizer(text, return_tensors="pt") with torch.no_grad(): output = model(**inputs).waveform # Save the output as a wav file import scipy scipy.io.wavfile.write("output.wav", rate=model.config.sampling_rate, data=output) ```