SauerkrautTTS-Preview-0.1

SauerkrautTTS-Preview-0.1 is a fine-tuned Text-to-Speech (TTS) model based on the powerful canopylabs/orpheus-3b-0.1-ft.

This preview model introduces four distinct German-speaking voices (Lena, Anna, Max, and Tom), crafted using original audio recordings captured with a Rhode Studio microphone and Mimic Studio, alongside carefully curated synthetic data. The high quality and careful curation of our overall dataset enable the model to produce clear and natural speech, even in this initial release.

Model Details

Speaker Data Breakdown

Speaker   Original Data (Hours)   Synthetic Data (Hours)   Total (Hours)
Tom       1h                      3.8h                     4.8h
Anna      3h                      1.25h                    4.25h
Max       -                       4.78h                    4.78h
Lena      -                       4.87h                    4.87h

The synthetic audio data enriches the model, resulting in versatile and expressive voice capabilities.
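As a quick sanity check on the breakdown above, the per-speaker totals and the overall share of synthetic audio can be recomputed from the table. This is only an illustrative sketch: the figures are copied from the table, and treating a "-" entry as 0 hours is an assumption.

```python
# Hours per speaker, copied from the table: (original, synthetic).
# Assumption: a "-" in the table means 0 hours of original recordings.
hours = {
    "Tom": (1.0, 3.8),
    "Anna": (3.0, 1.25),
    "Max": (0.0, 4.78),
    "Lena": (0.0, 4.87),
}

# Total hours per speaker, rounded to the table's precision.
totals = {name: round(orig + synth, 2) for name, (orig, synth) in hours.items()}

original_total = sum(orig for orig, _ in hours.values())
synthetic_total = sum(synth for _, synth in hours.values())
overall = original_total + synthetic_total
synthetic_share = synthetic_total / overall

for name, total in totals.items():
    print(f"{name}: {total:.2f} h")
print(f"Overall: {overall:.2f} h, synthetic share: {synthetic_share:.0%}")
```

Under these assumptions the dataset comes to roughly 18.7 hours in total, of which about 79% is synthetic.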

Example Output and Comparison

Usage

For seamless inference and practical examples, check out our detailed instructions and ready-to-use scripts available on:

To achieve optimal results, we recommend using a lower temperature for clear and stable outputs. Higher temperatures will enhance dynamism and expressiveness but might introduce instability.

Example inference settings:

temperature = 0.5  # Adjust lower for clearer output, higher for creativity
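To illustrate why a lower temperature tends to give more stable output, here is a minimal, generic sketch of temperature-scaled softmax sampling probabilities. It is not this model's inference code: the function name `softmax_with_temperature` and the example scores are hypothetical, and it assumes the model picks tokens by sampling from a temperature-scaled distribution, as autoregressive language models commonly do.

```python
import math


def softmax_with_temperature(logits, temperature):
    """Turn raw scores into probabilities after dividing by the temperature."""
    if temperature <= 0:
        raise ValueError("temperature must be positive")
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(s - peak) for s in scaled]
    total = sum(exps)
    return [e / total for e in exps]


# Hypothetical scores for three candidate tokens (illustrative values only).
example_logits = [2.0, 1.0, 0.5]

for t in (0.5, 1.0, 1.5):
    probs = softmax_with_temperature(example_logits, t)
    print(f"temperature={t}: {[round(p, 3) for p in probs]}")
```

At a lower temperature the highest-scoring candidate receives a larger share of the probability mass, so sampling is more predictable. At a higher temperature the distribution flattens, which allows more varied but less stable choices.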

Future Plans

This model represents our first exploratory step into advanced German-language TTS. Expect significant improvements in upcoming versions, including:

  • Enhanced voice clarity
  • Expanded speaker diversity
  • Greater stability across temperature ranges

Stay tuned for future releases and updates!

License

SauerkrautTTS-Preview-0.1 is openly available under the CC BY-NC 4.0 license, which permits reuse, remixing, and improvement by the community for non-commercial purposes, with attribution.

Acknowledgments

We thank Unsloth for their invaluable training script, which we utilized in a lightly modified form for training this model.

Model size: 3.3B params (Safetensors, tensor type BF16)