spark-tts
commited on
Commit
·
ee29f36
1
Parent(s):
1c17251
update readme for Triton
Browse files
README.md
CHANGED
@@ -60,6 +60,8 @@ Spark-TTS is an advanced text-to-speech system that uses the power of large lang
|
|
60 |
|
61 |
- **[2025-03-04]** Our paper on this project has been published! You can read it here: [Spark-TTS](https://arxiv.org/pdf/2503.01710).
|
62 |
|
|
|
|
|
63 |
|
64 |
## Install
|
65 |
**Clone and Install**
|
@@ -141,6 +143,22 @@ For additional CLI and Web UI methods, including alternative implementations and
|
|
141 |
- [CLI and UI by AcTePuKc](https://github.com/SparkAudio/Spark-TTS/issues/10)
|
142 |
|
143 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
144 |
## **Demos**
|
145 |
|
146 |
Here are some demos generated by Spark-TTS using zero-shot voice cloning. For more demos, visit our [demo page](https://sparkaudio.github.io/spark-tts/).
|
|
|
60 |
|
61 |
- **[2025-03-04]** Our paper on this project has been published! You can read it here: [Spark-TTS](https://arxiv.org/pdf/2503.01710).
|
62 |
|
63 |
+
- **[2025-03-12]** Nvidia Triton Inference Serving is now supported. See the Runtime section below for more details.
|
64 |
+
|
65 |
|
66 |
## Install
|
67 |
**Clone and Install**
|
|
|
143 |
- [CLI and UI by AcTePuKc](https://github.com/SparkAudio/Spark-TTS/issues/10)
|
144 |
|
145 |
|
146 |
+
## Runtime
|
147 |
+
|
148 |
+
**Nvidia Triton Inference Serving**
|
149 |
+
|
150 |
+
We now provide a reference for deploying Spark-TTS with Nvidia Triton and TensorRT-LLM. The table below presents benchmark results on a single L20 GPU, using 26 different prompt_audio/target_text pairs (totalling 169 seconds of audio):
|
151 |
+
|
152 |
+
| Model | Note | Concurrency | Avg Latency | RTF |
|
153 |
+
|-------|-----------|-----------------------|---------|--|
|
154 |
+
| Spark-TTS-0.5B | [Code Commit](https://github.com/SparkAudio/Spark-TTS/tree/4d769ff782a868524f29e0be851ca64f8b22ebf1/runtime/triton_trtllm) | 1 | 876.24 ms | 0.1362|
|
155 |
+
| Spark-TTS-0.5B | [Code Commit](https://github.com/SparkAudio/Spark-TTS/tree/4d769ff782a868524f29e0be851ca64f8b22ebf1/runtime/triton_trtllm) | 2 | 920.97 ms | 0.0737|
|
156 |
+
| Spark-TTS-0.5B | [Code Commit](https://github.com/SparkAudio/Spark-TTS/tree/4d769ff782a868524f29e0be851ca64f8b22ebf1/runtime/triton_trtllm) | 4 | 1611.51 ms | 0.0704|
|
157 |
+
|
158 |
+
|
159 |
+
Please see the detailed instructions in [runtime/triton_trtllm/README.md](runtime/triton_trtllm/README.md ) for more information.
|
160 |
+
|
161 |
+
|
162 |
## **Demos**
|
163 |
|
164 |
Here are some demos generated by Spark-TTS using zero-shot voice cloning. For more demos, visit our [demo page](https://sparkaudio.github.io/spark-tts/).
|