spark-tts committed on
Commit
ee29f36
·
1 Parent(s): 1c17251

update readme for Triton

Files changed (1)
  1. README.md +18 -0
README.md CHANGED
@@ -60,6 +60,8 @@ Spark-TTS is an advanced text-to-speech system that uses the power of large lang
60
 
61
  - **[2025-03-04]** Our paper on this project has been published! You can read it here: [Spark-TTS](https://arxiv.org/pdf/2503.01710).
62
 
63
 
64
  ## Install
65
  **Clone and Install**
@@ -141,6 +143,22 @@ For additional CLI and Web UI methods, including alternative implementations and
141
  - [CLI and UI by AcTePuKc](https://github.com/SparkAudio/Spark-TTS/issues/10)
142
 
143
 
144
  ## **Demos**
145
 
146
  Here are some demos generated by Spark-TTS using zero-shot voice cloning. For more demos, visit our [demo page](https://sparkaudio.github.io/spark-tts/).
 
60
 
61
  - **[2025-03-04]** Our paper on this project has been published! You can read it here: [Spark-TTS](https://arxiv.org/pdf/2503.01710).
62
 
63
+ - **[2025-03-12]** NVIDIA Triton Inference Serving is now supported. See the Runtime section below for details.
64
+
65
 
66
  ## Install
67
  **Clone and Install**
 
143
  - [CLI and UI by AcTePuKc](https://github.com/SparkAudio/Spark-TTS/issues/10)
144
 
145
 
146
+ ## Runtime
147
+
148
+ **NVIDIA Triton Inference Serving**
149
+
150
+ We now provide a reference implementation for deploying Spark-TTS with NVIDIA Triton and TensorRT-LLM. The table below shows benchmark results on a single L20 GPU, using 26 different `prompt_audio`/`target_text` pairs (169 seconds of audio in total):
151
+
152
+ | Model | Note | Concurrency | Avg Latency | RTF |
153
+ |-------|------|-------------|-------------|-----|
154
+ | Spark-TTS-0.5B | [Code Commit](https://github.com/SparkAudio/Spark-TTS/tree/4d769ff782a868524f29e0be851ca64f8b22ebf1/runtime/triton_trtllm) | 1 | 876.24 ms | 0.1362 |
155
+ | Spark-TTS-0.5B | [Code Commit](https://github.com/SparkAudio/Spark-TTS/tree/4d769ff782a868524f29e0be851ca64f8b22ebf1/runtime/triton_trtllm) | 2 | 920.97 ms | 0.0737 |
156
+ | Spark-TTS-0.5B | [Code Commit](https://github.com/SparkAudio/Spark-TTS/tree/4d769ff782a868524f29e0be851ca64f8b22ebf1/runtime/triton_trtllm) | 4 | 1611.51 ms | 0.0704 |
157
+
158
+
159
+ For detailed instructions, see [runtime/triton_trtllm/README.md](runtime/triton_trtllm/README.md).
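
The README does not define RTF. Assuming it is the usual real-time factor (total processing time divided by the total duration of the synthesized audio), the figures in the table can be related to the 169 seconds of benchmark audio with a short sketch. The function and constant names below are illustrative only and are not part of Spark-TTS.

```python
TOTAL_AUDIO_SECONDS = 169.0  # total audio duration stated for the 26 benchmark pairs

# RTF values copied from the benchmark table, keyed by concurrency.
REPORTED_RTF = {1: 0.1362, 2: 0.0737, 4: 0.0704}


def real_time_factor(processing_seconds: float, audio_seconds: float) -> float:
    """Assumed definition: processing time divided by audio duration."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return processing_seconds / audio_seconds


def implied_processing_seconds(
    rtf: float, audio_seconds: float = TOTAL_AUDIO_SECONDS
) -> float:
    """Invert the assumed definition to recover the total processing time."""
    return rtf * audio_seconds


if __name__ == "__main__":
    for concurrency, rtf in REPORTED_RTF.items():
        seconds = implied_processing_seconds(rtf)
        print(
            f"concurrency={concurrency}: ~{seconds:.1f} s to synthesize "
            f"{TOTAL_AUDIO_SECONDS:.0f} s of audio "
            f"({1 / rtf:.1f}x faster than real time)"
        )
```

Under this reading, an RTF of 0.1362 at concurrency 1 implies about 23.0 seconds of processing for the whole set. That is close to 26 requests at the reported average latency of 876.24 ms (about 22.8 seconds), so the assumed definition is at least consistent with the table.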
160
+
161
+
162
  ## **Demos**
163
 
164
  Here are some demos generated by Spark-TTS using zero-shot voice cloning. For more demos, visit our [demo page](https://sparkaudio.github.io/spark-tts/).