Spaces:
Running
on
Zero
Running
on
Zero
add roadmap
Browse files
README.md
CHANGED
@@ -4,6 +4,36 @@
|
|
4 |
|
5 |
For `SenseVoice`, visit [SenseVoice repo](https://github.com/FunAudioLLM/SenseVoice) and [SenseVoice space](https://www.modelscope.cn/studios/iic/SenseVoice).
|
6 |
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
|
7 |
## Install
|
8 |
|
9 |
**Clone and install**
|
|
|
4 |
|
5 |
For `SenseVoice`, visit [SenseVoice repo](https://github.com/FunAudioLLM/SenseVoice) and [SenseVoice space](https://www.modelscope.cn/studios/iic/SenseVoice).
|
6 |
|
7 |
+
## Roadmap
|
8 |
+
|
9 |
+
- [x] 2024/07
|
10 |
+
|
11 |
+
- [x] Flow matching training support
|
12 |
+
- [x] WeTextProcessing support when ttsfrd is not avaliable
|
13 |
+
- [x] Fastapi server and client
|
14 |
+
|
15 |
+
- [ ] 2024/08
|
16 |
+
|
17 |
+
- [ ] Repetition Aware Sampling(RAS) inference for llm stability
|
18 |
+
- [ ] Streaming inference mode support, including kv cache and sdpa for rtf optimization
|
19 |
+
|
20 |
+
- [ ] 2024/09
|
21 |
+
|
22 |
+
- [ ] 50hz llm model which supports 10 language
|
23 |
+
|
24 |
+
- [ ] 2024/10
|
25 |
+
|
26 |
+
- [ ] 50hz llama based llm model which supports lora finetune
|
27 |
+
|
28 |
+
- [ ] TBD
|
29 |
+
|
30 |
+
- [ ] Support more instruction mode
|
31 |
+
- [ ] Voice conversion
|
32 |
+
- [ ] Music generation
|
33 |
+
- [ ] Training script sample based on Mandarin
|
34 |
+
- [ ] CosyVoice-500M trained with more multi-lingual data
|
35 |
+
- [ ] More...
|
36 |
+
|
37 |
## Install
|
38 |
|
39 |
**Clone and install**
|