File size: 1,516 Bytes
57ac55d
 
 
 
86766cd
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
6523b2e
86766cd
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
6523b2e
86766cd
 
 
 
 
 
 
 
 
 
 
 
 
 
1
2
3
4
5
6
7
8
9
10
11
12
13
14
15
16
17
18
19
20
21
22
23
24
25
26
27
28
29
30
31
32
33
34
35
36
37
38
39
40
41
42
43
44
45
46
47
48
49
50
51
52
53
54
55
56
57
---
license: mit
library_name: transformers
pipeline_tag: text-to-audio
---
# 🎵🎵🎵AudioLCM:Text-to-Audio Generation with Latent Consistency Models

We develop **AudioLCM** building on  LCM (latent consistency models) for text-to-audio generation.



## code

Our code is released here : [https://github.com/liuhuadai/AudioLCM)](https://github.com/liuhuadai/AudioLCM)

Please follow the instructions in the repository for installation, usage and experiments.



## Quickstart Guide

Download the **AudioLCM** model and generate audio from a text prompt:

```python
from pythonscripts.InferAPI import AudioLCMInfer


prompt="Constant rattling noise and sharp vibrations"
config_path="./audiolcm.yaml"
model_path="./audiolcm.ckpt"
vocoder_path="./model/vocoder"
audio_path = AudioLCMInfer(prompt, config_path=config_path, model_path=model_path, vocoder_path=vocoder_path)


```



Use the `AudioLCMBatchInfer` function to generate multiple audio samples for a batch of text prompts:

```python
from pythonscripts.InferAPI import AudioLCMBatchInfer


prompts=[
    "Constant rattling noise and sharp vibrations",
    "A rocket flies by followed by a loud explosion and fire crackling as a truck engine runs idle",
    "Humming and vibrating with a man and children speaking and laughing"
        ]
config_path="./audiolcm.yaml"
model_path="./audiolcm.ckpt"
vocoder_path="./model/vocoder"
audio_path = AudioLCMBatchInfer(prompts, config_path=config_path, model_path=model_path, vocoder_path=vocoder_path)
```