liuhuadai/AudioLCM


liuhuadai committed on Jun 5, 2024

Commit 86766cd (verified) · 1 Parent(s): 4480dee

Update README.md

Files changed (1)

README.md +56 -1


@@ -2,4 +2,59 @@
 license: mit
 library_name: transformers
 pipeline_tag: text-to-audio
----

+---
+# 🎵🎵🎵 AudioLCM: Text-to-Audio Generation with Latent Consistency Models
+We develop **AudioLCM**, a text-to-audio generation model built on latent consistency models (LCMs).
+## Code
+Our code is released here: [https://github.com/liuhuadai/AudioLCM](https://github.com/liuhuadai/AudioLCM)
+Please follow the instructions in the repository for installation, usage and experiments.
+## Quickstart Guide
+Download the **AudioLCM** model and generate audio from a text prompt:
+```python
+import IPython
+import soundfile as sf
+from infer import AudioLCMInfer
+
+prompt = "Constant rattling noise and sharp vibrations"
+config_path = "./audiolcm.yaml"
+model_path = "./audiolcm.ckpt"
+vocoder_path = "./model/vocoder"
+audio_path = AudioLCMInfer(prompt, config_path=config_path, model_path=model_path, vocoder_path=vocoder_path)
+```
+Use the `AudioLCMBatchInfer` function to generate multiple audio samples for a batch of text prompts:
+```python
+import IPython
+import soundfile as sf
+from infer import AudioLCMBatchInfer
+
+prompts = [
+    "Constant rattling noise and sharp vibrations",
+    "A rocket flies by followed by a loud explosion and fire crackling as a truck engine runs idle",
+    "Humming and vibrating with a man and children speaking and laughing",
+]
+config_path = "./audiolcm.yaml"
+model_path = "./audiolcm.ckpt"
+vocoder_path = "./model/vocoder"
+audio_path = AudioLCMBatchInfer(prompts, config_path=config_path, model_path=model_path, vocoder_path=vocoder_path)
+```
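
The README does not say what `AudioLCMInfer` returns beyond the variable name `audio_path`. As a minimal sketch, assuming that value is a path to a PCM WAV file on disk (an assumption, not something the diff confirms), the generated audio could be sanity-checked with the standard library alone. The helper name `describe_wav` is hypothetical and not part of the AudioLCM repository.

```python
import wave


def describe_wav(path):
    """Return basic properties of a PCM WAV file.

    Assumption: the file is PCM-encoded. Python's `wave` module raises
    `wave.Error` for formats it cannot read, such as floating-point WAV.
    """
    with wave.open(str(path), "rb") as wav_file:
        n_frames = wav_file.getnframes()
        sample_rate = wav_file.getframerate()
        return {
            "channels": wav_file.getnchannels(),
            "sample_rate": sample_rate,
            # Duration in seconds is frame count divided by frames per second.
            "duration_s": n_frames / sample_rate,
        }
```

If the assumption holds, calling `describe_wav(audio_path)` after the Quickstart snippet would report the channel count, sample rate, and duration of the generated clip. If the file is not PCM WAV, a reader such as `soundfile` (already imported in the Quickstart) may be the better choice.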