Spaces: Running on Zero
Update app.py #18
by ylacombe - opened
app.py CHANGED
@@ -13,7 +13,7 @@ device = "cuda:0" if torch.cuda.is_available() else "cpu"
 
 
 repo_id = "parler-tts/parler-tts-mini-v1"
-repo_id_large = "
+repo_id_large = "parler-tts/parler-tts-large-v1"
 
 model = ParlerTTSForConditionalGeneration.from_pretrained(repo_id).to(device)
 model_large = ParlerTTSForConditionalGeneration.from_pretrained(repo_id_large).to(device)
@@ -154,7 +154,7 @@ with gr.Blocks(css=css) as block:
 are trained using 45k hours of narrated English audiobooks. It generates high-quality speech
 with features that can be controlled using a simple text prompt (e.g. gender, background noise, speaking rate, pitch and reverberation).</p>
 
-<p>By default, Parler-TTS generates 🎲 random voice. To ensure 🎯 <b> speaker consistency </b> across generations, these checkpoints were also trained on 34 speakers, characterized by name (e.g. Jon, Lea, Gary, Jenna, Mike, Laura)
+<p>By default, Parler-TTS generates 🎲 random voice. To ensure 🎯 <b> speaker consistency </b> across generations, these checkpoints were also trained on 34 speakers, characterized by name (e.g. Jon, Lea, Gary, Jenna, Mike, Laura). Learn more about this <a href="https://github.com/huggingface/parler-tts/blob/main/INFERENCE.md#speaker-consistency"> here </a>.</p>
 
 <p>To take advantage of this, simply adapt your text description to specify which speaker to use: `Jon's voice is monotone...`</p>
 """
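
For readers who want to try the speaker-consistency behaviour described in the changed paragraph, here is a minimal sketch that is not part of this PR. The helper names `speaker_description` and `synthesize` are hypothetical, and the trait wording is only an illustration. The sketch assumes the `parler_tts`, `transformers` and `torch` packages are installed, and reuses the `repo_id` value from the diff.

```python
def speaker_description(speaker: str, traits: str) -> str:
    """Build a text description that names the speaker (hypothetical helper).

    The diff suggests the form "Jon's voice is monotone..."; this
    simply fills in that pattern.
    """
    return f"{speaker}'s voice is {traits}."


def synthesize(prompt: str, description: str,
               repo_id: str = "parler-tts/parler-tts-mini-v1"):
    """Generate audio for `prompt` in the voice given by `description`.

    Heavy imports are kept inside the function so that the helper above
    can be used without downloading a model.
    """
    import torch
    from parler_tts import ParlerTTSForConditionalGeneration
    from transformers import AutoTokenizer

    device = "cuda:0" if torch.cuda.is_available() else "cpu"
    model = ParlerTTSForConditionalGeneration.from_pretrained(repo_id).to(device)
    tokenizer = AutoTokenizer.from_pretrained(repo_id)

    # The description conditions the voice; the prompt is the text to speak.
    input_ids = tokenizer(description, return_tensors="pt").input_ids.to(device)
    prompt_input_ids = tokenizer(prompt, return_tensors="pt").input_ids.to(device)

    generation = model.generate(input_ids=input_ids,
                                prompt_input_ids=prompt_input_ids)
    # Returns the waveform as a NumPy array plus its sampling rate.
    return generation.cpu().numpy().squeeze(), model.config.sampling_rate


description = speaker_description("Jon", "monotone yet slightly fast in delivery")
```

Calling `synthesize("Hey, how are you doing today?", description)` repeatedly with the same named speaker should, per the text above, give a consistent voice, whereas a description without a name yields a random one.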