Wenetspeech4TTS (WenetSpeech4TTS)

kxxia

updated a model 5 months ago

Wenetspeech4TTS/Amphion-NaturalSpeech2-Wenetspeech4TTS

Text-to-Speech • Updated Aug 31, 2024

dukkkk

updated a dataset 6 months ago

Wenetspeech4TTS/WenetSpeech4TTS

Updated Jul 25, 2024 • 553 • 69

dukkkk

authored 4 papers 6 months ago

Text-aware and Context-aware Expressive Audiobook Speech Synthesis

Paper • 2406.05672 • Published Jun 9, 2024

WenetSpeech4TTS: A 12,800-hour Mandarin TTS Corpus for Large Speech Generation Model Benchmark

Paper • 2406.05763 • Published Jun 9, 2024

HiGNN-TTS: Hierarchical Prosody Modeling with Graph Neural Networks for Expressive Long-form TTS

Paper • 2309.13907 • Published Sep 25, 2023

Automatic channel selection and spatial feature integration for multi-channel speech recognition across various array topologies

Paper • 2312.09746 • Published Dec 15, 2023

Bakerbunker

authored 5 papers 6 months ago

AIR-Bench: Benchmarking Large Audio-Language Models via Generative Comprehension

Paper • 2402.07729 • Published Feb 12, 2024

Qwen2-Audio Technical Report

Paper • 2407.10759 • Published Jul 15, 2024 • 56

Vec-Tok Speech: speech vectorization and tokenization for neural speech generation

Paper • 2310.07246 • Published Oct 11, 2023 • 1

FreeV: Free Lunch For Vocoders Through Pseudo Inversed Mel Filter

Paper • 2406.08196 • Published Jun 12, 2024

SALT: Distinguishable Speaker Anonymization Through Latent Space Transformation

Paper • 2310.05051 • Published Oct 8, 2023

kxxia

updated a model 7 months ago

Wenetspeech4TTS/Amphion-Valle-Wenetspeech4TTS

Text-to-Speech • Updated Jun 20, 2024

dukkkk

updated a model 7 months ago

Wenetspeech4TTS/Audiodec-Valle-Wenetspeech4TTS

Updated Jun 20, 2024 • 8

Team members: 5
