speech-to-speech-translation

Sleeping

hewliyang

use whisper-large-v3 & mms-tts-zlm

0323180 12 months ago

711 Bytes

	---
	title: Speech To Speech Translation
	emoji: 🏆
	colorFrom: pink
	colorTo: indigo
	sdk: gradio
	sdk_version: 3.36.1
	app_file: app.py
	pinned: false
	---

	Part of the HuggingFace Audio Processing course.

	This is a Gradio wrapper around a (X -> Malay) speech2speech pipeline, where X is any language supported by
	`openai/whisper-base`.

	The TTS model used is `facebook/mms-tts-zlm`, a pretrained checkpoint for speech in Malay which is part of their Massively Multilingual Speech project. The underlying architecture is based on VITS, which generates waveforms directly and does not need a seperate vocoder.

	Check out the configuration reference at https://huggingface.co/docs/hub/spaces-config-reference