reazonspeech-espnet-v1

reazonspeech-espnet-v1 is an ESPnet model trained for Japanese automatic speech recognition (ASR).

  • This model was trained on 15,000 hours of ReazonSpeech corpus.
  • Make sure that your audio file is sampled at 16khz when using this model.

For more details, please visit the official project page.

Downloads last month
0
Inference Providers NEW
This model is not currently available via any of the supported third-party Inference Providers, and the model is not deployed on the HF Inference API.

Dataset used to train Dallyana/daya24mile_asr