Speech-MASSIVE: A Multilingual Speech Dataset for SLU and Beyond • Paper • 2408.03900 • Published Aug 7, 2024 • 10
Building and better understanding vision-language models: insights and future directions • Paper • 2408.12637 • Published Aug 22, 2024 • 124
Vietnamese speech dataset • Collection for speech-related tasks: speech-to-text & text-to-speech • 25 items • Updated Oct 6, 2024 • 13