--- datasets: - mozilla-foundation/common_voice_17_0 - TalTechNLP/VoxLingua107 language: - af - ar - ak - en - ee - fr - ha - ig - li - ln - mg - xh - yo - zu - pt - wo - ts - to - sw - sn - ny base_model: - utter-project/mHuBERT-147 --- ## AfriHuBERT: A self-supervised speech representation model for African languages ### Model description This is multilingual self-supervised speech model based on mHuBERT-147. ### Pretraining data - Dataset: AfriHuBERT was trained on data sources from 8 major sources which include: BibleTTS ### Language Coverage AfriHuBERT covers 44 languages in total