Aya Model: An Instruction Finetuned Open-Access Multilingual Language Model Paper โข 2402.07827 โข Published Feb 12, 2024 โข 47
Naijaweb datasets ๐ณ๐ฌ Collection A recreation of the fineweb collection for Nigerians โข 3 items โข Updated Oct 24, 2024 โข 5
OpenCulture Collection A multilingual dataset of public domain books and newspapers. โข 27 items โข Updated Nov 6, 2024 โข 122
Distil-Whisper: Robust Knowledge Distillation via Large-Scale Pseudo Labelling Paper โข 2311.00430 โข Published Nov 1, 2023 โข 57