NB's picture

NB

Skier8402

AI & ML interests

Practicing Computer Vision, Optimization, NLP and multimodal system implementation.

Recent Activity

updated a collection 3 days ago
Swahili models
updated a collection 3 days ago
Swahili models
liked a model 3 days ago
Alfaxad/gemma2-2b-swahili-preview
View all activity

Organizations

fast.ai community's profile picture Blog-explorers's profile picture Tangu Kale Labs's profile picture UltimateControllers's profile picture blacksheepinc's profile picture Social Post Explorers's profile picture Epidemiology World's profile picture Transcriptors's profile picture Hugging Face Discord Community's profile picture

Skier8402's activity

updated a Space 4 days ago
upvoted an article 8 days ago
view article
Article

Upgrading Kokoro: natural TTS for short bursts

By hexgrad โ€ข
โ€ข 22
reacted to hexgrad's post with ๐Ÿ”ฅ 8 days ago
view post
Post
15420
๐Ÿ“ฃ Looking for labeled, high-quality synthetic audio/TTS data ๐Ÿ“ฃ Have you been or are you currently calling API endpoints from OpenAI, ElevenLabs, etc? Do you have labeled audio data sitting around gathering dust? Let's talk! Join https://discord.gg/QuGxSWBfQy or comment down below.

If your data exceeds quantity & quality thresholds and is approved into the next hexgrad/Kokoro-82M training mix, and you permissively DM me the data under an effective Apache license, then I will DM back the corresponding voicepacks for YOUR data if/when the next Apache-licensed Kokoro base model drops.

What does this mean? If you've been calling closed-source TTS or audio API endpoints to:
- Build voice agents
- Make long-form audio, like audiobooks or podcasts
- Handle customer support, etc
Then YOU can contribute to the training mix and get useful artifacts in return. โค๏ธ

More details at hexgrad/Kokoro-82M#21
ยท