11 8 122

NB

Skier8402

https://nyab.notion.site

Shuyib

AI & ML interests

Practicing Computer Vision, Optimization, NLP and multimodal system implementation.

Recent Activity

updated a collection 3 days ago

Swahili models

updated a collection 3 days ago

Swahili models

liked a model 3 days ago

Alfaxad/gemma2-2b-swahili-preview

View all activity

Organizations

Skier8402's activity

updated a collection 3 days ago

Swahili models

Collection

3 items • Updated 3 days ago

liked a model 3 days ago

Alfaxad/gemma2-2b-swahili-preview

Text Generation • Updated 6 days ago • 32 • 4

updated a Space 4 days ago

Running

🏆

Interesting finds

Collection

Very cool apps I wish I could build • 35 items • Updated 5 days ago

liked a dataset 5 days ago

NovaSky-AI/Sky-T1_data_17k

Viewer • Updated 5 days ago • 16.4k • 1.74k • 124

updated 2 collections 5 days ago

Datasets

Collection

Interesting datasets to help train LLMs and beyond • 21 items • Updated 5 days ago

Interesting finds

Collection

Very cool apps I wish I could build • 35 items • Updated 5 days ago

liked a model 5 days ago

NovaSky-AI/Sky-T1-32B-Preview

Text Generation • Updated 6 days ago • 7.51k • 476

updated a Space 6 days ago

Running

👀

Image Splitter

This is an app that breaks down the image into tiles

updated a Space 7 days ago

Running

💻

CLL Exp Annot

An annotation tool we made to select cells.

liked a Space 8 days ago

Running

156

🔥

Attention Visualization

Vision Transformer Attention Visualization

upvoted an article 8 days ago

Article

Upgrading Kokoro: natural TTS for short bursts

•

Nov 22, 2024

• 22

updated a Space 8 days ago

Sleeping

📚

CLL Exp Annot

An annotation tool we made to select cells.

updated a collection 8 days ago

Speech apps

Collection

Various applications to help deal with speech better. • 16 items • Updated 8 days ago

liked a Space 8 days ago

Running on Zero

1.16k

❤️

Kokoro TTS

Now in 5 languages!

reacted to hexgrad's post with 🔥 8 days ago

Post

15420

📣 Looking for labeled, high-quality synthetic audio/TTS data 📣 Have you been or are you currently calling API endpoints from OpenAI, ElevenLabs, etc? Do you have labeled audio data sitting around gathering dust? Let's talk! Join https://discord.gg/QuGxSWBfQy or comment down below.

If your data exceeds quantity & quality thresholds and is approved into the next hexgrad/Kokoro-82M training mix, and you permissively DM me the data under an effective Apache license, then I will DM back the corresponding voicepacks for YOUR data if/when the next Apache-licensed Kokoro base model drops.

What does this mean? If you've been calling closed-source TTS or audio API endpoints to:
- Build voice agents
- Make long-form audio, like audiobooks or podcasts
- Handle customer support, etc
Then YOU can contribute to the training mix and get useful artifacts in return. ❤️

More details at hexgrad/Kokoro-82M#21