1 51 21

Krinal Joshi

krinal

kjdeveloper8

AI & ML interests

NLP, Speech

Recent Activity

reacted to nicolay-r's post with 👍 about 8 hours ago

📢 So far I been passioned about making NLP pipeline for handling iterator of texts with no-string dependency from besides third-party providers of your choice. By starting with text-translation, delighted to share the related notebooks that might save you time for handling your data ⭐ https://github.com/nicolay-r/nlp-thirdgate Example of using GoogleTranslate API in no-string for handling textual data iterators with spans: 📙 https://github.com/nicolay-r/nlp-thirdgate/blob/master/tutorials/translate_texts_with_spans_via_googletrans.ipynb The key concept is that all these API examples could be tied into a single pipeline using AREkit 📘 https://github.com/nicolay-r/AREkit 🛠️ The further plan is to popualte this repo with 1. NER (DeepPavlov models wrapper) 2. LLM with fancy out-of-the-box chain-of-thought declaration support.

liked a model about 9 hours ago

geneing/Kokoro

upvoted an article 2 days ago

Train 400x faster Static Embedding Models with Sentence Transformers

View all activity

Organizations

krinal's activity

upvoted an article 2 days ago

Article

Train 400x faster Static Embedding Models with Sentence Transformers

5 days ago

• 105

upvoted a paper 3 days ago

Tensor Product Attention Is All You Need

Paper • 2501.06425 • Published 8 days ago • 66

upvoted 2 articles 6 days ago

Article

Mastering Tensor Dimensions in Transformers

•

7 days ago

• 33

Article

Use Models from the Hugging Face Hub in LM Studio

•

Nov 28, 2024

• 132

upvoted a collection 6 months ago

Llama 3.1

Collection

This collection hosts the transformers and original repos of the Llama 3.1, Llama Guard 3 and Prompt Guard models • 11 items • Updated Dec 6, 2024 • 640

upvoted a paper 9 months ago

FlashSpeech: Efficient Zero-Shot Speech Synthesis

Paper • 2404.14700 • Published Apr 23, 2024 • 30

upvoted an article 9 months ago

Article

Welcome Llama 3 - Meta's new open LLM

Apr 18, 2024

• 282

upvoted a paper 12 months ago

Proactive Detection of Voice Cloning with Localized Watermarking

Paper • 2401.17264 • Published Jan 30, 2024 • 18

upvoted 12 papers about 1 year ago

mPLUG-Owl2: Revolutionizing Multi-modal Large Language Model with Modality Collaboration

Paper • 2311.04257 • Published Nov 7, 2023 • 20

Schrodinger Bridges Beat Diffusion Models on Text-to-Speech Synthesis

Paper • 2312.03491 • Published Dec 6, 2023 • 33

Multimodal Data and Resource Efficient Device-Directed Speech Detection with Large Foundation Models

Paper • 2312.03632 • Published Dec 6, 2023 • 4

Mamba: Linear-Time Sequence Modeling with Selective State Spaces

Paper • 2312.00752 • Published Dec 1, 2023 • 139

Segment and Caption Anything

Paper • 2312.00869 • Published Dec 1, 2023 • 18