
Rajdeep Borgohain

rbgo

RajdeepBorgohain

AI & ML interests

Solving language barriers.

rbgo's activity

reacted to benjamin-paine's post with 👍 2 days ago

Post

2233

Hello HuggingFace 🤗, and happy new year! 🎆

I'm thrilled to be releasing the first iteration of a project I've been working on for quite a while now. It's called Taproot, and it's a seamlessly scalable open-source AI/ML inference engine designed to let developers build real-time experiences across a small-to-mid-sized cluster, without the burden of hyperscale infrastructure.

Along with the server and task framework, there is a client library for Node.js and the browser. And what good is a server and client without an app to go alongside it? To that end, I'm also releasing Anachrovox, a fun, real-time, hands-free voice assistant that can run on mid-level devices in under 12 GB of VRAM, with web search, weather, and other tools. It uses my real-time browser wake-word library to detect utterances of the phrases 'Hey Vox', 'Hi Vox', 'Okay Vox', 'Anachrovox', or just 'Vox' (along with some others).

Releasing this many things at once will definitely result in bugs, so please report them when sighted! Thank you all!

Taproot: https://github.com/painebenjamin/taproot
Taproot JS Client: https://github.com/painebenjamin/taproot.js
Anachrovox: https://github.com/painebenjamin/anachrovox

The Anachrovox Spaces are networked together, balancing load across them to keep all front-ends responsive. You only have to choose what color you like the most!

benjamin-paine/anachrovox-emerald
benjamin-paine/anachrovox-amber
benjamin-paine/anachrovox-azure

11 replies

upvoted a collection 11 days ago

PaliGemma 2 Release

Collection

Vision-Language Models available in multiple 3B, 10B and 28B variants. • 23 items • Updated 21 days ago • 122

upvoted a paper 15 days ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published 15 days ago • 334

liked a dataset 17 days ago

NTU-NLP-sg/xCodeEval

Updated Jun 6, 2024 • 48.9k • 40

upvoted a collection 19 days ago

Qwen2.5

Collection

Qwen2.5 language models, including pretrained and instruction-tuned models of 7 sizes, including 0.5B, 1.5B, 3B, 7B, 14B, 32B, and 72B. • 45 items • Updated Nov 28, 2024 • 452

liked a Space 21 days ago

Running

312

💻

Qwen2.5 Turbo 1M Demo

upvoted 2 collections 23 days ago

Qwen2

Collection

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Nov 28, 2024 • 353

Llama 3.2

Collection

This collection hosts the transformers and original repos of the Llama 3.2 and Llama Guard 3 • 15 items • Updated 28 days ago • 551

upvoted a collection 24 days ago

Qwen

Collection

Qwen • 16 items • Updated Nov 28, 2024 • 14

updated a collection 29 days ago

All About LLMs

Collection

2 items • Updated 29 days ago

liked a Space 29 days ago

Running

📈

Number Tokenization Blog

liked a model about 1 month ago

Qwen/QwQ-32B-Preview

Text Generation • Updated Nov 29, 2024 • 105k • 1.49k

reacted to m-ric's post with 👍 about 1 month ago

Post

787

🔍 Meta teams use a fine-tuned Llama model to fix production issues in seconds

One of Meta's engineering teams shared how they use a fine-tuned small Llama (Llama-2-7B, so not even a very recent model) to identify the root cause of production issues with 42% accuracy.

🤔 42%, is that not too low?
➡️ Usually, whenever there's an issue in production, engineers dive into recent code changes to find the offending commit. At Meta's scale (thousands of daily changes), this is like finding a needle in a haystack.
💡 So when the LLM-based suggestion is right, it cuts incident resolution time from hours to seconds!

How did they do it?

🔄 Two-step approach:
‣ Heuristics (code ownership, directory structure, runtime graphs) reduce thousands of potential changes to a manageable set
‣ Fine-tuned Llama 2 7B ranks the most likely culprits
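The two-step shape described above (cheap heuristics first, then a ranking model) can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the `Change` fields, the ownership and directory heuristics, and the `keyword_score` stand-in are all hypothetical, and the post does not describe Meta's actual heuristics or how the fine-tuned Llama 2 7B is queried. In a real system the scoring function would call the ranking model rather than count shared words.

```python
from dataclasses import dataclass
from typing import Callable, List


@dataclass
class Change:
    """A hypothetical code change record (fields are illustrative assumptions)."""
    change_id: str
    owner_team: str
    directory: str
    summary: str


def heuristic_filter(changes: List[Change], incident_team: str,
                     incident_dirs: List[str],
                     max_candidates: int = 20) -> List[Change]:
    """Step 1: cheap heuristics shrink the candidate set.

    Keeps changes owned by the affected team or located under an affected
    directory, capped at max_candidates.
    """
    kept = [
        c for c in changes
        if c.owner_team == incident_team
        or any(c.directory.startswith(d) for d in incident_dirs)
    ]
    return kept[:max_candidates]


def rank_candidates(candidates: List[Change],
                    score: Callable[[Change], float]) -> List[Change]:
    """Step 2: order the surviving candidates, most likely culprit first."""
    return sorted(candidates, key=score, reverse=True)


def keyword_score(incident_text: str) -> Callable[[Change], float]:
    """Placeholder scorer (NOT a language model): counts words shared
    between the incident description and the change summary."""
    incident_words = set(incident_text.lower().split())

    def score(change: Change) -> float:
        return float(len(incident_words & set(change.summary.lower().split())))

    return score
```

Keeping the scorer as a plain callable means the placeholder can be swapped for a model-backed function without touching the filtering step.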

🎓 Training pipeline:
‣ Continued pre-training on Meta's internal docs and wikis
‣ Supervised fine-tuning on past incident investigations
‣ Training data mimicked real-world constraints (2-20 potential changes per incident)

🔮 Now future developments await:
‣ Language models could handle more of the incident response workflow (runbooks, mitigation, post-mortems)
‣ Improvements in model reasoning should boost accuracy further

Read it in full 👉 https://www.tryparity.com/blog/how-meta-uses-llms-to-improve-incident-response