geronimo PRO

g-ronimo

AI & ML interests

fafo

Recent Activity

Articles

Organizations

AblateIt's profile picture Blog-explorers's profile picture

g-ronimo's activity

New activity in sayakpaul/vae-sd-imagenet-256-latents about 14 hours ago

purpose

1
#1 opened about 14 hours ago by
g-ronimo
upvoted an article 2 days ago
updated a model 4 days ago
reacted to hexgrad's post with ๐Ÿ”ฅ 8 days ago
view post
Post
3872
Merry Christmas! ๐ŸŽ„ Open sourced a small TTS model at hexgrad/Kokoro-82M
  • 2 replies
ยท
reacted to Xenova's post with ๐Ÿš€๐Ÿ”ฅโค๏ธ 15 days ago
view post
Post
2696
Introducing Moonshine Web: real-time speech recognition running 100% locally in your browser!
๐Ÿš€ Faster and more accurate than Whisper
๐Ÿ”’ Privacy-focused (no data leaves your device)
โšก๏ธ WebGPU accelerated (w/ WASM fallback)
๐Ÿ”ฅ Powered by ONNX Runtime Web and Transformers.js

Demo: webml-community/moonshine-web
Source code: https://github.com/huggingface/transformers.js-examples/tree/main/moonshine-web
ยท
reacted to lewtun's post with โค๏ธ 18 days ago
view post
Post
6631
We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute ๐Ÿ”ฅ

How? By combining step-wise reward models with tree search algorithms :)

We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think"

We're open sourcing the full recipe and sharing a detailed blog post.

In our blog post we cover:

๐Ÿ“ˆ Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test-time.

๐ŸŽ„ Diverse Verifier Tree Search (DVTS): An unpublished extension we developed to the verifier-guided tree search technique. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets.

๐Ÿงญ Search and Learn: A lightweight toolkit for implementing search strategies with LLMs and built for speed with vLLM

Here's the links:

- Blog post: HuggingFaceH4/blogpost-scaling-test-time-compute

- Code: https://github.com/huggingface/search-and-learn

Enjoy!
  • 2 replies
ยท
reacted to sayakpaul's post with โค๏ธ 25 days ago
view post
Post
2114
The Control family of Flux from @black-forest-labs should be discussed more!

It enables structural controls like ControlNets while being significantly less expensive to run!

So, we're working on a Control LoRA training script ๐Ÿค—

It's still WIP, so go easy:
https://github.com/huggingface/diffusers/pull/10130
New activity in onnx-community/BackgroundMattingV2-4k 26 days ago

Model ready?

#1 opened 26 days ago by
g-ronimo