
Jeremie Tisby

Frobenius

AI & ML interests

None yet

Recent Activity

liked a model 18 days ago
openai/whisper-large-v3-turbo
liked a model 18 days ago
NexaAIDev/OmniAudio-2.6B
liked a model 18 days ago
microsoft/OmniParser

Organizations

Hugging Face Discord Community

Frobenius's activity

replied to lewtun's post 20 days ago

Wow people... This is CRACKED! THANK YOU HF!!!

reacted to lewtun's post with 🔥 20 days ago
We outperform Llama 70B with Llama 3B on hard math by scaling test-time compute 🔥

How? By combining step-wise reward models with tree search algorithms :)

We show that smol models can match or exceed the performance of their much larger siblings when given enough "time to think"
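
To make "step-wise reward model plus tree search" concrete, here is a minimal sketch of verifier-guided beam search. It is not the recipe from the blog post: `expand` stands in for an LLM proposing next reasoning steps, `score` stands in for a step-wise reward model, and both toy functions (along with every name in the block) are hypothetical.

```python
from typing import Callable, List, Sequence

Path = List[str]  # a partial solution, one string per reasoning step


def beam_search(
    expand: Callable[[Path], Sequence[str]],
    score: Callable[[Path], float],
    beam_width: int,
    depth: int,
) -> Path:
    """Keep the `beam_width` best-scoring partial solutions at every step."""
    beams: List[Path] = [[]]
    for _ in range(depth):
        # Extend every surviving partial solution by every proposed step.
        candidates = [path + [step] for path in beams for step in expand(path)]
        if not candidates:
            break
        # The verifier ranks partial solutions; only the top ones survive.
        candidates.sort(key=score, reverse=True)
        beams = candidates[:beam_width]
    return max(beams, key=score)


# Toy stand-ins (assumptions for illustration only): each "step" adds a
# number, and the "reward" is higher the closer the running total is to 7.
def toy_expand(path: Path) -> Sequence[str]:
    return ["+1", "+2", "+3"]


def toy_score(path: Path) -> float:
    return -abs(7 - sum(int(step) for step in path))


best = beam_search(toy_expand, toy_score, beam_width=2, depth=3)
print(best, sum(int(step) for step in best))
```

Spending more test-time compute in this picture means a wider beam or more proposals per step, so the verifier gets more candidates to choose from.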

We're open sourcing the full recipe and sharing a detailed blog post.

In our blog post we cover:

📈 Compute-optimal scaling: How we implemented DeepMind's recipe to boost the mathematical capabilities of open models at test-time.

🎄 Diverse Verifier Tree Search (DVTS): An unpublished extension we developed to the verifier-guided tree search technique. This simple yet effective method improves diversity and delivers better performance, particularly at large test-time compute budgets.

🧭 Search and Learn: A lightweight toolkit for implementing search strategies with LLMs, built for speed with vLLM.
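
The post does not spell out how DVTS gets its diversity, so the following is only one plausible reading, labeled as an assumption: split the search into independent subtrees, one per distinct first step, and let the verifier guide a greedy search inside each. The function names, the toy proposer, and the toy scorer are all hypothetical and are not the Search and Learn API.

```python
from typing import Callable, List, Sequence

Path = List[str]  # a partial solution, one string per reasoning step


def diverse_tree_search(
    expand: Callable[[Path], Sequence[str]],
    score: Callable[[Path], float],
    n_subtrees: int,
    depth: int,
) -> List[Path]:
    """Assumed scheme: one independent, greedily searched subtree per root."""
    roots = list(expand([]))[:n_subtrees]
    results: List[Path] = []
    for root in roots:
        path: Path = [root]
        for _ in range(depth - 1):
            options = expand(path)
            if not options:
                break
            # Inside a subtree, follow the step the verifier scores highest.
            path = max((path + [step] for step in options), key=score)
        results.append(path)
    return results


# Toy stand-ins (assumptions for illustration only), targeting a total of 7.
def toy_expand(path: Path) -> Sequence[str]:
    return ["+1", "+2", "+3"]


def toy_score(path: Path) -> float:
    return -abs(7 - sum(int(step) for step in path))


solutions = diverse_tree_search(toy_expand, toy_score, n_subtrees=3, depth=3)
for solution in solutions:
    print(solution, sum(int(step) for step in solution))
```

Because subtrees never compete with each other, the final candidates are guaranteed to start differently, whereas a single shared beam can collapse onto near-identical paths as the budget grows.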

Here are the links:

- Blog post: HuggingFaceH4/blogpost-scaling-test-time-compute

- Code: https://github.com/huggingface/search-and-learn

Enjoy!
updated a collection 5 months ago