40 36 121

Stefano Fiorucci PRO

anakin87

AI & ML interests

Contributing to Haystack LLM framework 🏗️. Language Models: orchestration, post-training, synthetic data...

Recent Activity

liked a model about 15 hours ago

VAGOsolutions/SauerkrautLM-gemma-2-2b-it

View all activity

Articles

🇮🇹🇯🇵🇧🇷 Generating multilingual instruction datasets with Magpie 🐦‍⬛

Oct 21, 2024

• 18

Selective fine-tuning of Language Models with Spectrum

Sep 3, 2024

• 30

Organizations

anakin87's activity

upvoted a collection 15 days ago

alignment_24_best

Collection

33 items • Updated Oct 21, 2024 • 2

upvoted a paper 15 days ago

Unpacking DPO and PPO: Disentangling Best Practices for Learning from Preference Feedback

Paper • 2406.09279 • Published Jun 13, 2024 • 2

upvoted a paper 16 days ago

SPaR: Self-Play with Tree-Search Refinement to Improve Instruction-Following in Large Language Models

Paper • 2412.11605 • Published 19 days ago • 16

upvoted a paper 30 days ago

Reverse Thinking Makes LLMs Stronger Reasoners

Paper • 2411.19865 • Published Nov 29, 2024 • 19

upvoted a collection about 1 month ago

🇮🇹👓 LLaVA-NDiNO

Collection

HF Collection for the models of the paper "LLaVA-NDiNO: Empowering LLMs with Multimodality for the Italian Language" • 7 items • Updated Oct 20, 2024 • 3

upvoted 3 papers about 2 months ago

upvoted an article about 2 months ago

Article

SauerkrautLM's Multi-Phase Spectrum Training: A Technical Deep Dive

•

Nov 9, 2024

• 9

upvoted 2 articles 2 months ago

Article

Introducing GGUF-my-LoRA

•

Nov 1, 2024

• 12

Article

🇮🇹🇯🇵🇧🇷 Generating multilingual instruction datasets with Magpie 🐦‍⬛

•

Oct 21, 2024

• 18

upvoted an article 3 months ago

Article

Model2Vec: Distill a Small Fast Model from any Sentence Transformer

•

Oct 14, 2024

• 61

upvoted a paper 3 months ago

Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses

Paper • 2408.00584 • Published Aug 1, 2024 • 6

upvoted an article 4 months ago

Article

Selective fine-tuning of Language Models with Spectrum

•

Sep 3, 2024

• 30

upvoted a paper 5 months ago

Spectrum: Targeted Training on Signal to Noise Ratio

Paper • 2406.06623 • Published Jun 7, 2024 • 12

upvoted a collection 5 months ago

🧩 Verbalized Rebus @ CLiC-it 2024

Collection

Materials for the paper "Non Verbis, Sed Rebus: Large Language Models are Weak Solvers of Italian Rebuses" • 13 items • Updated Aug 5, 2024 • 3

upvoted 2 articles 5 months ago

Article

🔥 Argilla 2.0: the data-centric tool for AI makers 🤗

•

Jul 30, 2024

• 37

Article

MMLU-PRO-ITA a new eval for Italian LLMs

•

Jul 23, 2024

• 3

upvoted 2 articles 6 months ago

Article

Mixedbread 🤝 deepset: Announcing our New German/English Embedding Model

•

Jul 19, 2024

• 15

Article

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

•

Jun 4, 2024

• 73