Samuele Colombo's picture

26 2 9

Samuele Colombo

FinancialSupport

·

https://www.linkedin.com/in/samuele-colombo-ml/

AI & ML interests

None yet

Recent Activity

reacted to anakin87's post with 👍 about 16 hours ago

𝐍𝐞𝐰 𝐈𝐭𝐚𝐥𝐢𝐚𝐧 𝐒𝐦𝐚𝐥𝐥 𝐋𝐚𝐧𝐠𝐮𝐚𝐠𝐞 𝐌𝐨𝐝𝐞𝐥𝐬: 𝐆𝐞𝐦𝐦𝐚 𝐍𝐞𝐨𝐠𝐞𝐧𝐞𝐬𝐢𝐬 𝐜𝐨𝐥𝐥𝐞𝐜𝐭𝐢𝐨𝐧 💎🌍🇮🇹 I am happy to release two new language models for the Italian Language! 💪 Gemma 2 9B Neogenesis ITA https://huggingface.co/anakin87/gemma-2-9b-neogenesis-ita Building on the impressive work by VAGO Solutions, I applied Direct Preference Optimization with a mix of Italian and English data. Using Spectrum, I trained 20% of model layers. 📊 Evaluated on the Open ITA LLM leaderboard (https://huggingface.co/spaces/mii-llm/open_ita_llm_leaderboard), this model achieves strong performance. To beat it on this benchmark, you'd need a 27B model 😎 🤏 Gemma 2 2B Neogenesis ITA https://huggingface.co/anakin87/gemma-2-2b-neogenesis-ita This smaller variant is fine-tuned from the original Gemma 2 2B it by Google. Through a combination of Supervised Fine-Tuning and Direct Preference Optimization, I trained 25% of the layers using Spectrum. 📈 Compared to the original model, it shows improved Italian proficiency, good for its small size. Both models were developed during the recent #gemma competition on Kaggle. 📓 Training code: https://www.kaggle.com/code/anakin87/post-training-gemma-for-italian-and-beyond 🙏 Thanks @FinancialSupport and mii-llm for the help during evaluation.

updated a dataset 4 days ago

mii-llm/requests

updated a dataset 4 days ago

mii-llm/results

View all activity

Organizations

FinancialSupport's activity

New activity in mgoin/Nemotron-4-340B-Instruct-hf-FP8 5 months ago

Good job

#2 opened 5 months ago by

FinancialSupport

New activity in iGeniusAI/Italia-9B-Instruct-v0.1 7 months ago

warning during loading of the model

#1 opened 7 months ago by

FinancialSupport

New activity in mii-llm/open_ita_llm_leaderboard 7 months ago

Cannot reproduce results: evaluation tasks not found

#15 opened 7 months ago by

Update model types

#14 opened 7 months ago by

New activity in sapienzanlp/modello-italia-9b 8 months ago

Update README.md

#1 opened 8 months ago by

FinancialSupport

New activity in rstless-research/DanteLLM-7B-Instruct-Italian-v0.1 8 months ago

Upload 3 files

#2 opened 8 months ago by

FinancialSupport

New activity in mii-llm/open_ita_llm_leaderboard 8 months ago

Update app.py

#13 opened 8 months ago by

Update app.py

#12 opened 8 months ago by

deleted DeepMount model that no longer exists

#11 opened 9 months ago by

New activity in FairMind/Minerva-3B-Instruct-v1.0 9 months ago

Update README.md

#1 opened 9 months ago by

FinancialSupport

New activity in swap-uniba/LLaMAntino-2-70b-hf-UltraChat-ITA 9 months ago

Update README.md

#2 opened 9 months ago by

FinancialSupport

New activity in FairMind/Llama-3-8B-4bit-UltraChat-Ita 9 months ago

Update README.md

#1 opened 9 months ago by

FinancialSupport

New activity in mii-llm/open_ita_llm_leaderboard 9 months ago

Update leaderboard_general.csv

#10 opened 9 months ago by

New activity in FinancialSupport/random_stuff 9 months ago

Librarian Bot: Add language metadata for dataset

#2 opened 9 months ago by

New activity in meta-llama/Meta-Llama-3-8B-Instruct 9 months ago

Non-English language capabilities

#2 opened 9 months ago by

New activity in mii-llm/open_ita_llm_leaderboard 9 months ago

Classifica RAG

#9 opened 10 months ago by

What benchmarks are used for the evaluation?

#6 opened 10 months ago by

New activity in mii-llm/open_ita_llm_leaderboard 10 months ago

What is `m_mmul` benchmark?

#7 opened 10 months ago by

Upload app.py

#8 opened 10 months ago by

Where is `General classification of Italian LLMs` CSV file?

#5 opened 10 months ago by