Samuele Colombo

FinancialSupport

AI & ML interests

None yet

Recent Activity

reacted to anakin87's post with 👍 about 16 hours ago
New Italian Small Language Models: Gemma Neogenesis collection 💎🌍🇮🇹
I am happy to release two new language models for the Italian Language!
💪 Gemma 2 9B Neogenesis ITA https://huggingface.co/anakin87/gemma-2-9b-neogenesis-ita
Building on the impressive work by VAGO Solutions, I applied Direct Preference Optimization with a mix of Italian and English data. Using Spectrum, I trained 20% of model layers.
📊 Evaluated on the Open ITA LLM leaderboard (https://huggingface.co/spaces/mii-llm/open_ita_llm_leaderboard), this model achieves strong performance. To beat it on this benchmark, you'd need a 27B model 😎
🤏 Gemma 2 2B Neogenesis ITA https://huggingface.co/anakin87/gemma-2-2b-neogenesis-ita
This smaller variant is fine-tuned from the original Gemma 2 2B it by Google. Through a combination of Supervised Fine-Tuning and Direct Preference Optimization, I trained 25% of the layers using Spectrum.
📈 Compared to the original model, it shows improved Italian proficiency, good for its small size.
Both models were developed during the recent #gemma competition on Kaggle.
📓 Training code: https://www.kaggle.com/code/anakin87/post-training-gemma-for-italian-and-beyond
🙏 Thanks @FinancialSupport and mii-llm for the help during evaluation.
updated a dataset 4 days ago
mii-llm/requests
updated a dataset 4 days ago
mii-llm/results

Organizations

Everai
Interuniversity Research Centre for Public Services
mii-community
mii-llm
Coloss

FinancialSupport's activity

New activity in mgoin/Nemotron-4-340B-Instruct-hf-FP8 5 months ago

Good job

#2 opened 5 months ago by
FinancialSupport
New activity in sapienzanlp/modello-italia-9b 8 months ago

Update README.md

1
#1 opened 8 months ago by
FinancialSupport
New activity in mii-llm/open_ita_llm_leaderboard 8 months ago

Update app.py

#13 opened 8 months ago by
giux78

Update app.py

1
#12 opened 8 months ago by
giux78
New activity in FairMind/Minerva-3B-Instruct-v1.0 9 months ago

Update README.md

#1 opened 9 months ago by
FinancialSupport

Update README.md

#2 opened 9 months ago by
FinancialSupport
New activity in FairMind/Llama-3-8B-4bit-UltraChat-Ita 9 months ago

Update README.md

#1 opened 9 months ago by
FinancialSupport
New activity in mii-llm/open_ita_llm_leaderboard 9 months ago

Update leaderboard_general.csv

#10 opened 9 months ago by
giux78
New activity in mii-llm/open_ita_llm_leaderboard 9 months ago