Omar Sanseviero's picture

Omar Sanseviero

osanseviero

AI & ML interests

Llamas, model merging, massive ASR for data collection, 3D ML, on-device ML, quantization, model judging, ML in browser, healthcare applications, education, intersection of art and ML.🦙

Recent Activity

Articles

Organizations

Google's profile picture Notebooks-explorers's profile picture scikit-learn-examples's profile picture BigScience Workshop's profile picture Neuropark's profile picture Spaces-explorers's profile picture Flax Community's profile picture Templates's profile picture Gensim's profile picture NLP en ES's profile picture Whisper Fine-Tuning Event's profile picture Keras's profile picture Hackathon Somos NLP 2023: Los LLMs hablan Español's profile picture Training Transformers Together's profile picture Spaces Examples's profile picture I Hackathon Somos NLP: PLN en Español's profile picture fast.ai community's profile picture SomosNLP's profile picture HugGAN Community's profile picture Gradio-Themes-Party's profile picture University of Groningen Workshop's profile picture AI Guru's profile picture Huggingface.js's profile picture Gradio-Blocks-Party's profile picture Data Days Zurich's profile picture Webhooks Explorers (BETA)'s profile picture JAX ♥️ Diffusers 🧨's profile picture Team 7's profile picture Open-Source AI Meetup's profile picture EuroPython 2022's profile picture fastai X Hugging Face Group 2022's profile picture ICML 2022's profile picture Language Tools's profile picture Platzi Community's profile picture Keras Dreambooth Event's profile picture Active Learning Example's profile picture CompVis Community's profile picture Stable Diffusion concepts library's profile picture DeepFloyd's profile picture Stable Diffusion Dreambooth Concepts Library's profile picture Whispering GPT's profile picture Open Generative AI's profile picture OpenShape's profile picture LocalCodeLLMs's profile picture Hugging Face Extreme-Scale's profile picture Hugging Face H4 Community's profile picture Blog-explorers's profile picture UniverseTBD's profile picture Hands-On Generative AI with Transformers and Diffusion Models's profile picture Editing Images's profile picture Hacktoberfest 2023's profile picture ICCV2023's profile picture huggingPartyParis's profile picture ZeroGPU Explorers's profile picture Editing Audio's profile picture T5 community's profile picture BERT community's profile picture gg-hf's profile picture Llamas's profile picture MLX Community's profile picture TTS AGI's profile picture Social Post Explorers's profile picture Kato's profile picture La Leaderboard's profile picture Dev Mode Explorers's profile picture Chinese LLMs on Hugging Face's profile picture Paris AI Running Club's profile picture gg-tt's profile picture ONNX Community's profile picture Distillation Hugs's profile picture Hugging Face Discord Community's profile picture Hugging Face Party @ PyTorch Conference's profile picture dummyosan's profile picture open/ acc's profile picture Data Is Better Together Contributor's profile picture

Posts 19

view post
Post
10118
Diaries of Open Source. Part 15 🤗

🕵️‍♀️Idefics 2 is out, a multimodal open-source model with very nice capabilities
Models, demo, and datasets: HuggingFaceM4/idefics2-661d1971b7c50831dd3ce0fe
Blog: https://hf.co/blog/idefics2

💾Snowflake released snowflake-arctic-embed, a family of powerful small embedding models
Model: Snowflake/snowflake-arctic-embed-m
Blog: https://www.snowflake.com/blog/introducing-snowflake-arctic-embed-snowflakes-state-of-the-art-text-embedding-family-of-models/

✨Pile-T5, EleutherAI's T5 model trained on 2T tokens
Blog: https://blog.eleuther.ai/pile-t5/
Models: EleutherAI/pile-t5-65a76a0d0022dd270b385a66
GitHub: https://github.com/EleutherAI/improved-t5

🤖CodeQwen1.5-7B base and chat models. Models trained on 3T tokens strong benchmark results for code generation, editing and SQL
Blog post: https://qwenlm.github.io/blog/codeqwen1.5/
Demo: Qwen/CodeQwen1.5-7b-Chat-demo
Models: Qwen/CodeQwen1.5-7B and Qwen/CodeQwen1.5-7B-Chat

Misc
🦉 DocOwl1.5: Unified Stucture Learning for OCR-free Document Understanding mPLUG/DocOwl
👀Cerule - a tiny Vision LM model Tensoic/Cerule-v0.1
ChemLLM - a LLM for chemistry and molecule science ⚗️https://hf.co/AI4Chem/ChemLLM-7B-Chat-1.5-DPO
Distil Whisper Large
📝New pdf/OCR datasets with 19 samples pixparse/pdf-document-ocr-datasets-660701430b0346f97c4bc628
🔥Gretel AI high quality text-to-sql synthetic dataset gretelai/synthetic_text_to_sql
view post
Post
9571
Diaries of Open Source. Part 14 🤗

🔥CohereForAI releases Command R+, an open 104B model with:
- Tool usage capabilities
- Specialized in RAGs
- Multilingual
It's one of the first models to surpass GPT-4 in the lmsys arena, check it out!
Model: CohereForAI/c4ai-command-r-plus
Official demo: https://hf.co/spaces/CohereForAI/c4ai-command-r-plus
Quantized: CohereForAI/c4ai-command-r-plus-4bit

🎉Google releases a new version of their Gemma instruct models, with improved quality, nicer to converse, and a fancier RL algorithm. The model is similar to Llama 2 70B in the Chat Arena!
Models: google/gemma-release-65d5efbccdbb8c4202ec078b
Try it out in HuggingChat https://hf.co/chat/models/google/gemma-1.1-7b-it

🪄VoiceCraft, a speech editing and TTS SOTA open model
Paper: VoiceCraft: Zero-Shot Speech Editing and Text-to-Speech in the Wild (2403.16973)
Model: pyp1/VoiceCraft

💻Google released CodeGemma, a family of code generation, completion, and chat models
Blog post: https://hf.co/blog/codegemma
Models: google/codegemma-release-66152ac7b683e2667abdee11
Report: https://storage.googleapis.com/deepmind-media/gemma/codegemma_report.pdf

Misc models:
🦖T-Rex2, a very powerful object detection model for many applications https://github.com/IDEA-Research/T-Rex
👀 CT-RATE : A 3D dataset paired with text reports ibrahimhamamci/CT-RATE
🐙Octopus v2: a Gemma-based model trained for Android API - extremely fast, better than Llama+RAG, great results NexaAIDev/Octopus-v2