Juan CM

jucamohedano
Ā·

AI & ML interests

Deep Learning and Robotics šŸš€šŸ¤–

Recent Activity

updated a model 2 months ago
jucamohedano/paligemma_a-okvqa
updated a model 4 months ago
jucamohedano/char-lstm-shakespeare
liked a dataset 4 months ago
karpathy/tiny_shakespeare
View all activity

Organizations

SomosNLP's profile picture scikit-learn's profile picture

jucamohedano's activity

upvoted an article 8 months ago
view article
Article

PaliGemma ā€“ Google's Cutting-Edge Open Vision Language Model

ā€¢ 233
reacted to merve's post with šŸš€ 8 months ago
view post
Post
1761
New open Vision Language Model by @Google : PaliGemma šŸ’™šŸ¤

šŸ“ Comes in 3B, pretrained, mix and fine-tuned models in 224, 448 and 896 resolution
šŸ§© Combination of Gemma 2B LLM and SigLIP image encoder
šŸ¤— Supported in transformers

PaliGemma can do..
šŸ§© Image segmentation and detection! šŸ¤Æ
šŸ“‘ Detailed document understanding and reasoning
šŸ™‹ Visual question answering, captioning and any other VLM task!

Read our blog šŸ”– hf.co/blog/paligemma
Try the demo šŸŖ€ hf.co/spaces/google/paligemma
Check out the Spaces and the models all in the collection šŸ“š google/paligemma-release-6643a9ffbf57de2ae0448dda
Collection of fine-tuned PaliGemma models google/paligemma-ft-models-6643b03efb769dad650d2dda
Ā·
upvoted an article 9 months ago
view article
Article

SeeMoE: Implementing a MoE Vision Language Model from Scratch

By AviSoori1x ā€¢
ā€¢ 34
upvoted 2 articles 9 months ago
view article
Article

Fine-tuning a large language model on Kaggle Notebooks (or even on your own computer) for solving real-world tasks

By lmassaron ā€¢
ā€¢ 14
view article
Article

Design choices for Vision Language Models in 2024

By gigant ā€¢
ā€¢ 25
upvoted an article 9 months ago
view article
Article

Vision Language Models Explained

ā€¢ 245
upvoted an article 9 months ago
view article
Article

Mixture of Experts Explained

ā€¢ 257