geronimo PRO

g-ronimo

AI & ML interests

fafo

Recent Activity

new activity about 14 hours ago

sayakpaul/vae-sd-imagenet-256-latents:purpose

upvoted an article 2 days ago

Fine-tune ModernBERT for text classification using synthetic data

liked a dataset 2 days ago

sayakpaul/vae-sd-imagenet-256-latents

Articles

SemScore: Evaluating LLMs with Semantic Similarity

Mar 9, 2024 • 12

Phinetuning 2.0

Jan 31, 2024 • 2

Organizations

g-ronimo's activity

upvoted an article 2 days ago

Fine-tune ModernBERT for text classification using synthetic data

Article • 5 days ago • 17

upvoted a paper 12 days ago

CLEAR: Conv-Like Linearization Revs Pre-Trained Diffusion Transformers Up

Paper • 2412.16112 • Published 14 days ago • 21

upvoted 2 papers about 1 month ago

VisualLens: Personalization through Visual History

Paper • 2411.16034 • Published Nov 25, 2024 • 16

UnifiedCrawl: Aggregated Common Crawl for Affordable Adaptation of LLMs on Low-Resource Languages

Paper • 2411.14343 • Published Nov 21, 2024 • 7

upvoted 2 papers about 2 months ago

Add-it: Training-Free Object Insertion in Images With Pretrained Diffusion Models

Paper • 2411.07232 • Published Nov 11, 2024 • 63

JanusFlow: Harmonizing Autoregression and Rectified Flow for Unified Multimodal Understanding and Generation

Paper • 2411.07975 • Published Nov 12, 2024 • 27

upvoted 2 articles 4 months ago

"Diffusers Image Fill" guide

Article • Sep 13, 2024 • 42

Extending Transformer layers as Painters to DiT's

Article • Aug 31, 2024 • 9

upvoted a paper 8 months ago

LoRA Learns Less and Forgets Less

Paper • 2405.09673 • Published May 15, 2024 • 87

upvoted 2 articles 8 months ago

Train custom AI models with the trainer API and adapt them to 🤗

Article • Jun 29, 2024 • 33

SeeMoE: Implementing a MoE Vision Language Model from Scratch

Article • Jun 23, 2024 • 34

upvoted a paper 9 months ago

How Good Are Low-bit Quantized LLaMA3 Models? An Empirical Study

Paper • 2404.14047 • Published Apr 22, 2024 • 44

upvoted 5 articles 9 months ago

seemore: Implement a Vision Language Model from Scratch

Article • Jun 23, 2024 • 69

Releasing Youtube-Commons: a massive open corpus for conversational and multimodal data

Article • Apr 18, 2024 • 22

On Coding Your First Attention

Article • Apr 21, 2024 • 7

Introducing Idefics2: A Powerful 8B Vision-Language Model for the community

Article • Apr 15, 2024 • 170

DS-MoE: Making MoE Models More Efficient and Less Memory-Intensive

Article • Apr 9, 2024 • 29

upvoted a paper 9 months ago

Aurora-M: The First Open Source Multilingual Language Model Red-teamed according to the U.S. Executive Order

Paper • 2404.00399 • Published Mar 30, 2024 • 41

upvoted a collection about 1 year ago

Journal Club

Collection

Candidate papers to read in the H4 journal club • 54 items • Updated Apr 21, 2024 • 28

upvoted a paper over 1 year ago

One Wide Feedforward is All You Need

Paper • 2309.01826 • Published Sep 4, 2023 • 31
