- Groma: Localized Visual Tokenization for Grounding Multimodal Large Language Models
  Paper • 2404.13013 • Published • 30
- Toward Self-Improvement of LLMs via Imagination, Searching, and Criticizing
  Paper • 2404.12253 • Published • 54
- Data-Efficient Contrastive Language-Image Pretraining: Prioritizing Data Quality over Quantity
  Paper • 2403.12267 • Published
- No More Adam: Learning Rate Scaling at Initialization is All You Need
  Paper • 2412.11768 • Published • 41
Oliver Wei
Oliver2021
AI & ML interests
None yet
Recent Activity
liked a Space 5 days ago: fishaudio/fish-speech-1
liked a dataset 11 days ago: LanguageBind/Open-Sora-Plan-v1.0.0
liked a dataset 11 days ago: Otolith/FishCLIP
Organizations
None yet
Collections: 1
models
None public yet
datasets
None public yet