Vary: Scaling up the Vision Vocabulary for Large Vision-Language Models Paper • 2312.06109 • Published Dec 11, 2023 • 20
Distilling Vision-Language Models on Millions of Videos Paper • 2401.06129 • Published Jan 11, 2024 • 15
MLLMs-Augmented Visual-Language Representation Learning Paper • 2311.18765 • Published Nov 30, 2023 • 1