Flash Diffusion Collection Collection of models distilled using the method proposed in Flash Diffusion paper • 7 items • Updated Jun 18, 2024 • 15
Sana Collection ⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 17 items • Updated 15 days ago • 66
DeTikZify Collection Synthesizing Graphics Programs for Scientific Figures and Sketches with TikZ • 11 items • Updated about 1 month ago • 7
SigLIP Collection Contrastive (sigmoid) image-text models from https://arxiv.org/abs/2303.15343 • 10 items • Updated 21 days ago • 48
LLaVA-o1: Let Vision Language Models Reason Step-by-Step Paper • 2411.10440 • Published Nov 15, 2024 • 111
InstructPix2Pix: Learning to Follow Image Editing Instructions Paper • 2211.09800 • Published Nov 17, 2022 • 3
Article Fine-tuning Florence-2 - Microsoft's Cutting-edge Vision Language Models Jun 24, 2024 • 181
Post 3113 NEW: Open Source Text/Image to video model is out - MIT licensed - Rivals Gen-3, Pika & Kling 🔥
> Pyramid Flow: Training-efficient Autoregressive Video Generation method
> Utilizes Flow Matching
> Trains on open-source datasets
> Generates high-quality 10-second videos
> Video resolution: 768p
> Frame rate: 24 FPS
> Supports image-to-video generation
> Model checkpoints available on the hub 🤗: rain1011/pyramid-flow-sd3
👍 11 🔥 7 👀 3