FramePainter: Endowing Interactive Image Editing with Video Diffusion Priors Paper • 2501.08225 • Published 5 days ago • 17
MangaNinja: Line Art Colorization with Precise Reference Following Paper • 2501.08332 • Published 5 days ago • 50
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs Paper • 2501.06186 • Published 9 days ago • 56
LLaVA-Mini: Efficient Image and Video Large Multimodal Models with One Vision Token Paper • 2501.03895 • Published 12 days ago • 48
Sana Collection ⚡️Sana: Efficient High-Resolution Image Synthesis with Linear Diffusion Transformer • 19 items • Updated 11 days ago • 86
Diffusion as Shader: 3D-aware Video Diffusion for Versatile Video Generation Control Paper • 2501.03847 • Published 12 days ago • 22
Cosmos World Foundation Model Platform for Physical AI Paper • 2501.03575 • Published 12 days ago • 63
STAR: Spatial-Temporal Augmentation with Text-to-Video Models for Real-World Video Super-Resolution Paper • 2501.02976 • Published 13 days ago • 51
TransPixar: Advancing Text-to-Video Generation with Transparency Paper • 2501.03006 • Published 13 days ago • 22