FashionComposer: Compositional Fashion Image Generation Paper • 2412.14168 • Published 16 days ago • 16
Autoregressive Video Generation without Vector Quantization Paper • 2412.14169 • Published 16 days ago • 14
Arbitrary-steps Image Super-resolution via Diffusion Inversion Paper • 2412.09013 • Published 23 days ago • 11
FreeScale: Unleashing the Resolution of Diffusion Models via Tuning-Free Scale Fusion Paper • 2412.09626 • Published 22 days ago • 19
SnapGen: Taming High-Resolution Text-to-Image Models for Mobile Devices with Efficient Architectures and Training Paper • 2412.09619 • Published 22 days ago • 20
UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics Paper • 2412.07774 • Published 24 days ago • 25
DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation Paper • 2412.07589 • Published 25 days ago • 46
Florence-VL: Enhancing Vision-Language Models with Generative Vision Encoder and Depth-Breadth Fusion Paper • 2412.04424 • Published 29 days ago • 58
TryOffDiff: Virtual-Try-Off via High-Fidelity Garment Reconstruction using Diffusion Models Paper • 2411.18350 • Published Nov 27, 2024 • 23
CAT4D: Create Anything in 4D with Multi-View Video Diffusion Models Paper • 2411.18613 • Published Nov 27, 2024 • 50
Pathways on the Image Manifold: Image Editing via Video Generation Paper • 2411.16819 • Published Nov 25, 2024 • 30
VideoRepair: Improving Text-to-Video Generation via Misalignment Evaluation and Localized Refinement Paper • 2411.15115 • Published Nov 22, 2024 • 9
Stylecodes: Encoding Stylistic Information For Image Generation Paper • 2411.12811 • Published Nov 19, 2024 • 11
StableV2V: Stablizing Shape Consistency in Video-to-Video Editing Paper • 2411.11045 • Published Nov 17, 2024 • 11
MagicQuill: An Intelligent Interactive Image Editing System Paper • 2411.09703 • Published Nov 14, 2024 • 62
Adaptive Caching for Faster Video Generation with Diffusion Transformers Paper • 2411.02397 • Published Nov 4, 2024 • 23