-
Faster Diffusion: Rethinking the Role of UNet Encoder in Diffusion Models
Paper β’ 2312.09608 β’ Published β’ 14 -
CodeFusion: A Pre-trained Diffusion Model for Code Generation
Paper β’ 2310.17680 β’ Published β’ 70 -
ZeroNVS: Zero-Shot 360-Degree View Synthesis from a Single Real Image
Paper β’ 2310.17994 β’ Published β’ 8 -
Progressive Knowledge Distillation Of Stable Diffusion XL Using Layer Level Loss
Paper β’ 2401.02677 β’ Published β’ 23
Collections
Discover the best community collections!
Collections including paper arxiv:2401.05252
-
One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning
Paper β’ 2306.07967 β’ Published β’ 24 -
Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
Paper β’ 2306.07954 β’ Published β’ 112 -
TryOnDiffusion: A Tale of Two UNets
Paper β’ 2306.08276 β’ Published β’ 72 -
Seeing the World through Your Eyes
Paper β’ 2306.09348 β’ Published β’ 33
-
FusionFrames: Efficient Architectural Aspects for Text-to-Video Generation Pipeline
Paper β’ 2311.13073 β’ Published β’ 57 -
MetaDreamer: Efficient Text-to-3D Creation With Disentangling Geometry and Texture
Paper β’ 2311.10123 β’ Published β’ 16 -
GPT4Motion: Scripting Physical Motions in Text-to-Video Generation via Blender-Oriented GPT Planning
Paper β’ 2311.12631 β’ Published β’ 13 -
VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models
Paper β’ 2312.00845 β’ Published β’ 37
-
Any-Size-Diffusion: Toward Efficient Text-Driven Synthesis for Any-Size HD Images
Paper β’ 2308.16582 β’ Published β’ 11 -
DreamSpace: Dreaming Your Room Space with Text-Driven Panoramic Texture Propagation
Paper β’ 2310.13119 β’ Published β’ 12 -
DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior
Paper β’ 2310.16818 β’ Published β’ 31 -
Text-to-3D with classifier score distillation
Paper β’ 2310.19415 β’ Published β’ 5
-
FreeU: Free Lunch in Diffusion U-Net
Paper β’ 2309.11497 β’ Published β’ 65 -
Imagic: Text-Based Real Image Editing with Diffusion Models
Paper β’ 2210.09276 β’ Published -
On Architectural Compression of Text-to-Image Diffusion Models
Paper β’ 2305.15798 β’ Published β’ 4 -
Wuerstchen: Efficient Pretraining of Text-to-Image Models
Paper β’ 2306.00637 β’ Published β’ 12
-
PhotoVerse: Tuning-Free Image Customization with Text-to-Image Diffusion Models
Paper β’ 2309.05793 β’ Published β’ 50 -
3D Gaussian Splatting for Real-Time Radiance Field Rendering
Paper β’ 2308.04079 β’ Published β’ 174 -
stabilityai/stable-diffusion-xl-base-1.0
Text-to-Image β’ Updated β’ 2.6M β’ β’ 6.25k -
Ryukijano/lora-trained-xl-kaggle-p100
Text-to-Image β’ Updated β’ 10 β’ β’ 1