-
The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
Paper • 2311.10093 • Published • 56 -
Concept Sliders: LoRA Adaptors for Precise Control in Diffusion Models
Paper • 2311.12092 • Published • 21 -
DREAM: Diffusion Rectification and Estimation-Adaptive Models
Paper • 2312.00210 • Published • 14 -
HiFi Tuner: High-Fidelity Subject-Driven Fine-Tuning for Diffusion Models
Paper • 2312.00079 • Published • 14
Collections
Discover the best community collections!
Collections including paper arxiv:2311.10093
-
Instant3D: Instant Text-to-3D Generation
Paper • 2311.08403 • Published • 45 -
One-2-3-45++: Fast Single Image to 3D Objects with Consistent Multi-View Generation and 3D Diffusion
Paper • 2311.07885 • Published • 39 -
Drivable 3D Gaussian Avatars
Paper • 2311.08581 • Published • 46 -
DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model
Paper • 2311.09217 • Published • 21
-
De-Diffusion Makes Text a Strong Cross-Modal Interface
Paper • 2311.00618 • Published • 21 -
The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
Paper • 2311.10093 • Published • 56 -
Using Human Feedback to Fine-tune Diffusion Models without Any Reward Model
Paper • 2311.13231 • Published • 26 -
Diffusion Model Alignment Using Direct Preference Optimization
Paper • 2311.12908 • Published • 47
-
A Picture is Worth a Thousand Words: Principled Recaptioning Improves Image Generation
Paper • 2310.16656 • Published • 40 -
CommonCanvas: An Open Diffusion Model Trained with Creative-Commons Images
Paper • 2310.16825 • Published • 32 -
Matryoshka Diffusion Models
Paper • 2310.15111 • Published • 41 -
I2VGen-XL: High-Quality Image-to-Video Synthesis via Cascaded Diffusion Models
Paper • 2311.04145 • Published • 32
-
BitNet: Scaling 1-bit Transformers for Large Language Models
Paper • 2310.11453 • Published • 96 -
Self-RAG: Learning to Retrieve, Generate, and Critique through Self-Reflection
Paper • 2310.11511 • Published • 75 -
In-Context Learning Creates Task Vectors
Paper • 2310.15916 • Published • 42 -
Matryoshka Diffusion Models
Paper • 2310.15111 • Published • 41
-
DreamLLM: Synergistic Multimodal Comprehension and Creation
Paper • 2309.11499 • Published • 58 -
FoleyGen: Visually-Guided Audio Generation
Paper • 2309.10537 • Published • 8 -
Set-of-Mark Prompting Unleashes Extraordinary Visual Grounding in GPT-4V
Paper • 2310.11441 • Published • 26 -
The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
Paper • 2311.10093 • Published • 56