InstructPix2Pix: Learning to Follow Image Editing Instructions
Abstract
We propose a method for editing images from human instructions: given an input image and a written instruction that tells the model what to do, our model follows these instructions to edit the image. To obtain training data for this problem, we combine the knowledge of two large pretrained models -- a language model (GPT-3) and a text-to-image model (Stable Diffusion) -- to generate a large dataset of image editing examples. Our conditional diffusion model, InstructPix2Pix, is trained on our generated data, and generalizes to real images and user-written instructions at inference time. Since it performs edits in the forward pass and does not require per example fine-tuning or inversion, our model edits images quickly, in a matter of seconds. We show compelling editing results for a diverse collection of input images and written instructions.
Community
EXAMPLE
deleted
InstructPix2Pix: Revolutionizing Image Editing with AI Instructions
Links π:
π Subscribe: https://www.youtube.com/@Arxflix
π Twitter: https://x.com/arxflix
π LMNT (Partner): https://lmnt.com/
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- ReEdit: Multimodal Exemplar-Based Image Editing with Diffusion Models (2024)
- OmniEdit: Building Image Editing Generalist Models Through Specialist Supervision (2024)
- Diffusion Self-Distillation for Zero-Shot Customized Image Generation (2024)
- AnyEdit: Mastering Unified High-Quality Image Editing for Any Idea (2024)
- UniReal: Universal Image Generation and Editing via Learning Real-world Dynamics (2024)
- InsightEdit: Towards Better Instruction Following for Image Editing (2024)
- LoRA of Change: Learning to Generate LoRA for the Editing Instruction from A Single Before-After Image Pair (2024)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 10
Browse 10 models citing this paperDatasets citing this paper 2
Spaces citing this paper 95
Collections including this paper 0
No Collection including this paper