Rectified Diffusion: Straightness Is Not Your Need in Rectified Flow
Abstract
Diffusion models have greatly improved visual generation but are hindered by slow generation speed due to the computationally intensive nature of solving generative ODEs. Rectified flow, a widely recognized solution, improves generation speed by straightening the ODE path. Its key components include: 1) using the diffusion form of flow-matching, 2) employing boldsymbol v-prediction, and 3) performing rectification (a.k.a. reflow). In this paper, we argue that the success of rectification primarily lies in using a pretrained diffusion model to obtain matched pairs of noise and samples, followed by retraining with these matched noise-sample pairs. Based on this, components 1) and 2) are unnecessary. Furthermore, we highlight that straightness is not an essential training target for rectification; rather, it is a specific case of flow-matching models. The more critical training target is to achieve a first-order approximate ODE path, which is inherently curved for models like DDPM and Sub-VP. Building on this insight, we propose Rectified Diffusion, which generalizes the design space and application scope of rectification to encompass the broader category of diffusion models, rather than being restricted to flow-matching models. We validate our method on Stable Diffusion v1-5 and Stable Diffusion XL. Our method not only greatly simplifies the training procedure of rectified flow-based previous works (e.g., InstaFlow) but also achieves superior performance with even lower training cost. Our code is available at https://github.com/G-U-N/Rectified-Diffusion.
Community
In this paper, we propose Rectified Diffusion and show
- π The nature of rectification proposed in rectified flow is retraining with paired noise-sample.
- π The rectification is not limited to flow-matching models. We extend the application scope of rectification to general diffusion models. We achieve better performance with smaller training cost.
- π The essential training target of rectification is to obtain the first-order ODE instead of straight ODE path. For diffusion models like DDPM, the first-order ODE path is inherently curved.
- π We extend rectified diffusion to multiphase setting and surpass the performance of multiphase-based rectified flow method.
[paper]: https://arxiv.org/pdf/2410.07303
[GitHub]: https://github.com/G-U-N/Rectified-Diffusion/tree/master
[HuggingFace]: https://huggingface.co/wangfuyun/Rectified-Diffusion
This is an automated message from the Librarian Bot. I found the following papers similar to this paper.
The following papers were recommended by the Semantic Scholar API
- OSV: One Step is Enough for High-Quality Image to Video Generation (2024)
- FlowTurbo: Towards Real-time Flow-Based Image Generation with Velocity Refiner (2024)
- Distillation-Free One-Step Diffusion for Real-World Image Super-Resolution (2024)
- Target-Driven Distillation: Consistency Distillation with Target Timestep Selection and Decoupled Guidance (2024)
- Decouple-Then-Merge: Towards Better Training for Diffusion Models (2024)
Please give a thumbs up to this comment if you found it helpful!
If you want recommendations for any Paper on Hugging Face checkout this Space
You can directly ask Librarian Bot for paper recommendations by tagging it in a comment:
@librarian-bot
recommend
Models citing this paper 1
Datasets citing this paper 0
No dataset linking this paper
Spaces citing this paper 0
No Space linking this paper