Step-aware Preference Optimization: Aligning Preference with Denoising Performance at Each Step Paper • 2406.04314 • Published Jun 6, 2024 • 28
ReVideo: Remake a Video with Motion and Content Control Paper • 2405.13865 • Published May 22, 2024 • 23
LiteVAE: Lightweight and Efficient Variational Autoencoders for Latent Diffusion Models Paper • 2405.14477 • Published May 23, 2024 • 17
What matters when building vision-language models? Paper • 2405.02246 • Published May 3, 2024 • 101
PixArt-α: Fast Training of Diffusion Transformer for Photorealistic Text-to-Image Synthesis Paper • 2310.00426 • Published Sep 30, 2023 • 61