- Rolling Diffusion Models
  Paper • 2402.09470 • Published • 11
- DreamMatcher: Appearance Matching Self-Attention for Semantically-Consistent Text-to-Image Personalization
  Paper • 2402.09812 • Published • 14
- Self-Play Fine-Tuning of Diffusion Models for Text-to-Image Generation
  Paper • 2402.10210 • Published • 33
- Animated Stickers: Bringing Stickers to Life with Video Diffusion
  Paper • 2402.06088 • Published • 10
Collections including paper arxiv:2402.06088
- Mastering Text-to-Image Diffusion: Recaptioning, Planning, and Generating with Multimodal LLMs
  Paper • 2401.11708 • Published • 30
- Weaver: Foundation Models for Creative Writing
  Paper • 2401.17268 • Published • 44
- PokéLLMon: A Human-Parity Agent for Pokémon Battles with Large Language Models
  Paper • 2402.01118 • Published • 31
- Training-Free Consistent Text-to-Image Generation
  Paper • 2402.03286 • Published • 66

- BlockFusion: Expandable 3D Scene Generation using Latent Tri-plane Extrapolation
  Paper • 2401.17053 • Published • 32
- Can Mamba Learn How to Learn? A Comparative Study on In-Context Learning Tasks
  Paper • 2402.04248 • Published • 31
- DeepSeekMath: Pushing the Limits of Mathematical Reasoning in Open Language Models
  Paper • 2402.03300 • Published • 79
- WebLINX: Real-World Website Navigation with Multi-Turn Dialogue
  Paper • 2402.05930 • Published • 39

- CreativeSynth: Creative Blending and Synthesis of Visual Arts based on Multimodal Diffusion
  Paper • 2401.14066 • Published • 9
- UNIMO-G: Unified Image Generation through Multimodal Conditional Diffusion
  Paper • 2401.13388 • Published • 11
- Animated Stickers: Bringing Stickers to Life with Video Diffusion
  Paper • 2402.06088 • Published • 10

- Make Pixels Dance: High-Dynamic Video Generation
  Paper • 2311.10982 • Published • 68
- Emu Video: Factorizing Text-to-Video Generation by Explicit Image Conditioning
  Paper • 2311.10709 • Published • 25
- AutoStory: Generating Diverse Storytelling Images with Minimal Human Effort
  Paper • 2311.11243 • Published • 15
- LivePhoto: Real Image Animation with Text-guided Motion Control
  Paper • 2312.02928 • Published • 17

- Story-to-Motion: Synthesizing Infinite and Controllable Character Animation from Long Text
  Paper • 2311.07446 • Published • 29
- MagicScroll: Nontypical Aspect-Ratio Image Generation for Visual Storytelling via Multi-Layered Semantic-Aware Denoising
  Paper • 2312.10899 • Published • 14
- Animated Stickers: Bringing Stickers to Life with Video Diffusion
  Paper • 2402.06088 • Published • 10
- Vision-Based Hand Gesture Customization from a Single Demonstration
  Paper • 2402.08420 • Published • 9