- Language Agent Tree Search Unifies Reasoning Acting and Planning in Language Models
  Paper • 2310.04406 • Published • 8
- Chain-of-Thought Reasoning Without Prompting
  Paper • 2402.10200 • Published • 104
- ICDPO: Effectively Borrowing Alignment Capability of Others via In-context Direct Preference Optimization
  Paper • 2402.09320 • Published • 6
- Self-Discover: Large Language Models Self-Compose Reasoning Structures
  Paper • 2402.03620 • Published • 114
Collections including paper arxiv:2311.12229
- aMUSEd: An Open MUSE Reproduction
  Paper • 2401.01808 • Published • 28
- From Audio to Photoreal Embodiment: Synthesizing Humans in Conversations
  Paper • 2401.01885 • Published • 27
- SteinDreamer: Variance Reduction for Text-to-3D Score Distillation via Stein Identity
  Paper • 2401.00604 • Published • 4
- LARP: Language-Agent Role Play for Open-World Games
  Paper • 2312.17653 • Published • 31

- One-for-All: Generalized LoRA for Parameter-Efficient Fine-tuning
  Paper • 2306.07967 • Published • 24
- Rerender A Video: Zero-Shot Text-Guided Video-to-Video Translation
  Paper • 2306.07954 • Published • 112
- TryOnDiffusion: A Tale of Two UNets
  Paper • 2306.08276 • Published • 72
- Seeing the World through Your Eyes
  Paper • 2306.09348 • Published • 33

- NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation
  Paper • 2311.12229 • Published • 26
- Principled Instructions Are All You Need for Questioning LLaMA-1/2, GPT-3.5/4
  Paper • 2312.16171 • Published • 34
- DreamDistribution: Prompt Distribution Learning for Text-to-Image Diffusion Models
  Paper • 2312.14216 • Published • 10

- The Chosen One: Consistent Characters in Text-to-Image Diffusion Models
  Paper • 2311.10093 • Published • 56
- NeuroPrompts: An Adaptive Framework to Optimize Prompts for Text-to-Image Generation
  Paper • 2311.12229 • Published • 26
- Diffusion Model Alignment Using Direct Preference Optimization
  Paper • 2311.12908 • Published • 47
- VMC: Video Motion Customization using Temporal Attention Adaption for Text-to-Video Diffusion Models
  Paper • 2312.00845 • Published • 36

- Instant3D: Instant Text-to-3D Generation
  Paper • 2311.08403 • Published • 45
- One-2-3-45++: Fast Single Image to 3D Objects with Consistent Multi-View Generation and 3D Diffusion
  Paper • 2311.07885 • Published • 39
- Drivable 3D Gaussian Avatars
  Paper • 2311.08581 • Published • 46
- DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model
  Paper • 2311.09217 • Published • 21