- EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
  Paper • 2402.17485 • Published • 190
- VividTalk: One-Shot Audio-Driven Talking Head Generation Based on 3D Hybrid Prior
  Paper • 2312.01841 • Published • 1
- MagicAnimate: Temporally Consistent Human Image Animation using Diffusion Model
  Paper • 2311.16498 • Published • 1
- GaussianAvatar: Towards Realistic Human Avatar Modeling from a Single Video via Animatable 3D Gaussians
  Paper • 2312.02134 • Published • 2
Collections including paper arxiv:2407.06938
- 4K4DGen: Panoramic 4D Generation at 4K Resolution
  Paper • 2406.13527 • Published • 8
- Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images
  Paper • 2406.13393 • Published • 5
- YouDream: Generating Anatomically Controllable Consistent Text-to-3D Animals
  Paper • 2406.16273 • Published • 41
- EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything Model
  Paper • 2406.20076 • Published • 9

- GECO: Generative Image-to-3D within a SECOnd
  Paper • 2405.20327 • Published • 10
- Ouroboros3D: Image-to-3D Generation via 3D-aware Recursive Diffusion
  Paper • 2406.03184 • Published • 19
- NPGA: Neural Parametric Gaussian Avatars
  Paper • 2405.19331 • Published • 10
- Unified Text-to-Image Generation and Retrieval
  Paper • 2406.05814 • Published • 12

- MotionLLM: Understanding Human Behaviors from Human Motions and Videos
  Paper • 2405.20340 • Published • 20
- Spectrally Pruned Gaussian Fields with Neural Compensation
  Paper • 2405.00676 • Published • 8
- Paint by Inpaint: Learning to Add Image Objects by Removing Them First
  Paper • 2404.18212 • Published • 27
- LoRA Land: 310 Fine-tuned LLMs that Rival GPT-4, A Technical Report
  Paper • 2405.00732 • Published • 119

- RecurrentGemma: Moving Past Transformers for Efficient Open Language Models
  Paper • 2404.07839 • Published • 43
- Direct Nash Optimization: Teaching Language Models to Self-Improve with General Preferences
  Paper • 2404.03715 • Published • 60
- MoMA: Multimodal LLM Adapter for Fast Personalized Image Generation
  Paper • 2404.05674 • Published • 14
- Agentless: Demystifying LLM-based Software Engineering Agents
  Paper • 2407.01489 • Published • 42

- 2D Gaussian Splatting for Geometrically Accurate Radiance Fields
  Paper • 2403.17888 • Published • 27
- TC4D: Trajectory-Conditioned Text-to-4D Generation
  Paper • 2403.17920 • Published • 16
- RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
  Paper • 2407.06938 • Published • 23
- TencentARC/InstantMesh
  Image-to-3D • Updated • 39.7k • 264

- ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models
  Paper • 2403.01807 • Published • 7
- TripoSR: Fast 3D Object Reconstruction from a Single Image
  Paper • 2403.02151 • Published • 12
- OOTDiffusion: Outfitting Fusion based Latent Diffusion for Controllable Virtual Try-on
  Paper • 2403.01779 • Published • 28
- MagicClay: Sculpting Meshes With Generative Neural Fields
  Paper • 2403.02460 • Published • 6

- TextureDreamer: Image-guided Texture Synthesis through Geometry-aware Diffusion
  Paper • 2401.09416 • Published • 10
- SHINOBI: Shape and Illumination using Neural Object Decomposition via BRDF Optimization In-the-wild
  Paper • 2401.10171 • Published • 13
- DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model
  Paper • 2311.09217 • Published • 21
- GALA: Generating Animatable Layered Assets from a Single Scan
  Paper • 2401.12979 • Published • 7

- DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaption by Combining 3D GANs and Diffusion Priors
  Paper • 2312.16837 • Published • 5
- Learning the 3D Fauna of the Web
  Paper • 2401.02400 • Published • 9
- Zero123++: a Single Image to Consistent Multi-view Diffusion Base Model
  Paper • 2310.15110 • Published • 2
- Zero-1-to-3: Zero-shot One Image to 3D Object
  Paper • 2303.11328 • Published • 5