Collections
Discover the best community collections!
Collections including paper arxiv:2402.17177
-
Generative AI meets 3D: A Survey on Text-to-3D in AIGC Era
Paper • 2305.06131 • Published • 2 -
Perpetual Humanoid Control for Real-time Simulated Avatars
Paper • 2305.06456 • Published • 1 -
Drag Your GAN: Interactive Point-based Manipulation on the Generative Image Manifold
Paper • 2305.10973 • Published • 33 -
LDM3D: Latent Diffusion Model for 3D
Paper • 2305.10853 • Published • 10
-
Brain2Music: Reconstructing Music from Human Brain Activity
Paper • 2307.11078 • Published • 40 -
Decoding speech from non-invasive brain recordings
Paper • 2208.12266 • Published • 4 -
Seeing through the Brain: Image Reconstruction of Visual Perception from Human Brain Signals
Paper • 2308.02510 • Published • 21 -
DreamDiffusion: Generating High-Quality Images from Brain EEG Signals
Paper • 2306.16934 • Published • 31
-
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
Paper • 2402.17177 • Published • 88 -
EMO: Emote Portrait Alive - Generating Expressive Portrait Videos with Audio2Video Diffusion Model under Weak Conditions
Paper • 2402.17485 • Published • 190 -
VisionLLaMA: A Unified LLaMA Interface for Vision Tasks
Paper • 2403.00522 • Published • 44 -
PixArt-Σ: Weak-to-Strong Training of Diffusion Transformer for 4K Text-to-Image Generation
Paper • 2403.04692 • Published • 39
-
Sora: A Review on Background, Technology, Limitations, and Opportunities of Large Vision Models
Paper • 2402.17177 • Published • 88 -
Mora: Enabling Generalist Video Generation via A Multi-Agent Framework
Paper • 2403.13248 • Published • 78 -
LLaVA-Plus: Learning to Use Tools for Creating Multimodal Agents
Paper • 2311.05437 • Published • 48 -
UniAff: A Unified Representation of Affordances for Tool Usage and Articulation with Vision-Language Models
Paper • 2409.20551 • Published • 14