Inishds
's Collections
3D recent
updated
4K4DGen: Panoramic 4D Generation at 4K Resolution
Paper
•
2406.13527
•
Published
•
8
Style-NeRF2NeRF: 3D Style Transfer From Style-Aligned Multi-View Images
Paper
•
2406.13393
•
Published
•
5
YouDream: Generating Anatomically Controllable Consistent Text-to-3D
Animals
Paper
•
2406.16273
•
Published
•
40
EVF-SAM: Early Vision-Language Fusion for Text-Prompted Segment Anything
Model
Paper
•
2406.20076
•
Published
•
9
GaussianDreamerPro: Text to Manipulable 3D Gaussians with Highly
Enhanced Quality
Paper
•
2406.18462
•
Published
•
12
SVG: 3D Stereoscopic Video Generation via Denoising Frame Matrix
Paper
•
2407.00367
•
Published
•
9
RealTalk: Real-time and Realistic Audio-driven Face Generation with 3D
Facial Prior-guided Identity Alignment Network
Paper
•
2406.18284
•
Published
•
19
Magic Insert: Style-Aware Drag-and-Drop
Paper
•
2407.02489
•
Published
•
20
CRiM-GS: Continuous Rigid Motion-Aware Gaussian Splatting from Motion
Blur Images
Paper
•
2407.03923
•
Published
•
7
UltraEdit: Instruction-based Fine-Grained Image Editing at Scale
Paper
•
2407.05282
•
Published
•
13
Tailor3D: Customized 3D Assets Editing and Generation with Dual-Side
Images
Paper
•
2407.06191
•
Published
•
12
RodinHD: High-Fidelity 3D Avatar Generation with Diffusion Models
Paper
•
2407.06938
•
Published
•
23
Vision language models are blind
Paper
•
2407.06581
•
Published
•
83
CrowdMoGen: Zero-Shot Text-Driven Collective Motion Generation
Paper
•
2407.06188
•
Published
•
1
Controlling Space and Time with Diffusion Models
Paper
•
2407.07860
•
Published
•
16
LLaVA-NeXT-Interleave: Tackling Multi-image, Video, and 3D in Large
Multimodal Models
Paper
•
2407.07895
•
Published
•
40
StyleSplat: 3D Object Style Transfer with Gaussian Splatting
Paper
•
2407.09473
•
Published
•
11
GRUtopia: Dream General Robots in a City at Scale
Paper
•
2407.10943
•
Published
•
23
Click-Gaussian: Interactive Segmentation to Any 3D Gaussians
Paper
•
2407.11793
•
Published
•
3
Animate3D: Animating Any 3D Model with Multi-view Video Diffusion
Paper
•
2407.11398
•
Published
•
8
DreamCatalyst: Fast and High-Quality 3D Editing via Controlling
Editability and Identity Preservation
Paper
•
2407.11394
•
Published
•
11
VD3D: Taming Large Video Diffusion Transformers for 3D Camera Control
Paper
•
2407.12781
•
Published
•
13
AppWorld: A Controllable World of Apps and People for Benchmarking
Interactive Coding Agents
Paper
•
2407.18901
•
Published
•
33
Bridging the Gap: Studio-like Avatar Creation from a Monocular Phone
Capture
Paper
•
2407.19593
•
Published
•
12
3D Question Answering for City Scene Understanding
Paper
•
2407.17398
•
Published
•
22
Cycle3D: High-quality and Consistent Image-to-3D Generation via
Generation-Reconstruction Cycle
Paper
•
2407.19548
•
Published
•
25
Theia: Distilling Diverse Vision Foundation Models for Robot Learning
Paper
•
2407.20179
•
Published
•
47
SpaRP: Fast 3D Object Reconstruction and Pose Estimation from Sparse
Views
Paper
•
2408.10195
•
Published
•
12