PeppePasti's Collections
Computer Vision
DepthCrafter: Generating Consistent Long Depth Sequences for Open-world Videos
Paper • 2409.02095 • Published • 36
General OCR Theory: Towards OCR-2.0 via a Unified End-to-end Model
Paper • 2409.01704 • Published • 83
CDM: A Reliable Metric for Fair and Accurate Formula Recognition Evaluation
Paper • 2409.03643 • Published • 19
UniDet3D: Multi-dataset Indoor 3D Object Detection
Paper • 2409.04234 • Published • 7
Evaluating Multiview Object Consistency in Humans and Image Models
Paper • 2409.05862 • Published • 8
LEIA: Latent View-invariant Embeddings for Implicit 3D Articulation
Paper • 2409.06703 • Published • 2
Hi3D: Pursuing High-Resolution Image-to-3D Generation with Video Diffusion Models
Paper • 2409.07452 • Published • 20
Instant Facial Gaussians Translator for Relightable and Interactable Facial Rendering
Paper • 2409.07441 • Published • 10
InstantDrag: Improving Interactivity in Drag-based Image Editing
Paper • 2409.08857 • Published • 31
MIMO: Controllable Character Video Synthesis with Spatial Decomposed Modeling
Paper • 2409.16160 • Published • 33
Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction
Paper • 2409.18124 • Published • 32