SPAR3D: Stable Point-Aware Reconstruction of 3D Objects from Single Images Paper β’ 2501.04689 β’ Published 9 days ago β’ 16
Chirpy3D: Continuous Part Latents for Creative 3D Bird Generation Paper β’ 2501.04144 β’ Published 10 days ago β’ 17
On Computational Limits and Provably Efficient Criteria of Visual Autoregressive Models: A Fine-Grained Complexity Analysis Paper β’ 2501.04377 β’ Published 10 days ago β’ 13
Are VLMs Ready for Autonomous Driving? An Empirical Study from the Reliability, Data, and Metric Perspectives Paper β’ 2501.04003 β’ Published 10 days ago β’ 23
The GAN is dead; long live the GAN! A Modern GAN Baseline Paper β’ 2501.05441 β’ Published 8 days ago β’ 77
Multi-subject Open-set Personalization in Video Generation Paper β’ 2501.06187 β’ Published 7 days ago β’ 10
LlamaV-o1: Rethinking Step-by-step Visual Reasoning in LLMs Paper β’ 2501.06186 β’ Published 7 days ago β’ 55
MinMo: A Multimodal Large Language Model for Seamless Voice Interaction Paper β’ 2501.06282 β’ Published 8 days ago β’ 32