Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

This model belongs to the family of official Lotus models.
Some training normals in the Hypersim dataset are not properly oriented towards the camera. This model was re-trained using aligned surface normals, following GeoWizard, and achieves significantly improved results.
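To illustrate what "oriented towards the camera" means, here is a minimal sketch of one common way to align normals. It is not the preprocessing used by Lotus or GeoWizard. It assumes normals and viewing rays are given in camera coordinates, with each viewing ray pointing from the camera to the surface point. The names `orient_towards_camera`, `normals`, and `view_rays` are hypothetical.

```python
from typing import List, Tuple

Vec3 = Tuple[float, float, float]


def dot(a: Vec3, b: Vec3) -> float:
    """Dot product of two 3-vectors."""
    return a[0] * b[0] + a[1] * b[1] + a[2] * b[2]


def orient_towards_camera(normals: List[Vec3], view_rays: List[Vec3]) -> List[Vec3]:
    """Flip every normal that points away from the camera.

    Assumption: view_rays[i] points from the camera to the surface point,
    so a camera-facing normal has a non-positive dot product with it.
    """
    oriented = []
    for n, v in zip(normals, view_rays):
        if dot(n, v) > 0.0:
            # The normal points along the viewing ray, i.e. away from the camera.
            n = (-n[0], -n[1], -n[2])
        oriented.append(n)
    return oriented


# Two pixels seen along +z: the first normal points away and gets flipped.
aligned = orient_towards_camera(
    [(0.0, 0.0, 1.0), (0.0, 0.0, -1.0)],
    [(0.0, 0.0, 1.0), (0.0, 0.0, 1.0)],
)
```

After this step every normal in `aligned` has a non-positive dot product with its viewing ray, which is the consistency property the aligned training data is meant to have.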

Developed by: Jing He, Haodong Li, Wei Yin, Yixun Liang, Leheng Li, Kaiqiang Zhou, Hongbo Zhang, Bingbing Liu, Ying-Cong Chen

Usage

Please refer to this page.
