Lotus: Diffusion-based Visual Foundation Model for High-quality Dense Prediction

  • This model belongs to the family of official Lotus models.
  • Compared to the previous version, this model is trained in disparity space (inverse depth), achieving better performance and more stable video depth estimation.

Paper Paper HuggingFace Demo GitHub

Developed by: Jing He✱, Haodong Li✱, Wei Yin, Yixun Liang, Leheng Li, Kaiqiang Zhou, Hongbo Zhang, Bingbing Liu, Ying-Cong Chenβœ‰

teaser teaser

Usage

Please refer to this page.

Downloads last month
547
Inference Providers NEW
This model isn't deployed by any Inference Provider. πŸ™‹ Ask for provider support

Space using jingheya/lotus-depth-d-v2-0-disparity 1