Abstract: Urban scenes contain man-made ground objects with complex structures and significant height differences, which makes it challenging to generate large-scale true digital orthophoto ...
This work presents Depth Anything, a highly practical solution for robust monocular depth estimation by training on a combination of 1.5M labeled images and 62M+ unlabeled images.