Skip to main content


In smart mobility based road navigation, object detection, depth estimation and tracking are very important tasks for improvement of the environment perception quality. In the recent years, a surge of deep-learning based depth estimation methods for monocular cameras has lead to significant progress in this field. In this paper, we propose an evaluation of state-of-the-art depth estimation algorithms based on single single input on both the KITTI dataset and the recently published NUScenes dataset. The models evaluated in this paper include an unsupervised method (Monodepth2) and a supervised method (BTS). Our work lies in the elaboration of novel depth evaluation protocols, object depth evaluation and distance ranges evaluation. We validated our new protocols on both KITTI and NUScenes datasets, allowing us to get a more comprehensive evaluation for depth estimation, especially for applications in scene understanding for both road and rail environment.