Semantic monocular depth estimation based on artificial intelligence

Depth estimation provides essential information to perform autonomous driving and driver assistance. A promising line of work consists of introducing additional semantic information about the traffic scene when training CNNs for depth estimation. In practice, this means that the depth data used for...

Descripción completa

Detalles Bibliográficos
Autores: Gurram, Akhil|||0000-0002-2544-1510, Urfalioglu, Onay, Halfaoui, Ibrahim, Bouzaraa, Fahd, López Peña, Antonio M.|||0000-0002-6979-5783
Tipo de recurso: artículo
Fecha de publicación:2021
País:España
Institución:Universitat Autònoma de Barcelona
Repositorio:Dipòsit Digital de Documents de la UAB
Idioma:inglés
OAI Identifier:oai:ddd.uab.cat:274825
Acceso en línea:https://ddd.uab.cat/record/274825
https://dx.doi.org/urn:doi:10.1109/MITS.2019.2926263
Access Level:acceso abierto
Palabra clave:Monocular depth estimation
Semantic segmentation
Multi-task learning
Descripción
Sumario:Depth estimation provides essential information to perform autonomous driving and driver assistance. A promising line of work consists of introducing additional semantic information about the traffic scene when training CNNs for depth estimation. In practice, this means that the depth data used for CNN training is complemented with images having pixel-wise semantic labels where the same raw training data is associated with both types of ground truth, i.e., depth and semantic labels. The main contribution of this paper is to show that this hard constraint can be circumvented, i.e., that we can train CNNs for depth estimation by leveraging the depth and semantic information coming from heterogeneous datasets. In order to illustrate the benefits of our approach, we combine KITTI depth and Cityscapes semantic segmentation datasets, outperforming state-of-the-art results on monocular depth estimation.