Out-of-Distribution Detection for Monocular Depth Estimation

In monocular depth estimation, uncertainty estimation approaches mainly target the data uncertainty introduced by image noise. In contrast to prior work, we address the uncertainty due to lack of knowledge, which is relevant for the detection of data not represented by the training distribution, the so-called out-of-distribution (OOD) data. Motivated by anomaly detection, we propose to detect OOD images from an encoder-decoder depth estimation model based on the reconstruction error. Given the features extracted with the fixed depth encoder, we train an image decoder for image reconstruction using only in-distribution data. Consequently, OOD images result in a high reconstruction error, which we use to distinguish between in- and out-of-distribution samples. We built our experiments on the standard NYU Depth V2 and KITTI benchmarks as in-distribution data. Our post hoc method performs astonishingly well on different models and outperforms existing uncertainty estimation approaches without modifying the trained encoder-decoder depth estimation model.

[1]  Vasileios Belagiannis,et al.  Heatmap-based Out-of-Distribution Detection , 2022, 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).

[2]  Chetan Arora,et al.  Attention Attention Everywhere: Monocular Depth Prediction with Skip Attention , 2022, 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).

[3]  S. Mattoccia,et al.  MonoViT: Self-Supervised Monocular Depth Estimation with a Vision Transformer , 2022, 2022 International Conference on 3D Vision (3DV).

[4]  Vasileios Belagiannis,et al.  Gradient-based Uncertainty for Monocular Depth Estimation , 2022, ECCV.

[5]  Qiang Xu,et al.  Out-of-Distribution Detection with Semantic Mismatch under Masking , 2022, ECCV.

[6]  Zeynep Akata,et al.  BayesCap: Bayesian Identity Cap for Calibrated Uncertainty in Frozen Neural Networks , 2022, ECCV.

[7]  Xiaodong Gu,et al.  Neural Window Fully-connected CRFs for Monocular Depth Estimation , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[8]  Senqi Cao,et al.  Deep Hybrid Models for Out-of-Distribution Detection , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[9]  Zhizhong Li,et al.  ViM: Out-Of-Distribution with Virtual-logit Matching , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[10]  Yixuan Li,et al.  Unknown-Aware Object Detection: Learning What You Don't Know from Videos in the Wild , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[11]  Yibo Zhou Rethinking Reconstruction Autoencoder-Based Out-of-Distribution Detection , 2022, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[12]  Yixuan Li,et al.  VOS: Learning What You Don't Know by Virtual Outlier Synthesis , 2022, ICLR.

[13]  Gaofeng Meng,et al.  Out-of-distribution Detection with Boundary Aware Learning , 2021, ECCV.

[14]  Yixuan Li,et al.  ReAct: Out-of-distribution Detection With Rectified Activations , 2021, NeurIPS.

[15]  Yixuan Li,et al.  DICE: Leveraging Sparsification for Out-of-Distribution Detection , 2021, ECCV.

[16]  Yixuan Li,et al.  On the Importance of Gradients for Detecting Distributional Shifts in the Wild , 2021, NeurIPS.

[17]  Jaegul Choo,et al.  Standardized Max Logits: A Simple yet Effective Approach for Identifying Unexpected Road Obstacles in Urban-Scene Segmentation , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[18]  Yiran Chen,et al.  Mixture Outlier Exposure: Towards Out-of-Distribution Detection in Fine-grained Environments , 2021, 2023 IEEE/CVF Winter Conference on Applications of Computer Vision (WACV).

[19]  Rui Huang,et al.  MOS: Towards Scaling Out-of-distribution Detection for Large Semantic Space , 2021, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[20]  Ang Li,et al.  Neural Mean Discrepancy for Efficient Out-of-Distribution Detection , 2021, 2022 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[21]  Vladlen Koltun,et al.  Vision Transformers for Dense Prediction , 2021, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[22]  L. Aitchison,et al.  Bayesian OOD detection with aleatoric uncertainty and outlier exposure , 2021, 2102.12959.

[23]  M. Rottmann,et al.  Entropy Maximization and Meta Classification for Out-of-Distribution Detection in Semantic Segmentation , 2020, 2021 IEEE/CVF International Conference on Computer Vision (ICCV).

[24]  Peter Wonka,et al.  AdaBins: Depth Estimation Using Adaptive Bins , 2020, 2021 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[25]  S. Gelly,et al.  An Image is Worth 16x16 Words: Transformers for Image Recognition at Scale , 2020, ICLR.

[26]  Yixuan Li,et al.  Energy-based Out-of-distribution Detection , 2020, NeurIPS.

[27]  Stefano Mattoccia,et al.  On the Uncertainty of Self-Supervised Monocular Depth Estimation , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[28]  Z. Kira,et al.  Generalized ODIN: Detecting Out-of-Distribution Image Without Learning From Out-of-Distribution Data , 2020, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[29]  Naila Murray,et al.  Virtual KITTI 2 , 2020, ArXiv.

[30]  Dawn Song,et al.  Using Self-Supervised Learning Can Improve Model Robustness and Uncertainty , 2019, NeurIPS.

[31]  Thomas B. Schön,et al.  Evaluating Scalable Bayesian Deep Learning Methods for Robust Computer Vision , 2019, 2020 IEEE/CVF Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[32]  Bernd Bischl,et al.  Robust Anomaly Detection in Images using Adversarial Autoencoders , 2019, ECML/PKDD.

[33]  C. V. Jawahar,et al.  IDD: A Dataset for Exploring Problems of Autonomous Navigation in Unconstrained Environments , 2018, 2019 IEEE Winter Conference on Applications of Computer Vision (WACV).

[34]  Thomas G. Dietterich,et al.  Deep Anomaly Detection with Outlier Exposure , 2018, ICLR.

[35]  Andrea Vedaldi,et al.  Supervising the New with the Old: Learning SFM from SFM , 2018, ECCV.

[36]  Kibok Lee,et al.  A Simple Unified Framework for Detecting Out-of-Distribution Samples and Adversarial Attacks , 2018, NeurIPS.

[37]  Jörg Stückler,et al.  Deep Virtual Stereo Odometry: Leveraging Deep Depth Prediction for Monocular Direct Sparse Odometry , 2018, ECCV.

[38]  Gabriel J. Brostow,et al.  Digging Into Self-Supervised Monocular Depth Estimation , 2018, 2019 IEEE/CVF International Conference on Computer Vision (ICCV).

[39]  Bolei Zhou,et al.  Places: A 10 Million Image Database for Scene Recognition , 2018, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[40]  Kibok Lee,et al.  Training Confidence-calibrated Classifiers for Detecting Out-of-Distribution Samples , 2017, ICLR.

[41]  R. Srikant,et al.  Enhancing The Reliability of Out-of-distribution Image Detection in Neural Networks , 2017, ICLR.

[42]  Noah Snavely,et al.  Unsupervised Learning of Depth and Ego-Motion from Video , 2017, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[43]  Kilian Q. Weinberger,et al.  Snapshot Ensembles: Train 1, get M for free , 2017, ICLR.

[44]  Alex Kendall,et al.  What Uncertainties Do We Need in Bayesian Deep Learning for Computer Vision? , 2017, NIPS.

[45]  Charles Blundell,et al.  Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles , 2016, NIPS.

[46]  Kevin Gimpel,et al.  A Baseline for Detecting Misclassified and Out-of-Distribution Examples in Neural Networks , 2016, ICLR.

[47]  Oisin Mac Aodha,et al.  Unsupervised Monocular Depth Estimation with Left-Right Consistency , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[48]  Nassir Navab,et al.  Deeper Depth Prediction with Fully Convolutional Residual Networks , 2016, 2016 Fourth International Conference on 3D Vision (3DV).

[49]  Zoubin Ghahramani,et al.  Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning , 2015, ICML.

[50]  Rob Fergus,et al.  Depth Map Prediction from a Single Image using a Multi-Scale Deep Network , 2014, NIPS.

[51]  Andreas Geiger,et al.  Vision meets robotics: The KITTI dataset , 2013, Int. J. Robotics Res..

[52]  Derek Hoiem,et al.  Indoor Segmentation and Support Inference from RGBD Images , 2012, ECCV.

[53]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[54]  Peter Secretan Learning , 1965, Mental Health.

[55]  Dawn Song,et al.  Scaling Out-of-Distribution Detection for Real-World Settings , 2022, ICML.

[56]  Hao Zhou,et al.  Out-of-Distribution Identification: Let Detector Tell Which I Am Not Sure , 2022, ECCV.

[57]  Nitish Srivastava,et al.  Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[58]  A. Weigend,et al.  Estimating the mean and variance of the target probability distribution , 1994, Proceedings of 1994 IEEE International Conference on Neural Networks (ICNN'94).