论文信息 - Learn-By-Calibrating: Using Calibration As A Training Objective

Learn-By-Calibrating: Using Calibration As A Training Objective

Calibration error is commonly adopted for evaluating the quality of uncertainty estimators in deep neural networks. In this paper, we argue that such a metric is highly beneficial for training predictive models, even when we do not explicitly measure the uncertainties. This is conceptually similar to heteroscedastic neural networks that produce variance estimates for each prediction, with the key difference that we do not place a Gaussian prior on the predictions. We propose a novel algorithm that performs simultaneous interval estimation for different calibration levels and effectively leverages the intervals to refine the mean estimates. Our results show that, our approach is consistently superior to existing regularization strategies in deep regression models. Finally, we propose to augment partial dependence plots, a model-agnostic interpretability tool, with expected prediction intervals to reveal interesting dependencies between data and the target.

Deepta Rajan | Bindya Venkatesh | Jayaraman J. Thiagarajan | Deepta Rajan | Bindya Venkatesh

[1] J. Friedman. Greedy function approximation: A gradient boosting machine. , 2001 .

[2] Zoubin Ghahramani,et al. Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning , 2015, ICML.

[3] Laurence Perreault Levasseur,et al. Uncertainties in Parameters Estimated with Neural Networks: Application to Strong Gravitational Lensing , 2017, 1708.08843.

[4] Siegfried Wahl,et al. Leveraging uncertainty information from deep neural networks for disease detection , 2016, Scientific Reports.

[5] Ralph C. Smith,et al. Uncertainty Quantification: Theory, Implementation, and Applications , 2013 .

[6] Alex Kendall,et al. What Uncertainties Do We Need in Bayesian Deep Learning for Computer Vision? , 2017, NIPS.

[7] Charles Blundell,et al. Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles , 2016, NIPS.

[8] Tom Heskes,et al. Practical Confidence and Prediction Intervals , 1996, NIPS.

[9] Alex Kendall,et al. Concrete Dropout , 2017, NIPS.

[10] Anne E Carpenter,et al. Opportunities and obstacles for deep learning in biology and medicine , 2017, bioRxiv.

[11] Yarin Gal,et al. Uncertainty in Deep Learning , 2016 .

[12] Zoubin Ghahramani,et al. Probabilistic machine learning and artificial intelligence , 2015, Nature.

[13] Stefano Ermon,et al. Accurate Uncertainties for Deep Learning Using Calibrated Regression , 2018, ICML.