论文信息 - Calibration of Deep Probabilistic Models with Decoupled Bayesian Neural Networks

Calibration of Deep Probabilistic Models with Decoupled Bayesian Neural Networks

Abstract Deep Neural Networks (DNNs) have achieved state-of-the-art accuracy performance in many tasks. However, recent works have pointed out that the outputs provided by these models are not well-calibrated, seriously limiting their use in critical decision scenarios. In this work, we propose to use a decoupled Bayesian stage, implemented with a Bayesian Neural Network (BNN), to map the uncalibrated probabilities provided by a DNN to calibrated ones, consistently improving calibration. Our results evidence that incorporating uncertainty provides more reliable probabilistic models, a critical condition for achieving good calibration. We report a generous collection of experimental results using high-accuracy DNNs in standardized image classification benchmarks, showing the good performance, flexibility and robust behaviour of our approach with respect to several state-of-the-art calibration methods. Code for reproducibility is provided.

[1] Max Welling,et al. Improved Variational Inference with Inverse Autoregressive Flow , 2016, NIPS 2016.

[2] Daniel Ramos,et al. Deconstructing Cross-Entropy for Probabilistic Binary Classifiers , 2018, Entropy.

[3] Karthikeyan Shanmugam,et al. Confidence Scoring Using Whitebox Meta-models with Linear Classifier Probes , 2018, AISTATS.

[4] Xin Zhang,et al. End to End Learning for Self-Driving Cars , 2016, ArXiv.

[5] Jonathan Krause,et al. 3D Object Representations for Fine-Grained Categorization , 2013, 2013 IEEE International Conference on Computer Vision Workshops.

[6] A. Dawid. The Well-Calibrated Bayesian , 1982 .

[7] Maurizio Filippone,et al. Calibrating Deep Convolutional Gaussian Processes , 2018, AISTATS.

[8] Zoubin Ghahramani,et al. Bayesian Convolutional Neural Networks with Bernoulli Approximate Variational Inference , 2015, ArXiv.

[9] Graham W. Taylor,et al. Learning Confidence for Out-of-Distribution Detection in Neural Networks , 2018, ArXiv.

[10] Tara N. Sainath,et al. Deep Neural Networks for Acoustic Modeling in Speech Recognition: The Shared Views of Four Research Groups , 2012, IEEE Signal Processing Magazine.

[11] Zhuowen Tu,et al. Aggregated Residual Transformations for Deep Neural Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[12] Bohyung Han,et al. Learning for Single-Shot Confidence Calibration in Deep Neural Networks Through Stochastic Inferences , 2018, 2019 IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR).

[13] Bianca Zadrozny,et al. Obtaining calibrated probability estimates from decision trees and naive Bayesian classifiers , 2001, ICML.

[14] Jian Sun,et al. Identity Mappings in Deep Residual Networks , 2016, ECCV.

[15] Zoubin Ghahramani,et al. Sparse Gaussian Processes using Pseudo-inputs , 2005, NIPS.

[16] Stefano Ermon,et al. Accurate Uncertainties for Deep Learning Using Calibrated Regression , 2018, ICML.

[17] Kibok Lee,et al. Training Confidence-calibrated Classifiers for Detecting Out-of-Distribution Samples , 2017, ICLR.

[18] Dustin Tran,et al. Hierarchical Variational Models , 2015, ICML.

[19] Sunita Sarawagi,et al. Trainable Calibration Measures For Neural Networks From Kernel Mean Embeddings , 2018, ICML.

[20] David Duvenaud,et al. Inference Suboptimality in Variational Autoencoders , 2018, ICML.

[21] Christopher M. Bishop,et al. Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[22] Yoshua Bengio,et al. On integrating a language model into neural machine translation , 2017, Comput. Speech Lang..

[23] Charles Blundell,et al. Simple and Scalable Predictive Uncertainty Estimation using Deep Ensembles , 2016, NIPS.

[24] Jeffrey Dean,et al. Distributed Representations of Words and Phrases and their Compositionality , 2013, NIPS.

[25] Mark Sandler,et al. MobileNetV2: Inverted Residuals and Linear Bottlenecks , 2018, 2018 IEEE/CVF Conference on Computer Vision and Pattern Recognition.

[26] Jian Sun,et al. Deep Residual Learning for Image Recognition , 2015, 2016 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27] Ariel D. Procaccia,et al. Variational Dropout and the Local Reparameterization Trick , 2015, NIPS.

[28] Yarin Gal,et al. Uncertainty in Deep Learning , 2016 .

[29] Mykel J. Kochenderfer,et al. Amortized Inference Regularization , 2018, NeurIPS.

[30] Ole Winther,et al. Auxiliary Deep Generative Models , 2016, ICML.

[31] Max Welling,et al. Sylvester Normalizing Flows for Variational Inference , 2018, UAI.

[32] Andrew Zisserman,et al. Very Deep Convolutional Networks for Large-Scale Image Recognition , 2014, ICLR.

[33] Omkar M. Parkhi,et al. VGGFace2: A Dataset for Recognising Faces across Pose and Age , 2017, 2018 13th IEEE International Conference on Automatic Face & Gesture Recognition (FG 2018).

[34] Theodoros Damoulas,et al. Generalized Variational Inference: Three arguments for deriving new Posteriors , 2019 .

[35] John Platt,et al. Probabilistic Outputs for Support vector Machines and Comparisons to Regularized Likelihood Methods , 1999 .

[36] Max Welling,et al. Auto-Encoding Variational Bayes , 2013, ICLR.

[37] Johannes Gehrke,et al. Intelligible Models for HealthCare: Predicting Pneumonia Risk and Hospital 30-day Readmission , 2015, KDD.

[38] Rich Caruana,et al. Predicting good probabilities with supervised learning , 2005, ICML.

[39] Pietro Perona,et al. The Caltech-UCSD Birds-200-2011 Dataset , 2011 .

[40] Sebastian Nowozin,et al. Fixing Variational Bayes: Deterministic Variational Inference for Bayesian Neural Networks , 2018, ArXiv.

[41] Milos Hauskrecht,et al. Obtaining Well Calibrated Probabilities Using Bayesian Binning , 2015, AAAI.

[42] Tal Hassner,et al. Age and Gender Estimation of Unfiltered Faces , 2014, IEEE Transactions on Information Forensics and Security.

[43] Nikos Komodakis,et al. Wide Residual Networks , 2016, BMVC.

[44] Alex Kendall,et al. What Uncertainties Do We Need in Bayesian Deep Learning for Computer Vision? , 2017, NIPS.

[45] Tianqi Chen,et al. Stochastic Gradient Hamiltonian Monte Carlo , 2014, ICML.

[46] Moisés Goldszmidt,et al. Properties and Benefits of Calibrated Classifiers , 2004, PKDD.

[47] Zoubin Ghahramani,et al. Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning , 2015, ICML.

[48] Enhua Wu,et al. Squeeze-and-Excitation Networks , 2017, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[49] David Barber,et al. An Auxiliary Variational Method , 2004, ICONIP.

[50] Kilian Q. Weinberger,et al. On Calibration of Modern Neural Networks , 2017, ICML.

[51] Geoffrey E. Hinton,et al. Regularizing Neural Networks by Penalizing Confident Output Distributions , 2017, ICLR.

[52] N. Brummer,et al. On calibration of language recognition scores , 2006, 2006 IEEE Odyssey - The Speaker and Language Recognition Workshop.

[53] José Miguel Hernández-Lobato,et al. Ergodic Inference: Accelerate Convergence by Optimisation , 2018 .

[54] Michael Betancourt,et al. A Conceptual Introduction to Hamiltonian Monte Carlo , 2017, 1701.02434.

[55] Jimmy Ba,et al. Adam: A Method for Stochastic Optimization , 2014, ICLR.

[56] Stephen E. Fienberg,et al. The Comparison and Evaluation of Forecasters. , 1983 .

[57] Jeffrey Dean,et al. Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[58] Shakir Mohamed,et al. Variational Inference with Normalizing Flows , 2015, ICML.

[59] Niko Brümmer,et al. Measuring, refining and calibrating speaker and language information extracted from speech , 2010 .

[60] Jonathon Shlens,et al. Explaining and Harnessing Adversarial Examples , 2014, ICLR.

[61] Zoubin Ghahramani,et al. Variational Measure Preserving Flows , 2018, ArXiv.

[62] Lorenzo Rosasco,et al. Dirichlet-based Gaussian Processes for Large-scale Calibrated Classification , 2018, NeurIPS.

[63] Alexander M. Rush,et al. Semi-Amortized Variational Autoencoders , 2018, ICML.

[64] Daan Wierstra,et al. Stochastic Backpropagation and Approximate Inference in Deep Generative Models , 2014, ICML.

[65] Lukasz Kaiser,et al. Attention is All you Need , 2017, NIPS.

[66] Andrew Y. Ng,et al. Reading Digits in Natural Images with Unsupervised Feature Learning , 2011 .

[67] Shuicheng Yan,et al. Dual Path Networks , 2017, NIPS.

[68] Kilian Q. Weinberger,et al. Densely Connected Convolutional Networks , 2016, 2017 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[69] Juan José Murillo-Fuentes,et al. Inference in Deep Gaussian Processes using Stochastic Gradient Hamiltonian Monte Carlo , 2018, NeurIPS.

[70] Bianca Zadrozny,et al. Transforming classifier scores into accurate multiclass probability estimates , 2002, KDD.

[71] Samy Bengio,et al. Density estimation using Real NVP , 2016, ICLR.

[72] Radford M. Neal. MCMC Using Hamiltonian Dynamics , 2011, 1206.1901.

[73] Max Welling,et al. Multiplicative Normalizing Flows for Variational Bayesian Neural Networks , 2017, ICML.