论文信息 - Risk-Controlled Selective Prediction for Regression Deep Neural Network Models

Risk-Controlled Selective Prediction for Regression Deep Neural Network Models

Regression deep neural network (DNN) models have been successfully utilized in numerous fields. In real-world applications, large regression errors on individual samples may result in severe consequences. Selective techniques, also known as reject options, have been used to reject predictions with high uncertainty. However, they have yet been mainly considered in classification neural networks (NNs), in comparison to the limited work in regression NNs. In this paper, we considered the selective regression problem from a risk-coverage point of view, and proposed a method to construct a selective regression model given a trained regression DNN model and a desired regression error risk. Then, we proposed to utilize blending variance to quantify uncertainty in regression NNs. We evaluated both the proposed uncertainty function and selective regression models for two real-world applications, the tropical cyclone (TC) intensity estimation problem and the apparent age estimation problem. Our proposed methods achieved promising results. For example, for the TC intensity estimation problem, our selective regression model guaranteed a risk bound (in terms of the root mean squared error (RMSE)) of 9.5 knots for 75% test coverage with a guided confidence level of 0.05, whereas the RMSE value achieved by the state-of-the-art model without selection was 10.5 knots.

[1] Sergio Escalera,et al. ChaLearn Looking at People 2015: Apparent Age and Cultural Event Recognition Datasets and Results , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[2] C. K. Chow,et al. An optimum character recognition system using decision functions , 1957, IRE Trans. Electron. Comput..

[3] C. K. Chow,et al. On optimum recognition error and reject tradeoff , 1970, IEEE Trans. Inf. Theory.

[4] H. Chernoff. A Measure of Asymptotic Efficiency for Tests of a Hypothesis Based on the sum of Observations , 1952 .

[5] Ran El-Yaniv,et al. Selective Classification for Deep Neural Networks , 2017, NIPS.

[6] Radu Herbei,et al. Classification with reject option , 2006 .

[7] Todd Berendes,et al. Using Deep Learning for Tropical Cyclone Intensity Estimation , 2017 .

[8] Hsuan-Tien Lin,et al. Rotation-blended CNNs on a New Open Dataset for Tropical Cyclone Image-to-intensity Regression , 2018, KDD.

[9] Luc Van Gool,et al. DEX: Deep EXpectation of Apparent Age from a Single Image , 2015, 2015 IEEE International Conference on Computer Vision Workshop (ICCVW).

[10] Gábor Lugosi,et al. Prediction, learning, and games , 2006 .

[11] John Robert Mendoza,et al. A Convolutional Neural Network Approach for Estimating Tropical Cyclone Intensity Using Satellite-based Infrared Images , 2018, 2018 24th International Conference on Pattern Recognition (ICPR).

[12] Zoubin Ghahramani,et al. Dropout as a Bayesian Approximation: Representing Model Uncertainty in Deep Learning , 2015, ICML.

[13] Li Zhao,et al. Hoeffding bound based evolutionary algorithm for symbolic regression , 2012, Eng. Appl. Artif. Intell..

[14] W. Hoeffding. Probability Inequalities for sums of Bounded Random Variables , 1963 .

[15] Luc Van Gool,et al. Deep Expectation of Real and Apparent Age from a Single Image Without Facial Landmarks , 2016, International Journal of Computer Vision.

[16] Ran El-Yaniv,et al. On the Foundations of Noise-free Selective Classification , 2010, J. Mach. Learn. Res..

[17] Rahul Ramachandran,et al. Tropical Cyclone Intensity Estimation Using Deep Convolutional Neural Networks , 2018 .

[18] Asifullah Khan,et al. Wind power prediction using deep neural network based meta regression and transfer learning , 2017, Appl. Soft Comput..

[19] Xiaoli Li,et al. Deep Convolutional Neural Network Based Regression Approach for Estimation of Remaining Useful Life , 2016, DASFAA.

[20] Mario Vento,et al. A method for improving classification reliability of multilayer perceptrons , 1995, IEEE Trans. Neural Networks.

[21] Nitish Srivastava,et al. Dropout: a simple way to prevent neural networks from overfitting , 2014, J. Mach. Learn. Res..

[22] Ran El-Yaniv,et al. SelectiveNet: A Deep Neural Network with an Integrated Reject Option , 2019, ICML.

[23] Oded Maron,et al. Hoeffding Races : Model Selection for MRI Classification , 2007 .

[24] Guoying Zhao,et al. Recurrent Convolutional Neural Network Regression for Continuous Pain Intensity Estimation in Video , 2016, 2016 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[25] Tal Hassner,et al. Age and gender classification using convolutional neural networks , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition Workshops (CVPRW).

[26] Mario Vento,et al. To reject or not to reject: that is the question-an answer in case of neural classifiers , 2000, IEEE Trans. Syst. Man Cybern. Part C.