Neural Network Uncertainty Assessment Using Bayesian Statistics: A Remote Sensing Application

Neural network (NN) techniques have proved successful for many regression problems, in particular for remote sensing; however, uncertainty estimates are rarely provided. In this article, a Bayesian technique to evaluate uncertainties of the NN parameters (i.e., synaptic weights) is first presented. In contrast to more traditional approaches based on point estimation of the NN weights, we assess uncertainties on such estimates to monitor the robustness of the NN model. These theoretical developments are illustrated by applying them to the problem of retrieving surface skin temperature, microwave surface emissivities, and integrated water vapor content from a combined analysis of satellite microwave and infrared observations over land. The weight uncertainty estimates are then used to compute analytically the uncertainties in the network outputs (i.e., error bars and correlation structure of these errors). Such quantities are very important for evaluating any application of an NN model. The uncertainties on the NN Jacobians are then considered in the third part of this article. Used for regression fitting, NN models can be used effectively to represent highly nonlinear, multivariate functions. In this situation, most emphasis is put on estimating the output errors, but almost no attention has been given to errors associated with the internal structure of the regression model. The complex structure of dependency inside the NN is the essence of the model, and assessing its quality, coherency, and physical character makes all the difference between a blackbox model with small output errors and a reliable, robust, and physically coherent model. Such dependency structures are described to the first order by the NN Jacobians: they indicate the sensitivity of one output with respect to the inputs of the model for given input data. We use a Monte Carlo integration procedure to estimate the robustness of the NN Jacobians. A regularization strategy based on principal component analysis is proposed to suppress the multicollinearities in order to make these Jacobians robust and physically meaningful.

[1]  C. Rodgers,et al.  Retrieval of atmospheric temperature and composition from remote measurements of thermal radiation , 1976 .

[2]  David J. C. MacKay,et al.  A Practical Bayesian Framework for Backpropagation Networks , 1992, Neural Computation.

[3]  Philippe Courtier,et al.  Unified Notation for Data Assimilation : Operational, Sequential and Variational , 1997 .

[4]  Vladimir M. Krasnopolsky,et al.  A Neural Network Multiparameter Algorithm for SSM/I Ocean Retrievals , 2000 .

[5]  Elie Bienenstock,et al.  Neural Networks and the Bias/Variance Dilemma , 1992, Neural Computation.

[6]  Ian T. Nabney,et al.  Netlab: Algorithms for Pattern Recognition , 2002 .

[7]  L. Crone,et al.  Statistical applications of a metric on subspaces to satellite meteorology , 1995 .

[8]  Filipe Aires,et al.  Inferring instantaneous, multivariate and nonlinear sensitivities for the analysis of feedback processes in a dynamical system: Lorenz model case‐study , 2003 .

[9]  Geoffrey E. Hinton,et al.  Learning internal representations by error propagation , 1986 .

[10]  V. Krasnopolsky,et al.  A neural network as a nonlinear transfer function model for retrieving surface wind speeds from the special sensor microwave imager , 1995 .

[11]  Heekuck Oh,et al.  Neural Networks for Pattern Recognition , 1993, Adv. Comput..

[12]  F. Aires,et al.  A new neural network approach including first guess for retrieval of atmospheric water vapor, cloud liquid water path, surface temperature, and emissivities over land from satellite microwave observations , 2001 .

[13]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[14]  Anders Krogh,et al.  Introduction to the theory of neural computation , 1994, The advanced book program.

[15]  Léon Personnaz,et al.  MLPs (Mono-Layer Polynomials and Multi-Layer Perceptrons) for Nonlinear Modeling , 2003, J. Mach. Learn. Res..

[16]  Léon Personnaz,et al.  Construction of confidence intervals for neural networks based on least squares estimation , 2000, Neural Networks.

[17]  C. Rodgers Characterization and Error Analysis of Profiles Retrieved From Remote Sounding Measurements , 1990 .

[18]  A. N. Tikhonov,et al.  Solutions of ill-posed problems , 1977 .

[19]  Yann LeCun,et al.  Optimal Brain Damage , 1989, NIPS.

[20]  A. Tarantola Inverse problem theory : methods for data fitting and model parameter estimation , 1987 .

[21]  C. T. Butler,et al.  Ocean surface wind retrievals from special sensor microwave imager data with neural networks , 1994 .

[22]  C. Prigent,et al.  Retrieval of surface and atmospheric parameters over land from SSM/I: Potential and limitations , 1999 .

[23]  Dan Cornford,et al.  Neural Network Modelling with Input Uncertainty: Theory and Application , 2000, J. VLSI Signal Process..

[24]  Geoffrey E. Hinton,et al.  Bayesian Learning for Neural Networks , 1995 .

[25]  F. Agterberg Introduction to Mathematics of Inversion in Remote Sensing and Indirect Measurements , 1979 .

[26]  Filipe Aires,et al.  Land surface skin temperatures from a combined analysis of microwave and infrared satellite observations for an all-weather evaluation of the differences between air and skin temperatures , 2003 .

[27]  Douglas M. Bates,et al.  Nonlinear Regression Analysis and Its Applications , 1988 .

[28]  Noëlle A. Scott,et al.  The "weight smoothing" regularization of MLP for Jacobian stabilization , 1999, IEEE Trans. Neural Networks.

[29]  Filipe Aires,et al.  Remote sensing from the infrared atmospheric sounding interferometer instrument 1. Compression, denoising, and first-guess retrieval algorithms , 2002 .

[30]  Joseph A. C. Delaney Sensitivity analysis , 2018, The African Continental Free Trade Area: Economic and Distributional Effects.