Extracting the contribution of independent variables in neural network models: a new approach to handle instability

One of the main limitations of artificial neural networks (ANN) is their high inability to know in an explicit way the relations established between explanatory variables (input) and dependent variables (output). This is a major reason why they are usually called “black boxes.” In the last few years, several methods have been proposed to assess the relative importance of each explanatory variable. Nevertheless, it has not been possible to reach a consensus on which is the best-performing method. This is largely due to the different relative importance obtained for each variable depending on the method used. This importance also varies with the designed network architecture and/or with the initial random weights used to train the ANN. This paper proposes a procedure that seeks to minimize these problems and provides consistency in the results obtained from different methods. Essentially, the idea is to work with a set of neural networks instead of a single one. The proposed procedure is validated using a database collected from a customer satisfaction survey, which was conducted on the public transport system of Granada (Spain) in 2007. The results show that, when each method is applied independently, the variable’s importance rankings are similar and, in addition, coincide with the hierarchy established by researchers who have applied other techniques.

[1]  Darccedil,et al.  A neural network (NN) model to predict intersection crashes based upon driver, vehicle and roadway surface characteristics 1 , 2010 .

[2]  Timothy Masters,et al.  Practical neural network recipes in C , 1993 .

[3]  Frank Hansen,et al.  An approach for determining relative input parameter importance and significance in artificial neural networks , 2007 .

[4]  Tamás D. Gedeon,et al.  Balancing Bias and Variance: Network Topology and Pattern Set Reduction Techniques , 1995, IWANN.

[5]  Ramesh Sharda,et al.  Identifying significant predictors of injury severity in traffic accidents using a series of artificial neural networks. , 2006, Accident; analysis and prevention.

[6]  P. Werbos,et al.  Beyond Regression : "New Tools for Prediction and Analysis in the Behavioral Sciences , 1974 .

[7]  G. Bennington Foundations , 2007 .

[8]  Ali Kaveh,et al.  HARMONY SEARCH BASED ALGORITHMS FOR THE OPTIMUM COST DESIGN OF REINFORCED CONCRETE CANTILEVER RETAINING WALLS , 2011 .

[9]  Geoffrey E. Hinton,et al.  Learning representations by back-propagating errors , 1986, Nature.

[10]  Andrew Hunter,et al.  Application of neural networks and sensitivity analysis to improved prediction of trauma survival , 2000, Comput. Methods Programs Biomed..

[11]  Tamás D. Gedeon,et al.  Data Mining of Inputs: Analysing Magnitude and Functional Measures , 1997, Int. J. Neural Syst..

[12]  I. Dimopoulos,et al.  Role of some environmental variables in trout abundance models using neural networks , 1996 .

[13]  G. David Garson,et al.  Interpreting neural-network connection weights , 1991 .

[14]  Luigi dell’Olio,et al.  The quality of service desired by public transport users , 2011 .

[15]  Wei Tang,et al.  Ensembling neural networks: Many could be better than all , 2002, Artif. Intell..

[16]  Kurt Hornik,et al.  Multilayer feedforward networks are universal approximators , 1989, Neural Networks.

[17]  Uygar Özesmi,et al.  An artificial neural network approach to spatial habitat modelling with interspecific interaction , 1999 .

[18]  Yinghua Lin,et al.  A new approach to fuzzy-neural system modeling , 1995, IEEE Trans. Fuzzy Syst..

[19]  Andries Petrus Engelbrecht,et al.  Determining the Significance of Input Parameters using Sensitivity Analysis , 1995, IWANN.

[20]  Jingtao Yao,et al.  Forecasting and Analysis of Marketing Data Using Neural Networks , 1998, J. Inf. Sci. Eng..

[21]  Pizhong Qiao,et al.  Neural network committee-based sensitivity analysis strategy for geotechnical engineering problems , 2008, Neural Computing and Applications.

[22]  I-Cheng Yeh,et al.  First and second order sensitivity analysis of MLP , 2010, Neurocomputing.

[23]  Jacek M. Zurada,et al.  Sensitivity analysis for minimization of input data dimension for feedforward neural network , 1994, Proceedings of IEEE International Symposium on Circuits and Systems - ISCAS '94.

[24]  Yannis Dimopoulos,et al.  Use of some sensitivity criteria for choosing networks with good generalization ability , 1995, Neural Processing Letters.

[25]  Michael J. Watts,et al.  Using artificial neural networks to determine the relative contribution of abiotic factors influencing the establishment of insect pest species , 2008, Ecol. Informatics.

[26]  Paulo Cortez,et al.  Using sensitivity analysis and visualization techniques to open black box data mining models , 2013, Inf. Sci..

[27]  Shigeo Abe,et al.  Neural Networks and Fuzzy Systems , 1996, Springer US.

[28]  Michele Scardi,et al.  Developing an empirical model of phytoplankton primary production: a neural network case study , 1999 .

[29]  Juan José,et al.  Redes neuronales artificiales aplicadas al análisis de datos , 2002 .

[30]  Bart Kosko,et al.  Neural networks and fuzzy systems , 1998 .

[31]  James L. McClelland,et al.  Parallel distributed processing: explorations in the microstructure of cognition, vol. 1: foundations , 1986 .

[32]  Juan de Oña,et al.  Perceived service quality in bus transit service: A structural equation approach , 2013 .

[33]  Juan José Montaño,et al.  Redes neuronales artificiales: abriendo la caja negra.: abriendo la caja negra. , 2002 .

[34]  M. Gevrey,et al.  Two-way interaction of input variables in the sensitivity analysis of neural network models , 2006 .

[35]  L Mussone,et al.  An analysis of urban collisions using an artificial intelligence model. , 1999, Accident; analysis and prevention.

[36]  M. Gevrey,et al.  Review and comparison of methods to study the contribution of variables in artificial neural network models , 2003 .

[37]  Laura Eboli,et al.  How to Capture the Passengers’ Point of View on a Transit Service through Rating and Choice Options , 2010 .

[38]  Juan de Oña,et al.  A classification tree approach to identify key factors of transit service quality , 2012, Expert Syst. Appl..

[39]  Ali Azadeh,et al.  An adaptive neural network algorithm for assessment and improvement of job satisfaction with respect to HSE and ergonomics program: The case of a gas refinery , 2011 .

[40]  Amir Mohammadipour,et al.  The optimization of the geometric cross-section dimensions of raised pedestrian crosswalks: a case study in Qazvin. , 2009, Accident; analysis and prevention.

[41]  Paulo Cortez,et al.  Opening black box Data Mining models using Sensitivity Analysis , 2011, 2011 IEEE Symposium on Computational Intelligence and Data Mining (CIDM).

[42]  Andrew H. Sung,et al.  Ranking importance of input parameters of neural networks , 1998 .

[43]  Edward J. Rzempoluck,et al.  Neural Network Data Analysis Using Simulnet™ , 1997, Springer New York.

[44]  Mukta Paliwal,et al.  Assessing the contribution of variables in feed forward neural network , 2011, Appl. Soft Comput..

[45]  Alfonso Palmer,et al.  Numeric sensitivity analysis applied to feedforward neural networks , 2003, Neural Computing & Applications.

[46]  Luigi dell’Olio,et al.  Modelling user perception of bus transit quality , 2010 .

[47]  I. Dimopoulos,et al.  Application of neural networks to modelling nonlinear relationships in ecology , 1996 .

[48]  M. Poursha,et al.  Prediction of accident severity using artificial neural networks , 2011 .

[49]  Julian D. Olden,et al.  Illuminating the “black box”: a randomization approach for understanding variable contributions in artificial neural networks , 2002 .

[50]  Laura Eboli,et al.  Willingness-to-pay of public transport users for improvement in service quality , 2008 .

[51]  Sovan Lek,et al.  Improved estimation, using neural networks, of the food consumption of fish populations , 1995 .

[52]  Ken-ichi Funahashi,et al.  On the approximate realization of continuous mappings by neural networks , 1989, Neural Networks.

[53]  Martin T. Hagan,et al.  Neural network design , 1995 .

[54]  A. T. C. Goh,et al.  Back-propagation neural networks for modeling complex systems , 1995, Artif. Intell. Eng..

[55]  Russell G. Death,et al.  An accurate comparison of methods for quantifying variable importance in artificial neural networks using simulated data , 2004 .

[56]  Robert H. Kewley,et al.  Data strip mining for the virtual design of pharmaceuticals with neural networks , 2000, IEEE Trans. Neural Networks Learn. Syst..

[57]  D. Rumelhart Parallel Distributed Processing Volume 1: Foundations , 1987 .