Variance-Based Feature Importance in Neural Networks

This paper proposes a new method to measure the relative importance of features in Artificial Neural Network (ANN) models. Its underlying principle assumes that the more important a feature is, the more the weights connected to the corresponding input neuron will change during training. To capture this behavior, a running variance of every weight connected to the input layer is maintained during training. For this, an adaptation of Welford's online algorithm for computing variance is proposed. When training finishes, the variances of the weights connected to each input are combined with the final weights to obtain a measure of relative importance for each feature. This method was tested with shallow and deep neural network architectures on several well-known classification and regression problems. The results confirm that the approach produces meaningful measurements. Moreover, the importance scores correlate strongly with those produced by the variable importance method of Random Forests (RF).
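The abstract alone does not fix the implementation details, but the core mechanism it describes, Welford's online variance applied elementwise to the input-layer weight matrix after each training step, can be sketched as follows. The class name `WelfordWeightVariance`, the weight layout `(n_features, n_hidden)`, and the final combination rule (each weight's variance scaled by its absolute final value, summed over hidden neurons, then normalized) are illustrative assumptions, not the paper's exact formulation.

```python
import numpy as np


class WelfordWeightVariance:
    """Running variance of each input-layer weight via Welford's
    online algorithm (names and layout here are illustrative)."""

    def __init__(self, shape):
        self.count = 0
        self.mean = np.zeros(shape)  # running mean of each weight
        self.m2 = np.zeros(shape)    # running sum of squared deviations

    def update(self, weights):
        """Call once per training step with the current input-layer
        weight matrix of shape (n_features, n_hidden)."""
        self.count += 1
        delta = weights - self.mean
        self.mean += delta / self.count
        self.m2 += delta * (weights - self.mean)

    def variance(self):
        """Elementwise sample variance over the observed training steps."""
        if self.count < 2:
            return np.zeros_like(self.m2)
        return self.m2 / (self.count - 1)


def feature_importance(tracker, final_weights):
    """Combine per-weight variances with the final weights into one
    relative-importance score per feature. The abstract does not state
    the exact combination rule; weighting each variance by the absolute
    final weight and summing over hidden neurons is an assumption."""
    scores = (tracker.variance() * np.abs(final_weights)).sum(axis=1)
    return scores / scores.sum()  # normalize to relative importances
```

In use, `tracker.update(W)` would be called after each optimizer step on the first layer's weight matrix `W`; once training ends, `feature_importance(tracker, W)` yields one relative score per input feature.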