论文信息 - Bootstrap re-sampling for unbalanced data in supervised learning

Bootstrap re-sampling for unbalanced data in supervised learning

Abstract This paper presents a technical framework to assess the impact of re-sampling on the ability of a supervised learning to correctly learn a classification problem. We use the bootstrap expression of the prediction error to identify the optimal re-sampling proportions in binary classification experiments using artificial neural networks. Based on Bayes decision rule and the a priori distribution of the objective data, an estimate for the optimal re-sampling proportion is derived as well as upper and lower bounds for the exact optimal proportion. The analytical considerations to extend the present method to cross-validation and multiple classes are also illustrated.

Georges Dupret | Masato Koda | M. Koda | G. Dupret

[1] Masato Koda. Stochastic sensitivity analysis and Langevin simulation for neural network learning , 1997 .

[2] David M. Allen,et al. The Relationship Between Variable Selection and Data Agumentation and a Method for Prediction , 1974 .

[3] Timothy Masters,et al. Practical neural network recipes in C , 1993 .

[4] Simon Haykin,et al. Neural Networks: A Comprehensive Foundation , 1998 .

[5] R. Palmer,et al. Introduction to the theory of neural computation , 1994, The advanced book program.

[6] Jack P. C. Kleijnen,et al. Bootstrapping and validation of metamodels in simulation , 1998, 1998 Winter Simulation Conference. Proceedings (Cat. No.98CH36274).

[7] M. Stone. Cross‐Validatory Choice and Assessment of Statistical Predictions , 1976 .

[8] M. Koda. Neural network learning based on stochastic sensitivity analysis , 1997, IEEE Trans. Syst. Man Cybern. Part B.

[9] M. Stone. An Asymptotic Equivalence of Choice of Model by Cross‐Validation and Akaike's Criterion , 1977 .

[10] G. Schwarz. Estimating the Dimension of a Model , 1978 .

[11] Masato Koda. Stochastic sensitivity analysis method for neural network learning , 1995 .

[12] S. T. Buckland,et al. An Introduction to the Bootstrap. , 1994 .

[13] Richard O. Duda,et al. Pattern classification and scene analysis , 1974, A Wiley-Interscience publication.

[14] Timothy Masters,et al. Advanced algorithms for neural networks: a C++ sourcebook , 1995 .