The training of neural classifiers with condensed datasets

In this paper we apply a k-nearest-neighbor-based data condensing algorithm to the training set of multilayer perceptron neural networks. By removing overlapping data and retaining only the training exemplars adjacent to the decision boundary, we are able to reduce network training time significantly while attaining a misclassification rate no worse than that of a network trained on the unedited training set. Results on a range of synthetic and real datasets indicate that a training speed-up of an order of magnitude is typical.
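To make the condensing step concrete, the sketch below shows one plausible k-nearest-neighbor-based reduction pipeline: a Wilson-style editing pass that removes points lying in class-overlap regions, followed by a Hart-style condensing pass that retains exemplars near the decision boundary. The specific passes, the choice of k, and the helper name `condense_training_set` are illustrative assumptions; the paper's exact algorithm and parameters are not reproduced here.

```python
import numpy as np
from sklearn.neighbors import KNeighborsClassifier

def condense_training_set(X, y, k=3):
    """Return a reduced (X, y) intended to retain only boundary-adjacent exemplars.

    Assumes y contains integer class labels. Illustrative sketch only.
    """
    # 1. Wilson-style editing: drop points whose k nearest neighbours (excluding
    #    the point itself) disagree with their label -- these sit in overlap regions.
    knn = KNeighborsClassifier(n_neighbors=k + 1).fit(X, y)
    neigh_idx = knn.kneighbors(X, return_distance=False)[:, 1:]  # drop self-match
    majority = np.array([np.bincount(y[idx]).argmax() for idx in neigh_idx])
    keep = majority == y
    X_e, y_e = X[keep], y[keep]

    # 2. Hart-style condensing: greedily add points that a 1-NN classifier built
    #    from the current subset misclassifies; such points tend to lie near the
    #    decision boundary, so the final subset approximates it.
    subset = [0]
    changed = True
    while changed:
        changed = False
        nn = KNeighborsClassifier(n_neighbors=1).fit(X_e[subset], y_e[subset])
        for i in range(len(X_e)):
            if i in subset:
                continue
            if nn.predict(X_e[i:i + 1])[0] != y_e[i]:
                subset.append(i)
                nn = KNeighborsClassifier(n_neighbors=1).fit(X_e[subset], y_e[subset])
                changed = True
    return X_e[subset], y_e[subset]
```

The reduced set returned by such a routine would then be used in place of the full training set when fitting the multilayer perceptron, which is the source of the reported speed-up.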