论文信息 - Pattern Selection for Support Vector Classifiers

Pattern Selection for Support Vector Classifiers

SVMs tend to take a very long time to train with a large data set. If "redundant" patterns are identified and deleted in pre-processing, the training time could be reduced significantly. We propose a k-nearest neighbors(k-NN) based pattern selection method. The method tries to select the patterns that are near the decision boundary and that are correctly labeled. The simulations over synthetic data sets showed promising results: (1) By converting a non-separable problem to a separable one, the search for an optimal error tolerance parameter became unnecessary. (2) SVM training time decreased by two orders of magnitude without any loss of accuracy. (3) The redundant SVs were substantially reduced.

Sungzoon Cho | Hyunjung Shin | Hyunjung Shin | Sungzoon Cho

[1] Aníbal R. Figueiras-Vidal,et al. Sample selection via clustering to construct support vector-like classifiers , 1999, IEEE Trans. Neural Networks.

[2] Antônio de Pádua Braga,et al. SVM-KM: speeding SVMs learning with a priori cluster selection and k-means , 2000, Proceedings. Vol.1. Sixth Brazilian Symposium on Neural Networks.

[3] Giles M. Foody,et al. The significance of border training patterns in classification by a feedforward neural network using back propagation learning , 1999 .

[4] Bernhard E. Boser,et al. A training algorithm for optimal margin classifiers , 1992, COLT '92.

[5] Hyunjung Shin. Pattern Selection Using the Bias and Variance of Ensemble , 2001 .

[6] Vladimir Cherkassky,et al. The Nature Of Statistical Learning Theory , 1997, IEEE Trans. Neural Networks.