Large Scale Classification with Support Vector Machine Algorithms

Boosting of least-squares support vector machine (LS-SVM) algorithms can classify large datasets on standard personal computers (PCs). We extend the LS-SVM proposed by Suykens and Vandewalle in several ways to efficiently classify large datasets. We developed a row-incremental version for datasets with billions of data points and up to 10,000 dimensions. By adding a Tikhonov regularization term and using the Sherman-Morrison-Woodbury formula, we developed a column-incremental LS-SVM to process datasets with a small number of data points but very high dimensionality. Finally, by applying boosting to these incremental LS-SVM algorithms, we developed classification algorithms for massive, very-high-dimensional datasets, and we also applied these ideas to build boosting of other efficient SVM algorithms proposed by Mangasarian, including Lagrange SVM (LSVM), proximal SVM (PSVM) and Newton SVM (NSVM). Numerical test results on UCI, RCV1- binary, Reuters-21578, Forest cover type and KDD cup 1999 datasets showed that our algorithms are often significantly faster and/or more accurate than state-of- the-art algorithms LibSVM, SVM-perf and CB-SVM.

[1]  David R. Musicant,et al.  Lagrangian Support Vector Machines , 2001, J. Mach. Learn. Res..

[2]  Jiawei Han,et al.  Classifying large data sets using SVMs with hierarchical clusters , 2003, KDD '03.

[3]  Johan A. K. Suykens,et al.  Automatic relevance determination for Least Squares Support Vector Machines classifiers , 2001, ESANN.

[4]  Robert E. Schapire,et al.  How boosting the margin can also boost classifier complexity , 2006, ICML.

[5]  Yoav Freund,et al.  A decision-theoretic generalization of on-line learning and an application to boosting , 1995, EuroCOLT.

[7]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[8]  Nello Cristianini,et al.  An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[9]  François Poulet,et al.  Towards High Dimensional Data Mining with Boosting of PSVM and Visualization Tools , 2004, ICEIS.

[10]  François Poulet,et al.  Classifying one billion data with a new distributed svm algorithm , 2006, 2006 International Conference onResearch, Innovation and Vision for the Future.

[11]  Bernhard E. Boser,et al.  A training algorithm for optimal margin classifiers , 1992, COLT '92.

[12]  Olvi L. Mangasarian,et al.  A Finite Newton Method for Classi cation Problems , 2001 .

[13]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[14]  Glenn Fung,et al.  Incremental Support Vector Machine Classification , 2002, SDM.

[15]  L. Breiman Arcing Classifiers , 1998 .

[16]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[17]  Gert Cauwenberghs,et al.  Incremental and Decremental Support Vector Machine Learning , 2000, NIPS.

[18]  Daphne Koller,et al.  Support Vector Machine Active Learning with Applications to Text Classification , 2000, J. Mach. Learn. Res..

[19]  Thorsten Joachims,et al.  Training linear SVMs in linear time , 2006, KDD '06.