论文信息 - Mining Very Large Datasets with Support Vector Machine Algorithms

Mining Very Large Datasets with Support Vector Machine Algorithms

In this paper, we present new support vector machines (SVM) algorithms that can be used to classify very large datasets on standard personal computers. The algorithms have been extended from three recent SVMs algorithms: least squares SVM classification, finite Newton method for classification and incremental proximal SVM classification. The extension consists in building incremental, parallel and distributed SVMs for classification. Our three new algorithms are very fast and can handle very large datasets. An example of the effectiveness of these new algorithms is given with the classification into two classes of one billion points in 10-dimensional input space in some minutes on ten personal computers (800 MHz Pentium III, 256 MB RAM, Linux).

François Poulet | Thanh-Nghi Do

[1] Huan Liu,et al. Handling concept drifts in incremental learning with support vector machines , 1999, KDD '99.

[2] Kristin P. Bennett,et al. Support vector machines: hype or hallelujah? , 2000, SKDD.

[3] Vladimir N. Vapnik,et al. The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[4] Glenn Fung,et al. Incremental Support Vector Machine Classification , 2002, SDM.

[5] Ramasamy Uthurusamy,et al. EVOLVING DATA MINING INTO SOLUTIONS FOR INSIGHTS , 2002 .

[6] Stefan Rüping,et al. Incremental Learning with Support Vector Machines , 2001, ICDM.

[7] Nello Cristianini,et al. An Introduction to Support Vector Machines and Other Kernel-based Learning Methods , 2000 .

[8] Gert Cauwenberghs,et al. Incremental and Decremental Support Vector Machine Learning , 2000, NIPS.

[9] Glenn Fung,et al. Finite Newton method for Lagrangian support vector machine classification , 2003, Neurocomputing.

[10] Glenn Fung,et al. Proximal support vector machine classifiers , 2001, KDD '01.

[11] Ramasamy Uthurusamy,et al. Evolving data into mining solutions for insights , 2002, CACM.

[12] Johan A. K. Suykens,et al. Least Squares Support Vector Machine Classifiers , 1999, Neural Processing Letters.