Unlabeled data classification via support vector machines and k-means clustering

Support vector machines (SVMS), a powerful machine method developed from statistical learning and have made significant achievement in some field. Introduced in the early 90's, they led to an explosion of interest in machine learning. However, like most machine learning algorithms, they are generally applied using a selected training set classified in advance. With the repaid development of the Internet and telecommunication, huge of information has been produced as digital data format, generally the data is unlabeled. It is impossible to classify the data with one's own hand one by one in many realistic problems, so that the research on unlabeled data classification has been grown. Improvements in databases technology, computing performance and artificial intelligence have contributed to the development of intelligent data analysis. A SVM classifier based on k-means algorithm is presented for the classification of unlabeled data.

[1]  David R. Musicant,et al.  Data Discrimination via Nonlinear Generalized Support Vector Machines , 2001 .

[2]  Alexander J. Smola,et al.  Support Vector Regression Machines , 1996, NIPS.

[3]  Federico Girosi,et al.  Training support vector machines: an application to face detection , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[4]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[5]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[6]  Bernhard Schölkopf,et al.  New Support Vector Algorithms , 2000, Neural Computation.

[7]  Christopher J. C. Burges,et al.  A Tutorial on Support Vector Machines for Pattern Recognition , 1998, Data Mining and Knowledge Discovery.

[8]  O. Mangasarian,et al.  Massive data discrimination via linear support vector machines , 2000 .

[9]  Olvi L. Mangasarian,et al.  Mathematical Programming in Data Mining , 1997, Data Mining and Knowledge Discovery.

[10]  Thorsten Joachims,et al.  Text categorization with support vector machines , 1999 .