A manifold learning framework for both clustering and classification

In recent years, a great deal of manifold clustering algorithms was presented to identify the subsets of the manifolds data. Meanwhile, numerous classification algorithms were also developed to classified data shaped in the form of manifold. However, nearly none of them pay attention to the statistical relationship between the manifold structures and class labels, thus failing to discover the knowledge concealed in data. In this paper, a manifold learning framework for both clustering and classification is presented, which involves two steps. In the first step, the clustering through ranking on manifolds is executed to explore structures in data; in the second step, the class posterior probability is calculated by using the Bayesian rule. The core of this framework lies in employing the Bayesian theory to establish the relationship between manifolds and classes thus creates a bridge between clustering learning and classification learning. Our new manifold learning framework is interesting from a number of perspectives: (1) our algorithm can perform manifold clustering learning which can auto-determine the clustering parameters without manual determining; (2) our algorithm can perform manifold classification learning which models the posterior probabilities p ( ω l | x i ) by using the Bayesian rule; (3) our algorithm can provide the statistical relationship between the manifold structure and the given classes. Encouraging experimental results are obtained on 2 artificial and 16 real-life benchmark datasets.

[1]  Ulrike von Luxburg,et al.  A tutorial on spectral clustering , 2007, Stat. Comput..

[2]  José R. Dorronsoro,et al.  Finding Optimal Model Parameters by Discrete Grid Search , 2008, Innovations in Hybrid Intelligent Systems.

[3]  Olatz Arbelaitz,et al.  An extensive comparative study of cluster validity indices , 2013, Pattern Recognit..

[4]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[5]  Ching Y. Suen,et al.  A trainable feature extractor for handwritten digit recognition , 2007, Pattern Recognit..

[6]  Peng Hao,et al.  Transfer learning using computational intelligence: A survey , 2015, Knowl. Based Syst..

[7]  Javier Montero,et al.  Fuzzy image segmentation based upon hierarchical clustering , 2015, Knowl. Based Syst..

[8]  Bernhard Schölkopf,et al.  Learning with Local and Global Consistency , 2003, NIPS.

[9]  Hao Yu,et al.  Advantages of Radial Basis Function Networks for Dynamic System Design , 2011, IEEE Transactions on Industrial Electronics.

[10]  Markus Breitenbach,et al.  Clustering through ranking on manifolds , 2005, ICML '05.

[11]  J. Bezdek,et al.  FCM: The fuzzy c-means clustering algorithm , 1984 .

[12]  Shigeo Abe Training of Support Vector Machines with Mahalanobis Kernels , 2005, ICANN.

[13]  Daoqiang Zhang,et al.  Robust fuzzy relational classifier incorporating the soft class labels , 2007, Pattern Recognit. Lett..

[14]  Yongzhao Zhan,et al.  Improved pseudo nearest neighbor classification , 2014, Knowl. Based Syst..

[15]  Chang-Hwan Lee A gradient approach for value weighted classification learning in naive Bayes , 2015, Knowl. Based Syst..

[16]  Mohammad Hossein Fazel Zarandi,et al.  A new cluster validity measure based on general type-2 fuzzy sets: Application in gene expression data clustering , 2014, Knowl. Based Syst..

[17]  Zoran Zivkovic,et al.  Improved adaptive Gaussian mixture model for background subtraction , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[18]  Donald K. Wedding,et al.  Discovering Knowledge in Data, an Introduction to Data Mining , 2005, Inf. Process. Manag..

[19]  Michael I. Jordan,et al.  On Spectral Clustering: Analysis and an algorithm , 2001, NIPS.

[20]  Anil K. Jain Data clustering: 50 years beyond K-means , 2008, Pattern Recognit. Lett..

[21]  Paul S. Bradley,et al.  k-Plane Clustering , 2000, J. Glob. Optim..

[22]  Jiye Liang,et al.  An initialization method to simultaneously find initial cluster centers and the number of clusters for clustering categorical data , 2011, Knowl. Based Syst..

[23]  Michael Biehl,et al.  Dynamics and Generalization Ability of LVQ Algorithms , 2007, J. Mach. Learn. Res..

[24]  Mikhail Belkin,et al.  Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples , 2006, J. Mach. Learn. Res..

[25]  Seung-Yeon Kim,et al.  Prediction of protein solvent accessibility using fuzzy k-nearest neighbor method , 2005, Bioinform..

[26]  Felipe Cucker,et al.  On the mathematical foundations of learning , 2001 .

[27]  David G. Stork,et al.  Pattern Classification , 1973 .

[28]  Junfei Qiao,et al.  Adaptive Computation Algorithm for RBF Neural Network , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[29]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[30]  Mohammad Reza Daliri,et al.  Classification of silhouettes using contour fragments , 2009, Comput. Vis. Image Underst..

[31]  Daoqiang Zhang,et al.  Robust image segmentation using FCM with spatial constraints based on new kernel-induced distance measure , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[32]  Catherine Blake,et al.  UCI Repository of machine learning databases , 1998 .

[33]  Magne Setnes,et al.  Fuzzy relational classifier trained by fuzzy clustering , 1999, IEEE Trans. Syst. Man Cybern. Part B.

[34]  Pablo A. Estévez,et al.  A review of learning vector quantization classifiers , 2013, Neural Computing and Applications.

[35]  Dao-Qiang Zhang,et al.  A novel kernelized fuzzy C-means algorithm with application in medical image segmentation , 2004, Artif. Intell. Medicine.

[36]  Ivor W. Tsang,et al.  Large-Scale Sparsified Manifold Regularization , 2006, NIPS.

[37]  Anil K. Jain Data clustering: 50 years beyond K-means , 2010, Pattern Recognit. Lett..

[38]  Robert M. Pap,et al.  Handbook of neural computing applications , 1990 .

[39]  Mohammad Reza Daliri,et al.  Shape and texture clustering: Best estimate for the clusters number , 2009, Image Vis. Comput..

[40]  Vikas Sindhwani,et al.  On Manifold Regularization , 2005, AISTATS.

[41]  Mohammad Reza Daliri,et al.  Shape recognition based on Kernel-edit distance , 2010, Comput. Vis. Image Underst..

[42]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[43]  Wei Wang,et al.  Recommender system application developments: A survey , 2015, Decis. Support Syst..

[44]  Bin Chen,et al.  Plane-Gaussian artificial neural network , 2011, Neural Computing and Applications.