A Multiobjective Simultaneous Learning Framework for Clustering and Classification

Traditional pattern recognition involves two tasks: clustering learning and classification learning. Clustering results can enhance the generalization ability of classification learning, while class information can improve the accuracy of clustering learning. Hence, the two learning methods complement each other. To combine their advantages, many existing algorithms fuse them sequentially, first optimizing the clustering criterion and then optimizing the classification criterion on the resulting clusters. However, such algorithms cannot achieve simultaneous optimality for the two criteria, and thus have to sacrifice either clustering performance or classification performance. To overcome this problem, we present a multiobjective simultaneous learning framework (MSCC) for both clustering and classification learning. MSCC uses multiple objective functions to formulate the clustering and classification problems, respectively, and, more importantly, employs Bayesian theory to make all of these functions depend on the same set of parameters, i.e., the clustering centers, which act as a bridge connecting clustering and classification learning. By simultaneously optimizing the clustering centers embedded in these functions, both effective clustering performance and promising classification performance can be attained. Furthermore, the multiple Pareto-optimal solutions obtained by MSCC lead to an interesting observation: the clustering and classification learning processes are complementary to a great extent. Empirical results on both synthetic and real data sets demonstrate the effectiveness and potential of MSCC.
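To make the idea of two objectives sharing one parameter set concrete, the following is a minimal sketch and not the MSCC formulation itself. It assumes hard nearest-center assignments, a mean squared distance clustering cost, and a class-given-cluster estimate obtained by counting labels inside each cluster; the function names `objectives` and `pareto_front` are hypothetical and chosen only for illustration. Both scores are computed from the cluster centers alone, and a simple filter keeps the non-dominated candidates.

```python
import numpy as np

def objectives(X, y, centers, n_classes):
    """Return (clustering cost, classification error) for one set of centers.

    Illustrative assumptions: hard assignments and count-based estimates of
    P(class | cluster); these are simplifications, not the paper's method.
    """
    # Squared Euclidean distance from every sample to every center.
    d2 = ((X[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
    assign = d2.argmin(axis=1)
    # Clustering objective: mean squared distance to the assigned center.
    j_cluster = float(d2[np.arange(len(X)), assign].mean())
    # Label counts per cluster, proportional to P(class | cluster).
    counts = np.zeros((len(centers), n_classes))
    np.add.at(counts, (assign, y), 1.0)
    # Each cluster predicts its most probable class.
    cluster_label = counts.argmax(axis=1)
    j_class = float((cluster_label[assign] != y).mean())
    return j_cluster, j_class

def pareto_front(scores):
    """Indices of non-dominated rows when every column is minimized."""
    scores = np.asarray(scores, dtype=float)
    keep = []
    for i, s in enumerate(scores):
        no_worse = np.all(scores <= s, axis=1)
        better = np.any(scores < s, axis=1)
        if not np.any(no_worse & better):
            keep.append(i)
    return keep

# Toy data: two well-separated groups with matching class labels.
X = np.array([[0.0, 0.0], [0.0, 1.0], [10.0, 10.0], [10.0, 11.0]])
y = np.array([0, 0, 1, 1])
centers = np.array([[0.0, 0.5], [10.0, 10.5]])
j_cluster, j_class = objectives(X, y, centers, n_classes=2)
front = pareto_front([[1, 2], [2, 1], [2, 2], [3, 3]])
```

In a multiobjective search, each candidate set of centers would be scored this way and only the non-dominated candidates retained, which yields a set of trade-off solutions rather than a single optimum.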
