Effective Kernel-Based Fuzzy Clustering Systems in Analyzing Cancer Database

The greatest challenge in high-dimensional medical cancer databases is to differentiate the available subtypes due to uncertainty associated with the objects of the database. Recently, mathematical algorithm-based diagnosing system plays an increasingly important role in analyzing the high-dimensional medical cancer databases. Particularly, fuzzy clustering technique has taken a major role in clustering medical cancer databases. The fuzzy clustering techniques are not robust with high-dimensional databases which have more similar objects. Therefore, this paper tries to propose robust fuzzy clustering algorithms for effective analysis of high-dimensional medical database. This paper introduces the objective function of proposed robust fuzzy clustering techniques by incorporating Laplacian kernel-induced distance, Canberra distance, possibilistic memberships, and fuzzy memberships. The proposed methods have been implemented with high-dimensional breast cancer database containing three subclasses which are the leading causes of cancer deaths in the world. Benchmark datasets have been used to evaluate the performance of the proposed methods, and this paper has shown the effectiveness of the proposed methods through clustering accuracy.

[1]  Jun-Dong Chang,et al.  Oncogenes and Subtypes of Diffuse Large B-Cell Lymphoma Discoveries from Microarray Database , 2006, JCIS.

[2]  Lipo Wang,et al.  Gene selection and cancer classification using a fuzzy neural network , 2004, IEEE Annual Meeting of the Fuzzy Information, 2004. Processing NAFIPS '04..

[3]  Mariagrazia Dotoli,et al.  Fuzzy Clustering - A Versatile Mean to Explore Medical Databases. , 2000 .

[4]  Daoqiang Zhang,et al.  Robust image segmentation using FCM with spatial constraints based on new kernel-induced distance measure , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[5]  G. Church,et al.  Systematic determination of genetic network architecture , 1999, Nature Genetics.

[6]  J. Mesirov,et al.  Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Sultan Aljahdali,et al.  A kernelized fuzzy c-means algorithm for automatic magnetic resonance image segmentation , 2009, J. Comput. Methods Sci. Eng..

[8]  Hoda M. O. Mokhtar,et al.  An extended k-means technique for clustering moving objects , 2011 .

[9]  Russell G. Congalton,et al.  Assessing the accuracy of remotely sensed data : principles and practices , 1998 .

[10]  Chih-Hsuan Wang,et al.  Outlier identification and market segmentation using kernel-based clustering techniques , 2009, Expert Syst. Appl..

[11]  James M. Keller,et al.  A possibilistic fuzzy c-means clustering algorithm , 2005, IEEE Transactions on Fuzzy Systems.

[12]  S. R. Kannan,et al.  Extended fuzzy c-means: an analyzing data clustering problems , 2012, Cluster Computing.

[13]  Kathleen Marchal,et al.  Adaptive quality-based clustering of gene expression profiles , 2002, Bioinform..

[14]  P. Rousseeuw Silhouettes: a graphical aid to the interpretation and validation of cluster analysis , 1987 .

[15]  Debahuti Mishra,et al.  A New Meta-heuristic Bat Inspired Classification Approach for Microarray Data , 2012 .

[16]  Elisabetta Binaghi,et al.  A fuzzy set-based accuracy assessment of soft classification , 1999, Pattern Recognit. Lett..

[17]  Bing Liu,et al.  An efficient semi-unsupervised gene selection method via spectral biclustering , 2006, IEEE Transactions on NanoBioscience.

[18]  Hu Yang,et al.  Biomedical data classification using hierarchical clustering , 2004, Canadian Conference on Electrical and Computer Engineering 2004 (IEEE Cat. No.04CH37513).

[19]  M. Jeżewski Application of modified fuzzy clustering to medical data classification , 2011 .

[20]  Daewon Lee,et al.  Dynamic Characterization of Cluster Structures for Robust and Inductive Support Vector Clustering , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[21]  Jiye Liang,et al.  An initialization method to simultaneously find initial cluster centers and the number of clusters for clustering categorical data , 2011, Knowl. Based Syst..

[22]  Paulo Novais,et al.  A visual analytics framework for cluster analysis of DNA microarray data , 2013, Expert Syst. Appl..

[23]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[24]  Mika Pollari,et al.  Semi-automatic liver tumor segmentation with hidden Markov measure field model and non-parametric distribution estimation , 2012, Medical Image Anal..

[25]  Rama Krishnaiah,et al.  Data Analysis of Bio-Medical Data Mining using Enhanced Hierarchical Agglomerative Clustering , 2012 .

[26]  Xiaowen Li,et al.  Performance research of Gaussian function weighted fuzzy C-means algorithm , 2007, International Symposium on Multispectral Image Processing and Pattern Recognition.

[27]  Dervis Karaboga,et al.  Fuzzy clustering with artificial bee colony algorithm , 2010 .

[28]  Ashish Ghosh,et al.  Fuzzy clustering algorithms incorporating local information for change detection in remotely sensed images , 2012, Appl. Soft Comput..

[29]  Andrea Schenone,et al.  A fuzzy clustering based segmentation system as support to diagnosis in medical imaging , 1999, Artif. Intell. Medicine.

[30]  R. Lunetta,et al.  Remote sensing and Geographic Information System data integration: error sources and research issues , 1991 .

[31]  James C. Bezdek,et al.  Pattern Recognition with Fuzzy Objective Function Algorithms , 1981, Advanced Applications in Pattern Recognition.