An application approach of “cluster-classification” in cancer scan images and gene expressions

Medical professionals need a reliable prediction methodology to diagnose cancer and distinguish between the different stages in cancer. Classification is a data mining techniques it mainly classify the dataset based on certain specific criteria. Clustering is another type grouping based on the similarity. These algorithms are applied to cancer dataset to group the patients into either “Carcinoma in situ” (beginning stage) or “Malignant potential” group. Pre-processing techniques have been applied to prepare the raw dataset and identify the relevant attributes for classification. Random test samples have been selected from the pre-processed data to obtain the results. The results are presented and discussed.

[1]  J. Mesirov,et al.  Interpreting patterns of gene expression with self-organizing maps: methods and application to hematopoietic differentiation. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Ricardo J. G. B. Campello,et al.  Proximity Measures for Clustering Gene Expression Microarray Data: A Validation Methodology and a Comparative Analysis , 2013, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[3]  Roded Sharan,et al.  CLICK: A Clustering Algorithm for Gene Expression Analysis , 2000, ISMB 2000.

[4]  Laurie J. Heyer,et al.  Exploring expression data: identification and analysis of coexpressed genes. , 1999, Genome research.

[5]  Russell Greiner,et al.  Quick detection of brain tumors and edemas: A bounding box method using symmetry , 2012, Comput. Medical Imaging Graph..

[6]  Pradipta Maji,et al.  Rough-Fuzzy Clustering for Grouping Functionally Similar Genes from Microarray Data , 2013, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[7]  Siripun Sanguansintukul,et al.  Clustering Analysis of Water Quality for Canals in Bangkok, Thailand , 2010, ICCSA.

[8]  Jonathan M. Garibaldi,et al.  Robust Bayesian Clustering for Replicated Gene Expression Data , 2012, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[9]  C. Nandini,et al.  Comprehensive framework to gait recognition , 2008, Int. J. Biom..

[10]  Sankar K. Pal,et al.  Rough-Fuzzy Pattern Recognition: Applications in Bioinformatics and Medical Imaging , 2012 .

[11]  S. Pal,et al.  Clustering Functionally Similar Genes from Microarray Data , 2012 .

[12]  Pradipta Maji,et al.  Microarray Time-Series Data Clustering Using Rough-Fuzzy C-Means Algorithm , 2011, 2011 IEEE International Conference on Bioinformatics and Biomedicine.

[13]  Pradipta Maji,et al.  Rough set based maximum relevance-maximum significance criterion and Gene selection from microarray data , 2011, Int. J. Approx. Reason..

[14]  Ron Shamir,et al.  Clustering Gene Expression Patterns , 1999, J. Comput. Biol..

[15]  Ron Shamir,et al.  A clustering algorithm based on graph connectivity , 2000, Inf. Process. Lett..

[16]  Richard M. Karp,et al.  CLIFF: clustering of high-dimensional microarray data via iterative feature filtering using normalized cuts , 2001, ISMB.

[17]  Adrian E. Raftery,et al.  How Many Clusters? Which Clustering Method? Answers Via Model-Based Cluster Analysis , 1998, Comput. J..

[18]  Pradipta Maji,et al.  Mutual Information-Based Supervised Attribute Clustering for Microarray Sample Classification , 2012, IEEE Transactions on Knowledge and Data Engineering.

[19]  Jane You,et al.  SC³: Triple Spectral Clustering-Based Consensus Clustering Framework for Class Discovery from Cancer Gene Expression Profiles , 2012, TCBB.