Feature selection and classification by using grid computing based evolutionary approach for the microarray data

The cancer classification through gene expression patterns becomes one of the most promising applications of the microarray technology. It is also a significant procedure in bioinformatics. In this study a grid computing based evolutionary mining approach is proposed as discriminant function for gene selection and tumor classification. The proposed approach is based on the grid computing infrastructure for establishing the best attributes set. The discriminant analysis based on vector distant of median method as the evaluation function of genetic algorithm which lays stress on find the key attributes set of the data set to establish the best attributes set for constructing a classification response model with highest accuracy. We show experimentally that the proposed approach for several benchmarking cancer microarray data sets can work effectively and efficiently, and the results of the proposed methods are superior to or as well as other existing methods in literatures.

[1]  Nello Cristianini,et al.  Support vector machine classification and validation of cancer tissue samples using microarray expression data , 2000, Bioinform..

[2]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[3]  Zbigniew Michalewicz,et al.  Genetic Algorithms + Data Structures = Evolution Programs , 1996, Springer Berlin Heidelberg.

[4]  Thomas A. Darden,et al.  Gene selection for sample classification based on gene expression data: study of sensitivity to choice of parameters of the GA/KNN method , 2001, Bioinform..

[5]  P. Brown,et al.  DNA arrays for analysis of gene expression. , 1999, Methods in enzymology.

[6]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[7]  Sung-Bae Cho Exploring Features and Classifiers to Classify Gene Expression Profiles of Acute Leukemia , 2002, Int. J. Pattern Recognit. Artif. Intell..

[8]  Wei Kong,et al.  Hybrid particle swarm optimization and tabu search approach for selecting genes for tumor classification using gene expression data , 2008, Comput. Biol. Chem..

[9]  Huan Liu,et al.  Redundancy based feature selection for microarray data , 2004, KDD.

[10]  Sung-Bae Cho,et al.  The classification of cancer based on DNA microarray data that uses diverse ensemble genetic programming , 2006, Artif. Intell. Medicine.

[11]  Heping Zhang,et al.  Cell and tumor classification using gene expression data: Construction of forests , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[12]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[13]  Lucila Ohno-Machado,et al.  An Epicurean learning approach to gene-expression data classification , 2003, Artif. Intell. Medicine.

[14]  U. Schmidt,et al.  Cancer diagnosis and microarrays. , 2003, The international journal of biochemistry & cell biology.

[15]  Sung-Bae Cho,et al.  Evolutionary Computation for Optimal Ensemble Classifier in Lymphoma Cancer Classification , 2003, ISMIS.

[16]  William Perrizo,et al.  Comprehensive vertical sample-based KNN/LSVM classification for gene expression analysis , 2004, J. Biomed. Informatics.

[17]  Jiawei Han,et al.  Cancer classification using gene expression data , 2003, Inf. Syst..

[18]  Daniel Q. Naiman,et al.  Simple decision rules for classifying human cancers from gene expression profiles , 2005, Bioinform..

[19]  Fillia Makedon,et al.  HykGene: a hybrid approach for selecting marker genes for phenotype classification using microarray gene expression data , 2005, Bioinform..

[20]  Yuehui Chen,et al.  A novel ensemble of classifiers for microarray data classification , 2008, Appl. Soft Comput..

[21]  Joaquín Dopazo,et al.  Using a Genetic Algorithm and a Perceptron for Feature Selection and Supervised Class Learning in DNA Microarray Data , 2003, Artificial Intelligence Review.

[22]  Yong Xu,et al.  Neuro-Fuzzy Ensemble Approach for Microarray Cancer Gene Expression Data Analysis , 2006, 2006 International Symposium on Evolving Fuzzy Systems.

[23]  Lucila Ohno-Machado,et al.  A primer on gene expression and microarrays for machine learning researchers , 2004, J. Biomed. Informatics.

[24]  J. Stuart Aitken,et al.  Feature selection and classification for microarray data analysis: Evolutionary methods for identifying predictive genes , 2005, BMC Bioinformatics.

[25]  C. Rosenow,et al.  Monitoring gene expression using DNA microarrays. , 2000, Current opinion in microbiology.