Comparison of Different Feature Extraction Methods on Classification of Gene Expression Data

It is important to extract the most relevant features of the genetic profiles to determine the health condition of the cellular structure. Early diagnosis of the illnesses has a great importance in the treatment. In this study, we analyzed a gene expression data by classifying using support vector machines after applying different feature extraction methods as principal component analysis (PCA) and independent component analysis (ICA). Results have been compared with the results of the feature extraction algorithm based on genetic algorithm (GA).

[1]  D. Botstein,et al.  Cluster analysis and display of genome-wide expression patterns. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Andrew R. Webb,et al.  Statistical Pattern Recognition , 1999 .

[3]  Marco Richeldi,et al.  ADHOC: a tool for performing effective feature selection , 1996, Proceedings Eighth IEEE International Conference on Tools with Artificial Intelligence.

[4]  Reyyan Yeniterzi,et al.  Evolutionary selection of minimum number of features for classification of gene expression data using genetic algorithms , 2007, GECCO '07.

[5]  Aapo Hyvärinen,et al.  Survey on Independent Component Analysis , 1999 .

[6]  U. Alon,et al.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Shigeo Abe DrEng Pattern Classification , 2001, Springer London.

[8]  Yudong D. He,et al.  Gene expression profiling predicts clinical outcome of breast cancer , 2002, Nature.

[9]  E. Lander,et al.  Gene expression correlates of clinical prostate cancer behavior. , 2002, Cancer cell.

[10]  Robert P. W. Duin,et al.  Parameter Estimation and State Estimation: An Engineering Approach using MATLAB , 2004 .

[11]  Vladimir Vapnik,et al.  Statistical learning theory , 1998 .

[12]  Anil K. Jain,et al.  Dimensionality reduction using genetic algorithms , 2000, IEEE Trans. Evol. Comput..

[13]  Alexander J. Smola,et al.  Learning with kernels , 1998 .

[14]  E. Petricoin,et al.  Use of proteomic patterns in serum to identify ovarian Cancer , 2002 .

[15]  David W. Galbraith,et al.  Global Analysis of Cell Type-Specific Gene Expression , 2003, Comparative and functional genomics.

[16]  Bernhard Schölkopf,et al.  Feature selection for support vector machines by means of genetic algorithm , 2003, Proceedings. 15th IEEE International Conference on Tools with Artificial Intelligence.

[17]  Tianzi Jiang,et al.  A combinational feature selection and ensemble neural network method for classification of gene expression data , 2004, BMC Bioinformatics.

[18]  R. W. Davis,et al.  Discovery and analysis of inflammatory disease-related genes using cDNA microarrays. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[19]  Huiqing Liu,et al.  A comparative study on feature selection and classification methods using gene expression profiles and proteomic patterns. , 2002, Genome informatics. International Conference on Genome Informatics.