Comparison of Gene Identification Based on Artificial Neural Network Pre-processing with k-Means Cluster and Principal Component Analysis

A combination of gene ranking, dimensional reduction, and recursive feature elimination (RFE) using a BP-MLP artificial neural network (ANN) was used to select genes for DNA microarray classification. Use of k-means cluster analysis for dimensional reduction and maximum sensitivity for RFE resulted in 64-gene models with fewer invariant and correlated features when compared with PCA and mimimum error. In conclusion, k-means cluster analysis and sensitivity may be better suited for classifying diseases for which gene expression is more strongly influenced by pathway heterogeneity.

[1]  Tao Li,et al.  A comparative study of feature selection and multiclass classification methods for tissue classification based on gene expression , 2004, Bioinform..

[2]  Li Song,et al.  Neural network analysis of lymphoma microarray data: prognosis and diagnosis near-perfect , 2003, BMC Bioinformatics.

[3]  J. Sudbø,et al.  Gene-expression profiles in hereditary breast cancer. , 2001, The New England journal of medicine.

[4]  Tianzi Jiang,et al.  A combinational feature selection and ensemble neural network method for classification of gene expression data , 2004, BMC Bioinformatics.

[5]  Taizo Hanai,et al.  Fuzzy Neural Network Applied to Gene Expression Profiling for Predicting the Prognosis of Diffuse Large B‐cell Lymphoma , 2002, Japanese journal of cancer research : Gann.

[6]  Simon Parsons,et al.  Principles of Data Mining by David J. Hand, Heikki Mannila and Padhraic Smyth, MIT Press, 546 pp., £34.50, ISBN 0-262-08290-X , 2004, The Knowledge Engineering Review.

[7]  Silvio Bicciato,et al.  Pattern identification and classification in gene expression data using an autoassociative neural network model. , 2003, Biotechnology and bioengineering.

[8]  M. Ringnér,et al.  Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks , 2001, Nature Medicine.

[9]  Lucila Ohno-Machado,et al.  A neural network-based similarity index for clustering DNA microarray data , 2003, Comput. Biol. Medicine.

[10]  Qing-Rong Chen,et al.  Prediction of Clinical Outcome Using Gene Expression Profiling and Artificial Neural Networks for Patients with Neuroblastoma , 2004, Cancer Research.

[11]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[12]  Frank Berthold,et al.  Erratum: Prediction of clinical outcome using gene expression profiling and artificial neural networks for patients with neuroblastoma (Cancer Research (October 1, 2004) 64 (6883-6891)) , 2005 .

[13]  A. L. Tarca,et al.  A robust neural networks approach for spatial and intensity-dependent normalization of cDNA microarray data , 2005, Bioinform..

[14]  Chen-Khong Tham,et al.  Predicting Risk of Coronary Artery Disease from Dna Microarray-based Genotyping Using Neural Networks and Other Statistical Analysis Tool , 2003, J. Bioinform. Comput. Biol..