MLP network for lung cancer presence prediction based on microarray data

The appearance of the Microarray technology has attracted the scientific community and industry; with its ability of measuring simultaneously the activity and interactions of thousands of genes. This advanced technology was applied for enormous issues such as drug discovery, gene discovery, diagnosis and prognosis of disease and toxicological research. Despite the fact that Microarray applications have known birth in many biological studies, the handling and analysis of the data obtained are not trivial tasks. For these reasons, it has been focused on the present paper on the PCA classification technique and Neural Network for Microarray data; in the object of reducing the large data and producing informative results. The methodology proposes an approach based on MLP neural network to resolve the problem of lung cancer classification based on Microarray data. The approach consists on data reduction by using the PCA Technique, followed by a classification based on MLP network, feed-forward neural network known by its stable learning. The effectiveness of the implemented method was evaluated by measuring the correct classification rate performed on lung cancer gene expression dataset and compared to results obtained by other methods that use the same data.

[1]  Jukka Corander,et al.  Bayesian clustering and feature selection for cancer tissue samples , 2009, BMC Bioinformatics.

[2]  Y. Suneetha,et al.  Review Article: Current Knowledge on Microarray Technology - An Overview , 2012 .

[3]  Hong-Wen Deng,et al.  Gene selection for classification of microarray data based on the Bayes error , 2007, BMC Bioinformatics.

[4]  M Lipkin,et al.  Expression of cloned sequences in biopsies of human colonic tissue and in colonic carcinoma cells induced to differentiate in vitro. , 1987, Cancer research.

[5]  Wen Du,et al.  New Variable Selection Method Using Interval Segmentation Purity with Application to Blockwise Kernel Transform Support Vector Machine Classification of High-Dimensional Microarray Data , 2009, J. Chem. Inf. Model..

[6]  R. Siezen,et al.  others , 1999, Microbial Biotechnology.

[7]  L. Augenlicht,et al.  Patterns of gene expression that characterize the colonic mucosa in patients at genetic risk for colonic cancer. , 1991, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Martin T. Hagan,et al.  Neural network design , 1995 .

[9]  Richard P. Lippmann,et al.  An introduction to computing with neural nets , 1987 .

[10]  Kurt Hornik,et al.  Approximation capabilities of multilayer feedforward networks , 1991, Neural Networks.

[11]  S. Ramaswamy,et al.  Translation of microarray data into clinically relevant cancer diagnostic tests using gene expression ratios in lung cancer and mesothelioma. , 2002, Cancer research.

[12]  Jing Yin,et al.  Artificial neural networks and gene filtering distinguish between global gene expression profiles of Barrett's esophagus and esophageal cancer. , 2002, Cancer research.

[13]  Weida Tong,et al.  Consensus analysis of multiple classifiers using non-repetitive variables: Diagnostic application to microarray gene expression data , 2007, Comput. Biol. Chem..

[14]  G. Garcı́a-Cardeña,et al.  Argus--a new database system for Web-based analysis of multiple microarray data sets. , 2001, Genome research.

[15]  R. Lippmann,et al.  An introduction to computing with neural nets , 1987, IEEE ASSP Magazine.

[16]  Wei-Chung Cheng,et al.  THEME: A web tool for loop-design microarray data analysis , 2012, Comput. Biol. Medicine.

[17]  J. Downing,et al.  Classification, subtype discovery, and prediction of outcome in pediatric acute lymphoblastic leukemia by gene expression profiling. , 2002, Cancer cell.

[18]  Salvatore Petralia,et al.  Recent Advances in DNA Microarray Technology: an Overview on Production Strategies and Detection Methods , 2013 .

[19]  Werner Dubitzky,et al.  A Practical Approach to Microarray Data Analysis , 2003, Springer US.

[20]  Aaron M. Newman,et al.  AutoSOME: a clustering method for identifying gene expression modules without prior knowledge of cluster number , 2010, BMC Bioinformatics.

[21]  Anita Bai,et al.  Feature Extraction and Classification of Microarray Cancer Data Using Intelligent Techniques , 2013, ICACNI.

[22]  Gavin Sherlock,et al.  The Stanford Microarray Database: implementation of new analysis tools and open source release of software , 2002, Nucleic Acids Res..

[23]  Heping Zhang,et al.  Cell and tumor classification using gene expression data: Construction of forests , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[24]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[25]  Dorothea K. Thompson,et al.  Microarray Technology and Applications in Environmental Microbiology , 2004 .

[26]  Constantin F. Aliferis,et al.  A comprehensive evaluation of multicategory classification methods for microarray gene expression cancer diagnosis , 2004, Bioinform..

[27]  Christophe Lemetre,et al.  An introduction to artificial neural networks in bioinformatics - application to complex microarray and mass spectrometry datasets in cancer studies , 2008, Briefings Bioinform..

[28]  Jingjing Liu,et al.  Cancer classification based on microarray gene expression data using a principal component accumulation method , 2011 .

[29]  Helen Parkinson,et al.  Data storage and analysis in ArrayExpress. , 2006, Methods in enzymology.

[30]  Dong Hoon Lim,et al.  Principal Component Analysis using Singular Value Decomposition of Microarray Data , 2013 .

[31]  Michael R. Kosorok,et al.  Identification of differential gene pathways with principal component analysis , 2009, Bioinform..

[32]  E. Kagereki,et al.  Principal component analysis and linear discriminant analysis in gene expression data , 2013 .

[33]  Zongmin Ma,et al.  Database Modeling in Biology: Practices and Challenges , 2007 .