Evolutionary Computational Algorithm by Blending of PPCA and EP-Enhanced Supervised Classifier for Microarray Gene Expression Data

In DNA microarray technology, gene classification is difficult because the data are characterized by high dimensionality and small sample size. Classifying tissue samples in such high-dimensional problems is a complicated task. Furthermore, microarray data are highly redundant, and many genes carry information that is irrelevant to the accurate classification of diseases or phenotypes. Consequently, an efficient classification technique is needed to extract gene information from microarray experimental data. In this paper, a classification technique for microarray gene expression data is proposed. In the proposed technique, the dimensionality of the gene expression dataset is first reduced by Probabilistic Principal Component Analysis (PPCA). An Artificial Neural Network (ANN) is then selected as the supervised classifier and enhanced using the Evolutionary Programming (EP) technique. The enhancement is accomplished by optimizing the dimension of the ANN. The enhanced classifier is trained using the Back Propagation (BP) algorithm, which minimizes the BP error. The trained ANN can then assign gene expression data to the associated classes. The proposed technique is evaluated by its classification performance on two cancer classes, Acute Myeloid Leukemia (AML) and Acute Lymphoblastic Leukemia (ALL). The classification performance of the enhanced ANN classifier is compared with that of the existing ANN classifier and an SVM classifier.
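The pipeline described above (PPCA reduction, EP search over the ANN dimension, BP training) can be sketched roughly as follows. This is a minimal illustration under stated assumptions, not the authors' implementation: the "dimension of the ANN" is taken to mean the number of hidden units in a single hidden layer, the EP step uses Gaussian mutation with simple truncation selection, the data are synthetic rather than the AML/ALL dataset, and all function names (`ppca_reduce`, `train_bp`, `evolve_hidden_size`) and parameter values are hypothetical.

```python
import numpy as np

def ppca_reduce(X, q):
    """Map rows of X to q latent dimensions with the closed-form PPCA solution."""
    n, d = X.shape
    Xc = X - X.mean(axis=0)
    # SVD of the centered data avoids forming the d x d covariance matrix.
    _, s, Vt = np.linalg.svd(Xc, full_matrices=False)
    eig = s ** 2 / n                              # nonzero covariance eigenvalues
    sigma2 = eig[q:].sum() / (d - q)              # mean variance of discarded directions
    W = Vt[:q].T * np.sqrt(np.maximum(eig[:q] - sigma2, 0.0))
    M = W.T @ W + sigma2 * np.eye(q)
    return np.linalg.solve(M, W.T @ Xc.T).T       # posterior mean of the latent variables

def sigmoid(z):
    return 1.0 / (1.0 + np.exp(-z))

def train_bp(Z, y, hidden, rng, epochs=300, lr=0.5):
    """Train a one-hidden-layer network by back propagation; return weights and final MSE."""
    W1 = rng.normal(0, 0.5, (Z.shape[1], hidden))
    b1 = np.zeros(hidden)
    W2 = rng.normal(0, 0.5, (hidden, 1))
    b2 = np.zeros(1)
    t = y.reshape(-1, 1)
    for _ in range(epochs):
        h = sigmoid(Z @ W1 + b1)                  # forward pass
        out = sigmoid(h @ W2 + b2)
        d_out = (out - t) * out * (1 - out)       # backward pass for squared error
        d_h = (d_out @ W2.T) * h * (1 - h)
        W2 -= lr * h.T @ d_out / len(Z)
        b2 -= lr * d_out.mean(axis=0)
        W1 -= lr * Z.T @ d_h / len(Z)
        b1 -= lr * d_h.mean(axis=0)
    out = sigmoid(sigmoid(Z @ W1 + b1) @ W2 + b2)
    return (W1, b1, W2, b2), float(np.mean((out - t) ** 2))

def evolve_hidden_size(Z, y, rng, pop_size=6, generations=5, max_hidden=20):
    """EP-style search (mutation only) for the hidden-layer size with the lowest BP error."""
    pop = rng.integers(1, max_hidden + 1, size=pop_size)
    fit = np.array([train_bp(Z, y, int(h), rng)[1] for h in pop])
    for _ in range(generations):
        # Each parent yields one offspring through Gaussian mutation of its size.
        step = np.rint(rng.normal(0, 2, pop_size)).astype(int)
        kids = np.clip(pop + step, 1, max_hidden)
        kid_fit = np.array([train_bp(Z, y, int(h), rng)[1] for h in kids])
        # Keep the best pop_size individuals among parents and offspring.
        both = np.concatenate([pop, kids])
        both_fit = np.concatenate([fit, kid_fit])
        keep = np.argsort(both_fit)[:pop_size]
        pop, fit = both[keep], both_fit[keep]
    return int(pop[0]), float(fit[0])

# Synthetic stand-in for a microarray matrix: few samples, many genes.
rng = np.random.default_rng(0)
n, d = 40, 500
y = np.repeat([0, 1], n // 2)
X = rng.normal(size=(n, d))
X[y == 1, :30] += 2.0                             # 30 hypothetical informative genes

Z = ppca_reduce(X, q=5)
best_hidden, best_err = evolve_hidden_size(Z, y, rng)
print("selected hidden units:", best_hidden, "training MSE:", best_err)
```

The sketch reports training error only. A faithful evaluation would hold out test samples or use cross-validation, since small-sample microarray data make overfitting likely.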
