Stacked Autoencoder for classification of glioma grade III and grade IV

Abstract Invention of the microarray technology has rendered it possible to inspect the whole genome at once in cancer classification. However, in order to curtail the computational complexity and augment the accuracy of cancer classification, it is essential to sift the vast microarray data for the informative genes. In this paper, Thresholding and Ratio methods are presented, individually as well as conjointly (hybrid method) to choose optimal gene subset from the microarray data. Moreover, Discrete Wavelet Transform (DWT) is deployed to pare the size of microarray data still further. The classification is accomplished by using various neural network algorithms and Stacked Autoencoder algorithm. The results of classification are compared for number of thresholds, ratios, wavelets and classification algorithms. It is observed that the Stacked Autoencoder network trained by Back Propagation algorithm delivers the best results in terms of classification accuracy and number of genes.

[1]  M. J. D. Powell,et al.  Restart procedures for the conjugate gradient method , 1977, Math. Program..

[2]  Hao Yu,et al.  Neural Network Learning Without Backpropagation , 2010, IEEE Transactions on Neural Networks.

[3]  Michael S. Lew,et al.  Deep learning for visual understanding: A review , 2016, Neurocomputing.

[4]  Y. Hasegawa,et al.  EGFR-TKI resistance due to BIM polymorphism can be circumvented in combination with HDAC inhibition. , 2013, Cancer research.

[5]  Martin A. Riedmiller,et al.  A direct adaptive method for faster backpropagation learning: the RPROP algorithm , 1993, IEEE International Conference on Neural Networks.

[6]  Jane Fridlyand,et al.  Reversing HOXA9 oncogene activation by PI3K inhibition: epigenetic mechanism and prognostic significance in human glioblastoma. , 2010, Cancer research.

[7]  Mohamed A. Ismail,et al.  A novel ensemble selection method for cancer diagnosis using microarray datasets , 2012, 2012 IEEE 12th International Conference on Bioinformatics & Bioengineering (BIBE).

[8]  Xi Chen,et al.  Identification and Functional Analysis of a Novel Cyclin E/Cdk2 Substrate Ankrd17* , 2009, Journal of Biological Chemistry.

[9]  Lei Liu,et al.  Ensemble gene selection by grouping for microarray data classification , 2010, J. Biomed. Informatics.

[10]  Thomas D. Wu,et al.  Molecular subclasses of high-grade glioma predict prognosis, delineate a pattern of disease progression, and resemble stages in neurogenesis. , 2006, Cancer cell.

[11]  P. Walter,et al.  Assembly of the Alu domain of the signal recognition particle (SRP): dimerization of the two protein components is required for efficient binding to SRP RNA , 1990, Molecular and cellular biology.

[12]  Young Bun Kim,et al.  Unsupervised Gene Selection For High Dimensional Data , 2006, Sixth IEEE Symposium on BioInformatics and BioEngineering (BIBE'06).

[13]  Yong Wang,et al.  A Novel Method of Feature Selection based on SVM , 2013, J. Comput..

[14]  Feiping Nie,et al.  Discriminative Least Squares Regression for Multiclass Classification and Feature Selection , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[15]  Zhuang Liu,et al.  Drug delivery with carbon nanotubes for in vivo cancer treatment , 2008, 0808.2070.

[16]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[17]  Surajit Ray,et al.  Top scoring pairs for feature selection in machine learning and applications to cancer outcome prediction , 2011, BMC Bioinformatics.

[18]  Thibault Helleputte,et al.  Robust biomarker identification for cancer diagnosis with ensemble feature selection methods , 2010, Bioinform..

[19]  Khalid Benabdeslem,et al.  Local-to-global semi-supervised feature selection , 2013, CIKM.

[20]  Robert Tibshirani,et al.  A Framework for Feature Selection in Clustering , 2010, Journal of the American Statistical Association.

[21]  H. Mure,et al.  Akt2 and Akt3 play a pivotal role in malignant gliomas. , 2010, Neuro-oncology.

[22]  S. Horvath,et al.  Gene Expression Profiling of Gliomas Strongly Predicts Survival , 2004, Cancer Research.

[23]  Heba Abusamra,et al.  A Comparative Study of Feature Selection and Classification Methods for Gene Expression Data of Glioma , 2013 .

[24]  Richard Weber,et al.  Simultaneous feature selection and classification using kernel-penalized support vector machines , 2011, Inf. Sci..

[25]  Sameem Abdul Kareem,et al.  Oral cancer prognosis based on clinicopathologic and genomic markers using a hybrid of feature selection and machine learning methods , 2013, BMC Bioinformatics.

[26]  Yee Whye Teh,et al.  A Fast Learning Algorithm for Deep Belief Nets , 2006, Neural Computation.

[27]  Qi Shen,et al.  Simultaneous genes and training samples selection by modified particle swarm optimization for gene expression data classification , 2009, Comput. Biol. Medicine.

[28]  D. Ayer,et al.  Role for the Mortality Factors MORF4, MRGX, and MRG15 in Transcriptional Repression via Associations with Pf1, mSin3A, and Transducin-Like Enhancer of Split , 2002, Molecular and Cellular Biology.

[29]  L. E. Scales,et al.  Introduction to Non-Linear Optimization , 1985 .

[30]  Habibollah Haron,et al.  Supervised, Unsupervised, and Semi-Supervised Feature Selection: A Review on Gene Selection , 2016, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[31]  Sara Tarek,et al.  Gene expression based cancer classification , 2017 .

[32]  K. P. Soman,et al.  Insight into Wavelets: From Theory to Practice , 2005 .

[33]  J. Wu,et al.  Feature Selection for Cancer Classification Using Microarray Gene Expression Data , 2017 .

[34]  Mário A. T. Figueiredo,et al.  Efficient feature selection filters for high-dimensional data , 2012, Pattern Recognit. Lett..

[35]  Satoru Miyano,et al.  A Top-r Feature Selection Algorithm for Microarray Gene Expression Data , 2012, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[36]  J. Shewchuk An Introduction to the Conjugate Gradient Method Without the Agonizing Pain , 1994 .

[37]  Shutao Li,et al.  Wavelet-Based Feature Extraction for Microarray Data Classification , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[38]  Chee Keong Kwoh,et al.  A Feature Subset Selection Method Based On High-Dimensional Mutual Information , 2011, Entropy.

[39]  C. M. Reeves,et al.  Function minimization by conjugate gradients , 1964, Comput. J..