Reliable multiclass cancer classification of microarray gene expression profiles using an improved wavelet neural network

Abstract Properly designing a wavelet neural network (WNN) is crucial for achieving the optimal generalization performance. In this paper, two different approaches were proposed for improving the predictive capability of WNNs. First, the types of activation functions used in the hidden layer of the WNN were varied. Second, the proposed enhanced fuzzy c-means clustering algorithm—specifically, the modified point symmetry-based fuzzy c-means (MSFCM) algorithm—was employed in selecting the locations of the translation vectors of the WNN. The modified WNN was then applied to heterogeneous cancer classification using four different microarray benchmark datasets. The comparative experimental results showed that the proposed methodology achieved an almost 100% classification accuracy in multiclass cancer prediction, leading to superior performance with respect to other clustering algorithms. Subsequently, performance comparisons with other classifiers were made. An assessment analysis showed that this proposed approach outperformed most of the other classifiers.

[1]  Yihui Liu,et al.  Wavelet feature selection for microarray data , 2007, 2007 IEEE/NIH Life Science Systems and Applications Workshop.

[2]  Yonghong Peng,et al.  A novel ensemble machine learning for robust microarray data classification , 2006, Comput. Biol. Medicine.

[3]  Cheng-Jian Lin,et al.  Nonlinear systems control using self-constructing wavelet networks , 2009, Appl. Soft Comput..

[4]  Gérard Dreyfus,et al.  Initialization by selection for wavelet network training , 2000, Neurocomputing.

[5]  Ru-Sheng Liu,et al.  Pattern classification in DNA microarray data of multiple tumor types , 2006, Pattern Recognit..

[6]  Shinn-Ying Ho,et al.  Selecting a minimal number of relevant genes from microarray data to design accurate tissue classifiers , 2007, Biosyst..

[7]  Zarita Zainuddin,et al.  A Neural Network Approach in Predicting the Blood Glucose Level for Diabetic Patients , 2009 .

[8]  Yuehui Chen,et al.  A novel ensemble of classifiers for microarray data classification , 2008, Appl. Soft Comput..

[9]  Roberto Muñoz,et al.  Multivariate calibration model from overlapping voltammetric signals employing wavelet neural networks , 2006 .

[10]  M. Moraud Wavelet Networks , 2018, Foundations of Wavelet Networks and Applications.

[11]  David J. Evans,et al.  Human Face Recognition Using Accelerated Multilayer Perceptrons , 2003, Int. J. Comput. Math..

[12]  Engin Avci,et al.  The performance comparison of discrete wavelet neural network and discrete wavelet adaptive network based fuzzy inference system for digital modulation recognition , 2008, Expert Syst. Appl..

[13]  T. Poggio,et al.  Prediction of central nervous system embryonal tumour outcome based on gene expression , 2002, Nature.

[14]  Stephen T. C. Wong,et al.  Cancer classification and prediction using logistic regression with Bayesian gene selection , 2004, J. Biomed. Informatics.

[15]  Tzu-Tsung Wong,et al.  Two-stage classification methods for microarray data , 2008, Expert Syst. Appl..

[16]  Zarita Zainuddin,et al.  Function approximation using artificial neural networks , 2007 .

[17]  Charles Wang,et al.  Multi-class tumor classification by discriminant partial least squares using microarray gene expression data and assessment of classification models , 2004, Comput. Biol. Chem..

[18]  Roberto Muñoz,et al.  Wavelet neural networks to resolve the overlapping signal in the voltammetric determination of phenolic compounds. , 2008, Talanta.

[19]  Wei Kong,et al.  New gene selection method for multiclass tumor classification by class centroid , 2009, J. Biomed. Informatics.

[20]  R. Tibshirani,et al.  Diagnosis of multiple cancer types by shrunken centroids of gene expression , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[21]  Li Jie,et al.  Prediction of Al(OH)3 fluidized roasting temperature based on wavelet neural network , 2007 .

[22]  Wei Du,et al.  Molecular classification of cancer types from microarray data using the combination of genetic algorithms and support vector machines , 2003, FEBS letters.

[23]  Li M Fu,et al.  Multi‐class cancer subtype classification based on gene expression signatures with reliability analysis , 2004, FEBS letters.

[24]  Eduardo Gómez-Ramírez,et al.  Forecasting Time Series with a New Architecture for Polynomial Artificial Neural Network , 2006, The 2006 IEEE International Joint Conference on Neural Network Proceedings.

[25]  Yan Zhou,et al.  Defect Characteristic Prediction of Pipeline by Means of Wavelet Neural Network Based on the Hierarchical Clustering Algorithm , 2004 .

[26]  Chokri Ben Amar,et al.  Comparison between Beta Wavelets Neural Networks, RBF Neural Networks and Polynomial Approximation for 1D, 2DFunctions Approximation , 2008 .

[27]  Jie Li,et al.  A new classification model with simple decision rule for discovering optimal feature gene pairs , 2007, Comput. Biol. Medicine.

[28]  Sung-Bae Cho,et al.  The classification of cancer based on DNA microarray data that uses diverse ensemble genetic programming , 2006, Artif. Intell. Medicine.

[29]  Qi Tian,et al.  Gabor wavelet associative memory for face recognition , 2005, IEEE Trans. Neural Networks.

[30]  Mohammad Fazle Azeem,et al.  Artificial wavelet neural network and its application in neuro-fuzzy models , 2008, Appl. Soft Comput..

[31]  Lalita Udpa,et al.  Electromagnetic NDE signal inversion by function-approximation neural networks , 2002 .

[32]  Seong-Joo Kim,et al.  Design of the scaling-wavelet neural network using genetic algorithm , 2002, Proceedings of the 2002 International Joint Conference on Neural Networks. IJCNN'02 (Cat. No.02CH37290).

[33]  Bijaya K. Panigrahi,et al.  Power signal classification using dynamic wavelet network , 2009, Appl. Soft Comput..

[34]  Xin Feng,et al.  Identification of critical genes in microarray experiments by a Neuro-Fuzzy approach , 2006, Comput. Biol. Chem..

[35]  Jorng-Tzong Horng,et al.  An expert system to identify co-regulated gene groups from time-lagged gene clusters using cell cycle expression data , 2010, Expert Syst. Appl..

[36]  Nikola K. Kasabov,et al.  Evolving connectionist systems for knowledge discovery from gene expression data of cancer tissue , 2003, Artif. Intell. Medicine.

[37]  Jae Won Lee,et al.  An extensive comparison of recent classification tools applied to microarray data , 2004, Comput. Stat. Data Anal..

[38]  Kuo-Liang Chung,et al.  Faster and more robust point symmetry-based K-means algorithm , 2007, Pattern Recognit..

[39]  Dahai Zhang,et al.  Design and initialization algorithm based on modulus maxima of wavelet transform for wavelet neural network , 2004, 2004 International Conference on Power System Technology, 2004. PowerCon 2004..

[40]  Xuehua Li,et al.  Kernel based nonlinear dimensionality reduction for microarray gene expression data analysis , 2009, Expert Syst. Appl..

[41]  M. Ringnér,et al.  Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks , 2001, Nature Medicine.

[42]  HuanqinLi,et al.  Using Multi-input-layer Wavelet Neural Network to Model Product Quality of Continuous Casting Furnace and Hot Rolling Mill , 2004 .

[43]  Satish S. Udpa,et al.  Characterization of gas pipeline inspection signals using wavelet basis function neural networks , 2000 .

[44]  Jiwen Dong,et al.  Time-series prediction using a local linear wavelet neural network , 2006, Neurocomputing.

[45]  Armando Blanco,et al.  Intelligent system for the analysis of microarray data using principal components and estimation of distribution algorithms , 2009, Expert Syst. Appl..

[46]  Chien-Hsing Chou,et al.  Fuzzy C-Means Algorithm with a Point Symmetry Distance , 2006 .

[47]  Dhammika Amaratunga,et al.  Exploration and Analysis of DNA Microarray and Protein Array Data , 2003, Wiley series in probability and statistics.

[48]  Jianping Li,et al.  A multiple kernel support vector machine scheme for feature selection and rule extraction from gene expression data of cancer tissue , 2007, Artif. Intell. Medicine.

[49]  Madhusudan Singh,et al.  New fuzzy wavelet neural networks for system identification and control , 2005, Appl. Soft Comput..

[50]  Zarita Zainuddin,et al.  Improved wavelet neural network for early diagnosis of cancer patients using microarray gene expression data , 2009, 2009 International Joint Conference on Neural Networks.

[51]  Hau-San Wong,et al.  Constructing the gene regulation-level representation of microarray data for cancer classification , 2008, J. Biomed. Informatics.

[52]  Wei Xie,et al.  Accurate Cancer Classification Using Expressions of Very Few Genes , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[53]  Vadlamani Ravi,et al.  Software development cost estimation using wavelet neural networks , 2008, J. Syst. Softw..

[54]  Jiacong Cao,et al.  Study of hourly and daily solar irradiation forecast using diagonal recurrent wavelet neural networks , 2008 .

[55]  Ping Xu,et al.  Modified linear discriminant analysis approaches for classification of high-dimensional microarray data , 2009, Comput. Stat. Data Anal..

[56]  Zhang Xiao-guang,et al.  Fuzzy Wavelet Neural Networks Based on SVM , 2006 .

[57]  Sung-Bae Cho,et al.  Prediction of colon cancer using an evolutionary neural network , 2004, Neurocomputing.

[58]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[59]  Jorng-Tzong Horng,et al.  An expert system to classify microarray gene expression data using gene selection by decision tree , 2009, Expert Syst. Appl..

[60]  Roberto Avogadri,et al.  Fuzzy ensemble clustering based on random projections for DNA microarray data analysis , 2009, Artif. Intell. Medicine.

[61]  Sung-Bae Cho,et al.  Ensemble classifiers based on correlation analysis for DNA microarray classification , 2006, Neurocomputing.

[62]  K. Deb,et al.  Reliable classification of two-class cancer data using evolutionary algorithms. , 2003, Bio Systems.

[63]  Ah-Hwee Tan,et al.  Predictive neural networks for gene expression data analysis , 2005, Neural Networks.

[64]  Wan-zhao Cui,et al.  Prediction of thin film thickness of field emission using wavelet neural networks , 2005 .

[65]  Chien-Hsing Chou,et al.  Short Papers , 2001 .

[66]  Nishu Sharma,et al.  A Comparative Study Of Data Clustering Techniques , 2013 .

[67]  Xue-wen Chen,et al.  Margin-based wrapper methods for gene identification using microarray , 2006, Neurocomputing.

[68]  Dimitrios Vogiatzis,et al.  Active learning for microarray data , 2008, Int. J. Approx. Reason..

[69]  John Moody,et al.  Fast Learning in Networks of Locally-Tuned Processing Units , 1989, Neural Computation.

[70]  Pericles A. Mitkas,et al.  SoFoCles: Feature filtering for microarray classification based on Gene Ontology , 2010, J. Biomed. Informatics.

[71]  Peng Zhou,et al.  A sequential feature extraction approach for naïve bayes classification of microarray data , 2009, Expert Syst. Appl..

[72]  Zhiguo Zhang,et al.  Learning algorithm of wavelet network based on sampling theory , 2007, Neurocomputing.

[73]  Zhiping Lin,et al.  Composite function wavelet neural networks with extreme learning machine , 2010, Neurocomputing.

[74]  H. Iba,et al.  Gene selection for classification of cancers using probabilistic model building genetic algorithm. , 2005, Bio Systems.

[75]  Eric A. Rying,et al.  Focused local learning with wavelet neural networks , 2002, IEEE Trans. Neural Networks.

[76]  Igor V. Tetko,et al.  Gene selection from microarray data for cancer classification - a machine learning approach , 2005, Comput. Biol. Chem..

[77]  T. Golub,et al.  Gene expression-based classification of malignant gliomas correlates better with survival than histological classification. , 2003, Cancer research.

[78]  Michalis E. Blazadonakis,et al.  Wrapper filtering criteria via linear neuron and kernel approaches , 2008, Comput. Biol. Medicine.

[79]  Jesús S. Aguilar-Ruiz,et al.  Incremental wrapper-based gene selection from microarray data for cancer classification , 2006, Pattern Recognit..

[80]  Mohsen Hayati,et al.  Multilayer perceptron neural networks with novel unsupervised training method for numerical solution of the partial differential equations , 2009, Appl. Soft Comput..

[81]  Indra Neil Sarkar,et al.  Characteristic attributes in cancer microarrays , 2002, Journal of Biomedical Informatics.

[82]  Jiacong Cao,et al.  Application of the diagonal recurrent wavelet neural network to solar irradiation forecast assisted with fuzzy technique , 2008, Eng. Appl. Artif. Intell..

[83]  Zaki Ahmad Dahlan,et al.  The Performance of Two Mothers Wavelets in Function Approximation , 2009 .

[84]  Insuk Sohn,et al.  Selecting marker genes for cancer classification using supervised weighted kernel clustering and the support vector machine , 2009, Comput. Stat. Data Anal..