Classification of microarray datasets using finite impulse response extreme learning machine for cancer diagnosis

Cancer diagnosis is a highly researched field in bioinformatics since the introduction of microarray gene expression technology which allows multiple diseases to be simultaneously tested and compared between healthy and malignant cells. This paper analyses the use of a recently suggested neural network based classifier known as the finite impulse response extreme learning machine (FIR-ELM) for the classification of two binary bioinformatics datasets consisting of microarray gene expressions for leukemia and colon tumor. The FIR-ELM is based on the single hidden layer feedforward neural network (SLFN) whose weights are trained to reduce the effects of noise and improve the robustness of the classifier. The hidden layer of the FIR-ELM is seen to be able to reduce the noise and disturbances from the full microarray dataset which is known to consist of many experimental errors and biases accrued from the production process. Experimental results have shown that the FIR-ELM is capable of achieving good performance compared to conventional classifiers such as the back propagation artificial neural network (BP-ANN), extreme learning machine (ELM), and support vector machine (SVM) which implement a gene selection procedure prior to working on the datasets.

[1]  Dr.S. Santhosh Baboo,et al.  Multicategory Classification Using Support Vector Machine for Microarray Gene Expression Cancer Diagnosis , 2010 .

[2]  P. Saratchandran,et al.  Multicategory Classification Using An Extreme Learning Machine for Microarray Gene Expression Cancer Diagnosis , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[3]  Tarek Helmy,et al.  Multi-category bioinformatics dataset classification using extreme learning machine , 2009, 2009 IEEE Congress on Evolutionary Computation.

[4]  Ahmad M. Sarhan,et al.  Journal of Theoretical and Applied Information Technology Cancer Classification Based on Microarray Gene Expression Data Using Dct and Ann , 2022 .

[5]  Ali Hussian Ali Self-Organization Maps for Prediction of Kidney Dysfunction , 2008 .

[6]  William A. Schmitt,et al.  Interactive exploration of microarray gene expression patterns in a reduced dimensional space. , 2002, Genome research.

[7]  S. Dudoit,et al.  Introduction to Classification in Microarray Experiments , 2003 .

[8]  Chee Kheong Siew,et al.  Universal Approximation using Incremental Constructive Feedforward Networks with Random Hidden Nodes , 2006, IEEE Transactions on Neural Networks.

[9]  Chee Kheong Siew,et al.  Extreme learning machine: Theory and applications , 2006, Neurocomputing.

[10]  Werner Dubitzky,et al.  Multiclass Cancer Classification Using Gene Expression Profiling and Probabilistic Neural Networks , 2002, Pacific Symposium on Biocomputing.

[11]  Lawrence Carin,et al.  ICA and PLS modeling for functional analysis and drug sensitivity for DNA microarray signals , 2002, 2002 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[12]  Pedro Antonio Gutiérrez,et al.  On the suitability of Extreme Learning Machine for gene classification using feature selection , 2010, 2010 10th International Conference on Intelligent Systems Design and Applications.

[13]  Luis Mateus Rocha,et al.  Singular value decomposition and principal component analysis , 2003 .

[14]  T. Poggio,et al.  Multiclass cancer diagnosis using tumor gene expression signatures , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[15]  A. Bharathi,et al.  Microarray gene expression cancer diagnosis using Machine Learning algorithms , 2010, 2010 International Conference on Signal and Image Processing.

[16]  Zhihong Man,et al.  A new robust training algorithm for a class of single-hidden layer feedforward neural networks , 2011, Neurocomputing.