Enhancing the performance of hybrid evolutionary algorithms in microarray colon data classification

Microarray technology aids in monitoring simultaneous gene expressions of human cells for diagnosis and treatment of infectious diseases in a single experiment. The microarray data are of high dimension with a large number of genes and fewer samples. Highly informative features and appropriate classifiers are necessary to increase the accuracy of disease classification. In this paper, a publicly available colon cancer dataset is analyzed for classification. At first, Power Spectral Density (PSD) technique is employed to obtain the reduced gene features and after that, they are classified using various types of classifiers for colon cancer classification. To improve the classification accuracy, the hybridized evolutionary classifiers, namely, Artificial Bee Colony‐Firefly (ABC‐Firefly), Artificial Bee Colony‐Particle Swarm Optimization (ABC‐PSO), Particle Swarm Optimization‐Firefly (PSO‐Firefly) are derived from the conventional evolutionary classifiers, namely, Artificial Bee Colony (ABC), Particle Swarm Optimization (PSO) and Firefly classifiers. The performance of the three hybridized evolutionary algorithms is compared with the conventional classifiers. Additionally, the comparison was also performed on the conventional and hybridized evolutionary algorithms using Expectation Maximization (EM) and cascaded Power Spectral Density‐Expectation Maximization (PSD‐EM) dimensionality reduction techniques. Although there are many classifiers for colon classification, it is wise to use PSD features with a Hybrid ABC‐PSO classifier for enhanced performance. The experimental results reveal that the Hybrid ABC‐PSO classifier with PSD features achieves 98.96% for colon cancer samples and 100% accuracy for colon normal samples when compared with the other classifiers reported in the literature.

[1]  Xin-She Yang,et al.  Nature-Inspired Metaheuristic Algorithms , 2008 .

[2]  G. Jeyakumar,et al.  Hybrid feature selection using micro genetic algorithm on microarray gene expression data , 2019, J. Intell. Fuzzy Syst..

[3]  J. Wu,et al.  Feature Selection for Cancer Classification Using Microarray Gene Expression Data , 2017 .

[4]  Ali Najafi,et al.  Classification and Biomarker Genes Selection for Cancer Gene Expression Data Using Random Forest , 2017, Iranian journal of pathology.

[5]  David H. Wolpert,et al.  No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..

[6]  Alessandro Orro,et al.  A tool for mapping Single Nucleotide Polymorphisms using Graphics Processing Units , 2014, BMC Bioinformatics.

[7]  A. Jemal,et al.  Cancer statistics, 2017 , 2017, CA: a cancer journal for clinicians.

[8]  Namita Srivastava,et al.  Artificial neural network classification of microarray data using new hybrid gene selection method , 2017, Int. J. Data Min. Bioinform..

[9]  Shutao Li,et al.  Gene selection using hybrid particle swarm optimization and genetic algorithm , 2008, Soft Comput..

[10]  U. Alon,et al.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[11]  R. Young,et al.  Biomedical Discovery with DNA Arrays , 2000, Cell.

[12]  M. Sim,et al.  AJCC Cancer Staging Manual 7th edition criteria for colon cancer: do the complex modifications improve prognostic assessment? , 2013, Journal of the American College of Surgeons.

[13]  Shu Liao,et al.  Dominant Local Binary Patterns for Texture Classification , 2009, IEEE Transactions on Image Processing.

[14]  Steven J. M. Jones,et al.  Meta-analysis of Colorectal Cancer Gene Expression Profiling Studies Identifies Consistently Reported Candidate Biomarkers , 2008, Cancer Epidemiology Biomarkers & Prevention.

[15]  Seyed Mohammad Hosseini,et al.  A Novel Weighted Support Vector Machine Based on Particle Swarm Optimization for Gene Selection and Tumor Classification , 2012, Comput. Math. Methods Medicine.

[16]  Gregory W. Corder,et al.  Nonparametric Statistics for Non-Statisticians: A Step-by-Step Approach , 2009 .

[17]  Riccardo Poli,et al.  Particle swarm optimization , 1995, Swarm Intelligence.

[18]  Haiyan Wang,et al.  Improving accuracy for cancer classification with a new algorithm for genes selection , 2012, BMC Bioinformatics.

[19]  Lin Sun,et al.  A Hybrid Gene Selection Method Based on ReliefF and Ant Colony Optimization Algorithm for Tumor Classification , 2019, Scientific Reports.

[20]  N. Dubrawsky Cancer statistics , 1989, CA: a cancer journal for clinicians.

[21]  Yusuke Nakamura,et al.  Molecular features of hormone-refractory prostate cancer cells by genome-wide gene expression profiles. , 2007, Cancer research.

[22]  T. Wetter,et al.  Verification of gene expression profiles for colorectal cancer using 12 internet public microarray datasets. , 2014, World journal of gastroenterology.

[23]  Bart De Moor,et al.  New bandwidth selection criterion for Kernel PCA: Approach to dimensionality reduction and classification problems , 2014, BMC Bioinformatics.

[24]  Yu Xue,et al.  A hybrid feature selection algorithm for gene expression data classification , 2017, Neurocomputing.

[25]  Wei Kong,et al.  Hybrid particle swarm optimization and tabu search approach for selecting genes for tumor classification using gene expression data , 2008, Comput. Biol. Chem..

[26]  Hala Mohammed Alshamlan,et al.  Co-ABC: Correlation artificial bee colony algorithm for biomarker gene discovery using gene expression profile , 2018, Saudi journal of biological sciences.

[27]  Songyot Nakariyakul,et al.  A hybrid gene selection algorithm based on interaction information for microarray-based cancer classification , 2019, PloS one.

[28]  R Fodde,et al.  Expression and genomic profiling of colorectal cancer. , 2007, Biochimica et biophysica acta.

[29]  Z. Voulgaris,et al.  Dimensionality Reduction for Feature and Pattern Selection in Classification Problems , 2008, 2008 The Third International Multi-Conference on Computing in the Global Information Technology (iccgi 2008).

[30]  Hua Wang,et al.  Combined Gene Selection Methods for Microarray Data Analysis , 2006, KES.

[31]  Hala M. Alshamlan,et al.  The Performance of Bio-Inspired Evolutionary Gene Selection Methods for Cancer Classification Using Microarray Dataset , 2014 .

[32]  Hala Alshamlan,et al.  mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling , 2015, BioMed research international.

[33]  Ali Najafi,et al.  A hybrid gene selection algorithm for microarray cancer classification using genetic algorithm and learning automata , 2017 .

[34]  Smitarani Satpathy,et al.  Microarray Classification Using Intelligent Techniques , 2014 .

[35]  Fan Zhang,et al.  IPAD: the Integrated Pathway Analysis Database for Systematic Enrichment Analysis , 2012, BMC Bioinformatics.

[36]  Li Liu,et al.  Cancer Feature Selection and Classification Using a Binary Quantum-Behaved Particle Swarm Optimization and Support Vector Machine , 2016, Comput. Math. Methods Medicine.

[37]  M. O. Arowolo,et al.  A Comparative Analysis of Feature Extraction Methods for Classifying Colon Cancer Microarray Data , 2017, EAI Endorsed Trans. Scalable Inf. Syst..

[38]  William H. Press,et al.  Numerical recipes , 1990 .

[39]  Hervé Abdi,et al.  Multiple Subject Barycentric Discriminant Analysis (MUSUBADA): How to Assign Scans to Categories without Using Spatial Normalization , 2012, Comput. Math. Methods Medicine.

[40]  Thomas Wetter,et al.  Gene expression profile of peripheral blood in colorectal cancer. , 2014, World journal of gastroenterology.

[41]  Lin Sun,et al.  A Gene selection approach based on the fisher linear discriminant and the neighborhood rough set , 2017, Bioengineered.

[42]  John Quackenbush,et al.  Computational genetics: Computational analysis of microarray data , 2001, Nature Reviews Genetics.

[43]  C. Compton,et al.  The Eighth Edition AJCC Cancer Staging Manual: Continuing to build a bridge from a population‐based to a more “personalized” approach to cancer staging , 2017, CA: a cancer journal for clinicians.

[44]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.