New Bio-Marker Gene Discovery Algorithms for Cancer Gene Expression Profile

Several hybrid gene selection algorithms for cancer classification that employ bio-inspired evolutionary wrapper algorithm have been proposed in the literature and show good classification accuracy. In our recent previous work, we proposed a new wrapper gene selection method based-on firefly algorithm named FF-SVM. In this work, we will improve the classification performance of FF-SVM algorithm by proposed a new hybrid gene selection algorithm. Our new biomarker gene discovery algorithm for microarray cancer gene expression analysis that integrates f-score filter method with Firefly feature selection method alongside with SVM classifier named FFF-SVM is proposed. The classification accuracy for the selected gene subset is measured by support vector machine SVM classifier with leave-one-out cross validation LOOCV. The evaluation of the FFF-SVM algorithm done by using five benchmark microarray datasets of binary and multi class. To show result validation of the proposed we compare it with other related state-of-the-art algorithms. The experiment proves that the FFF-SVM outperform other hybrid algorithm in terms of high classification accuracy and low number of selected genes. In addition, we compare the proposed algorithm with previously proposed wrapper-based gene selection algorithm FF-SVM. The result show that the hybrid-based algorithm shoe higher performance than wrapper based. The proposed algorithm is an improvement of our previous proposed algorithm.

[1]  Yu Xue,et al.  A hybrid feature selection algorithm for gene expression data classification , 2017, Neurocomputing.

[2]  S. Wright THE INTERPRETATION OF POPULATION STRUCTURE BY F‐STATISTICS WITH SPECIAL REGARD TO SYSTEMS OF MATING , 1965 .

[3]  Andrew Y. Ng,et al.  Preventing "Overfitting" of Cross-Validation Data , 1997, ICML.

[4]  Salwani Abdullah,et al.  Hybrid feature selection algorithm using symmetrical uncertainty and a harmony search algorithm , 2016, Int. J. Syst. Sci..

[5]  Yousef Al-Ohali,et al.  ABC-SVM: Artificial Bee Colony and SVM Method for Microarray Gene Selection and Multi Class Cancer Classification , 2016 .

[6]  U. Alon,et al.  Broad patterns of gene expression revealed by clustering analysis of tumor and normal colon tissues probed by oligonucleotide arrays. , 1999, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Xin-She Yang,et al.  Firefly Algorithms for Multimodal Optimization , 2009, SAGA.

[8]  Namita Srivastava,et al.  A novel approach for dimension reduction of microarray , 2017, Comput. Biol. Chem..

[9]  Nada Almugren,et al.  A Survey on Hybrid Feature Selection Methods in Microarray Gene Expression Data for Cancer Classification , 2019, IEEE Access.

[10]  Keinosuke Fukunaga,et al.  A Branch and Bound Algorithm for Feature Subset Selection , 1977, IEEE Transactions on Computers.

[11]  Nada Almugren,et al.  FF-SVM: New FireFly-based Gene Selection Algorithm for Microarray Cancer Classification , 2019, 2019 IEEE Conference on Computational Intelligence in Bioinformatics and Computational Biology (CIBCB).

[12]  Nizamettin Aydin,et al.  Gene selection and classification approach for microarray data based on Random Forest Ranking and BBHA , 2016, 2016 IEEE-EMBS International Conference on Biomedical and Health Informatics (BHI).

[13]  Verónica Bolón-Canedo,et al.  A review of microarray datasets and applied feature selection methods , 2014, Inf. Sci..

[14]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[15]  Mohammad Hossein Moattar,et al.  A hybrid gene selection approach for microarray data classification using cellular learning automata and ant colony optimization. , 2016, Genomics.

[16]  Xiaowei Yang,et al.  An efficient gene selection algorithm based on mutual information , 2009, Neurocomputing.

[17]  Hala Alshamlan,et al.  mRMR-ABC: A Hybrid Gene Selection Algorithm for Cancer Classification Using Microarray Gene Expression Profiling , 2015, BioMed research international.

[18]  M. Ringnér,et al.  Classification and diagnostic prediction of cancers using gene expression profiling and artificial neural networks , 2001, Nature Medicine.

[19]  Pedro Larrañaga,et al.  A review of feature selection techniques in bioinformatics , 2007, Bioinform..

[20]  E. Lander,et al.  MLL translocations specify a distinct gene expression profile that distinguishes a unique leukemia , 2002, Nature Genetics.

[21]  Gamal Attiya,et al.  Classification of human cancer diseases by gene expression profiles , 2017, Appl. Soft Comput..

[22]  M. Balafar,et al.  Gene selection for microarray cancer classification using a new evolutionary method employing artificial intelligence concepts. , 2017, Genomics.