A Heuristic Algorithm for Identifying Molecular Signatures in Cancer

Molecular signatures of cancer, e.g., genes or microRNAs (miRNAs), have been recognized very important in predicting the occurrence of cancer. From gene-expression and miRNA-expression data, the challenge of identifying molecular signatures lies in the huge number of molecules compared to the small number of samples. To address this issue, in this paper, we propose a heuristic algorithm to identify molecular signatures, termed HAMS, for cancer diagnosis by modeling it as a multi-objective optimization problem. In the proposed HAMS, an elitist-guided individual update strategy is proposed to obtain a small number of molecular signatures, which are closely related with cancer and contain less redundant signatures. Experimental results demonstrate that the proposed HAMS achieves superior performance over seven state-of-the-art algorithms on both gene-expression and miRNA-expression datasets. We also validate the biological significance of the molecular signatures obtained by the proposed HAMS through biological analysis.

[1]  Ye Tian,et al.  Efficient Large-Scale Multiobjective Optimization Based on a Competitive Swarm Optimizer , 2020, IEEE Transactions on Cybernetics.

[2]  Xiangxiang Zeng,et al.  A Parallel Workflow Pattern Modeling Using Spiking Neural P Systems With Colored Spikes , 2018, IEEE Transactions on NanoBioscience.

[3]  Xiangxiang Zeng,et al.  Prediction of potential disease-associated microRNAs using structural perturbation method , 2017, bioRxiv.

[4]  X S Zhao,et al.  An Improved Binary Differential Evolution Algorithm for Feature Selection in Molecular Signatures , 2018, Molecular informatics.

[5]  Dong-Sheng Cao,et al.  ECoFFeS: A Software Using Evolutionary Computation for Feature Selection in Drug Discovery , 2018, IEEE Access.

[6]  Ye Tian,et al.  A Decision Variable Clustering-Based Evolutionary Algorithm for Large-Scale Many-Objective Optimization , 2018, IEEE Transactions on Evolutionary Computation.

[7]  Mengjie Zhang,et al.  Pareto front feature selection based on artificial bee colony optimization , 2018, Inf. Sci..

[8]  Janez Brest,et al.  Multi-Objective Differential Evolution for feature selection in Facial Expression Recognition systems , 2017, Expert Syst. Appl..

[9]  Ibrahim Farag,et al.  Multistage feature selection approach for high-dimensional cancer data , 2017, Soft Comput..

[10]  Xiangxiang Zeng,et al.  Spiking Neural P Systems With Scheduled Synapses , 2017, IEEE Transactions on NanoBioscience.

[11]  Ye Tian,et al.  Effectiveness and efficiency of non-dominated sorting for evolutionary multi- and many-objective optimization , 2017, Complex & Intelligent Systems.

[12]  Su Ruan,et al.  Feature selection for outcome prediction in oesophageal cancer using genetic algorithm and random forest classifier , 2017, Comput. Medical Imaging Graph..

[13]  Markus Olhofer,et al.  A mini-review on preference modeling and articulation in multi-objective optimization: current status and challenges , 2017, Complex & Intelligent Systems.

[14]  Ben Niu,et al.  A discrete bacterial algorithm for feature selection in classification of microarray gene expression cancer data , 2017, Knowl. Based Syst..

[15]  Wei Lin,et al.  A comprehensive overview and evaluation of circular RNA detection tools , 2017, PLoS Comput. Biol..

[16]  Xiangxiang Zeng,et al.  Prediction and Validation of Disease Genes Using HeteSim Scores , 2017, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[17]  Nitin K. Singh,et al.  Sparse feature selection for classification and prediction of metastasis in endometrial cancer , 2017, BMC Genomics.

[18]  Abdelkader Benyettou,et al.  Kernel-based learning and feature selection analysis for cancer diagnosis , 2017, Appl. Soft Comput..

[19]  Xiangtao Li,et al.  Multiobjective ranking binary artificial bee colony for gene selection problems using microarray datasets , 2017 .

[20]  Satchidananda Dehuri,et al.  Feature selection model based on clustering and ranking in pipeline for microarray data , 2017 .

[21]  David J. Brown,et al.  Breast Cancer Diagnosis Using a Hybrid Genetic Algorithm for Feature Selection Based on Mutual Information , 2016, 2016 International Conference on Interactive Technologies and Games (ITAG).

[22]  Xingyi Zhang,et al.  A multi-objective membrane algorithm guided by the skin membrane , 2016, Natural Computing.

[23]  Xun Wang,et al.  Design of logic gates using spiking neural P systems with homogeneous neurons and astrocytes-like control , 2016, Inf. Sci..

[24]  A. Jemal,et al.  Cancer treatment and survivorship statistics, 2016 , 2016, CA: a cancer journal for clinicians.

[25]  Sergio Segura,et al.  SIP: Optimal Product Selection from Feature Models Using Many-Objective Evolutionary Optimization , 2016, ACM Trans. Softw. Eng. Methodol..

[26]  Satish Chandra,et al.  Improved cervix lesion classification using multi-objective binary firefly algorithm-based feature selection , 2016, Int. J. Bio Inspired Comput..

[27]  A. Chella,et al.  microRNA classifiers are powerful diagnostic/prognostic tools in ALK-, EGFR-, and KRAS-driven lung cancers , 2015, Proceedings of the National Academy of Sciences.

[28]  Juan-Zi Li,et al.  A multi-objective evolutionary algorithm for feature selection based on mutual information with a new redundancy measure , 2015, Inf. Sci..

[29]  Dong Xu,et al.  Classification of lung cancer using ensemble-based feature selection and machine learning methods. , 2015, Molecular bioSystems.

[30]  Miguel Ángel Guevara-López,et al.  Improving the Mann-Whitney statistical test for feature selection: An approach in breast cancer diagnosis on mammography , 2015, Artif. Intell. Medicine.

[31]  J. Hernández-Rivas,et al.  Transcriptome analysis reveals molecular profiles associated with evolving steps of monoclonal gammopathies , 2014, Haematologica.

[32]  Chien-Hsing Chen,et al.  A hybrid intelligent model of analyzing clinical breast cancer data using clustering techniques with feature selection , 2014, Appl. Soft Comput..

[33]  N. Seki,et al.  The MicroRNA Expression Signature of Bladder Cancer by Deep Sequencing: The Functional Significance of the miR-195/497 Cluster , 2014, PloS one.

[34]  Aixia Guo,et al.  Gene Selection for Cancer Classification using Support Vector Machines , 2014 .

[35]  Shi Wy,et al.  Gene expression analysis of lung cancer. , 2014 .

[36]  Weixiang Shi,et al.  Gene expression analysis of lung cancer. , 2014, European review for medical and pharmacological sciences.

[37]  Mengjie Zhang,et al.  Particle Swarm Optimization for Feature Selection in Classification: A Multi-Objective Approach , 2013, IEEE Transactions on Cybernetics.

[38]  U. Maulik,et al.  An SVM-Wrapped Multiobjective Evolutionary Feature Selection Approach for Identifying Cancer-MicroRNA Markers , 2013, IEEE Transactions on NanoBioscience.

[39]  Harish Sharma,et al.  Memetic search in artificial bee colony algorithm , 2013, Soft Computing.

[40]  Jacob Scharcanski,et al.  Feature selection for face recognition based on multi-objective evolutionary wrappers , 2013, Expert Syst. Appl..

[41]  Mengjie Zhang,et al.  Multi-objective Evolutionary Algorithms for filter Based Feature Selection in Classification , 2013, Int. J. Artif. Intell. Tools.

[42]  Lluís A. Belanche Muñoz,et al.  Feature Selection for Microarray Gene Expression Data using Simulated Annealing guided by the Multivariate Joint Entropy , 2013, ArXiv.

[43]  D. Ahel,et al.  ZRANB3 is a structure-specific ATP-dependent endonuclease involved in replication stress response. , 2012, Genes & development.

[44]  Mengjie Zhang,et al.  New fitness functions in binary particle swarm optimisation for feature selection , 2012, 2012 IEEE Congress on Evolutionary Computation.

[45]  Sanghamitra Bandyopadhyay,et al.  MicroRNA Transcription Start Site Prediction with Multi-objective Feature Selection , 2012, Statistical applications in genetics and molecular biology.

[46]  Barnali Sahu,et al.  A Novel Feature Selection Algorithm using Particle Swarm Optimization for Cancer Microarray Data , 2012 .

[47]  Li-Yeh Chuang,et al.  Gene selection and classification using Taguchi chaotic binary particle swarm optimization , 2011, Expert Syst. Appl..

[48]  Jiawei Han,et al.  Generalized Fisher Score for Feature Selection , 2011, UAI.

[49]  Ana Carolina Lorena,et al.  Multi-objective Genetic Algorithm Evaluation in Feature Selection , 2011, EMO.

[50]  Chih-Jen Lin,et al.  LIBSVM: A library for support vector machines , 2011, TIST.

[51]  Dr.S. Santhosh Baboo,et al.  Multicategory Classification Using Support Vector Machine for Microarray Gene Expression Cancer Diagnosis , 2010 .

[52]  Chuhsing Kate Hsiao,et al.  Identification of a Novel Biomarker, SEMA5A, for Non–Small Cell Lung Carcinoma in Nonsmoking Women , 2010, Cancer Epidemiology, Biomarkers & Prevention.

[53]  Wei Kong,et al.  Hybrid particle swarm optimization and tabu search approach for selecting genes for tumor classification using gene expression data , 2008, Comput. Biol. Chem..

[54]  Sushmita Mitra,et al.  Evolutionary Rough Feature Selection in Gene Expression Data , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[55]  Fakhri Karray,et al.  Multi-objective Feature Selection with NSGA II , 2007, ICANNGA.

[56]  Fuhui Long,et al.  Feature selection based on mutual information criteria of max-dependency, max-relevance, and min-redundancy , 2003, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[57]  R. Abseher,et al.  Microarray gene expression profiling of B-cell chronic lymphocytic leukemia subgroups defined by genomic aberrations and VH mutation status. , 2004, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[58]  Flávio Bortolozzi,et al.  Unsupervised feature selection using multi-objective genetic algorithms for handwritten word recognition , 2003, Seventh International Conference on Document Analysis and Recognition, 2003. Proceedings..

[59]  B. Morris,et al.  The Structure of the Zinc Finger Domain from Human Splicing Factor ZNF265 Fold* , 2003, Journal of Biological Chemistry.

[60]  V. Rao Vemuri,et al.  Use of K-Nearest Neighbor classifier for intrusion detection , 2002, Comput. Secur..

[61]  F. Piano,et al.  The ZW10 and Rough Deal checkpoint proteins function together in a large, evolutionarily conserved complex targeted to the kinetochore. , 2001, Journal of cell science.

[62]  E. R. Taylor,et al.  Isolation and characterization of two novel A20-like proteins. , 2001, The Biochemical journal.

[63]  Marco Laumanns,et al.  SPEA2: Improving the strength pareto evolutionary algorithm , 2001 .

[64]  Daniel A Starr,et al.  HZwint-1, a novel human kinetochore component that interacts with HZW10. , 2000, Journal of cell science.

[65]  David W. Corne,et al.  Approximating the Nondominated Front Using the Pareto Archived Evolution Strategy , 2000, Evolutionary Computation.

[66]  A. Gilman,et al.  A Candidate Target for G Protein Action in Brain* , 1999, The Journal of Biological Chemistry.

[67]  R. Davis,et al.  The cytoplasmic zinc finger protein ZPR1 accumulates in the nucleolus of proliferating cells. , 1998, Molecular biology of the cell.

[68]  R. Stahel,et al.  Human small cell lung cancer expresses the octamer DNA-binding and nervous system-specific transcription factor N-Oct 3 (brain-2). , 1992, Cancer research.

[69]  M. Gilman,et al.  Expression cloning of a novel zinc finger protein that binds to the c-fos serum response element , 1992, Molecular and cellular biology.

[70]  S. O’Brien,et al.  The GLI-Kruppel family of human genes , 1988, Molecular and cellular biology.