Hybridising harmony search with a Markov blanket for gene selection problems

Gene selection, which is a well-known NP-hard problem, is a challenging task that has been the subject of a large amount of research, especially in relation to classification tasks. This problem addresses the identification of the smallest possible set of genes that could achieve good predictive performance. Many gene selection algorithms have been proposed; however, because the search space increases exponentially with the number of genes, finding the best possible approach for a solution that would limit the search space is crucial. Metaheuristic approaches have the ability to discover a promising area without exploring the whole solution space. Hence, we propose a new method that hybridises the Harmony Search Algorithm (HSA) and the Markov Blanket (MB), called HSA-MB, for gene selection in classification problems. In this proposed approach, the HSA (as a wrapper approach) improvises a new harmony that is passed to the MB (treated as a filter approach) for further improvement. The addition and deletion of operators based on gene ranking information is used in the MB algorithm to further improve the harmony and to fine-tune the search space. The HSA-MB algorithm method works especially well on selected genes with higher correlation coefficients based on symmetrical uncertainty. Ten microarray datasets were experimented on, and the results demonstrate that the HSA-MB has a performance that is comparable to state-of-the-art approaches. HSA-MB yields very small sets of genes while preserving the classification accuracy. The results suggest that HSA-MB has a high potential for being an alternative method of gene selection when applied to microarray data and can be of benefit in clinical practice.

[1]  Xin-She Yang Harmony Search as a Metaheuristic Algorithm , 2009 .

[2]  Geoffrey J McLachlan,et al.  Selection bias in gene extraction on the basis of microarray gene-expression data , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[3]  Li-Yeh Chuang,et al.  A hybrid feature selection method for DNA microarray data , 2011, Comput. Biol. Medicine.

[4]  Jose Crispin Hernandez Hernandez,et al.  Hybrid Filter-Wrapper with a Specialized Random Multi-Parent Crossover Operator for Gene Selection and Classification Problems , 2011, ICIC.

[5]  Li-Yeh Chuang,et al.  A Hybrid BPSO-CGA Approach for Gene Selection and Classification of Microarray Data , 2012, J. Comput. Biol..

[6]  Dhanesh Ramachandram,et al.  Dynamic fuzzy clustering using Harmony Search with application to image segmentation , 2009, 2009 IEEE International Symposium on Signal Processing and Information Technology (ISSPIT).

[7]  Zong Woo Geem,et al.  Music Composition Using Harmony Search Algorithm , 2009, EvoWorkshops.

[8]  Bijaya K. Panigrahi,et al.  Dynamic economic load dispatch using hybrid swarm intelligence based harmony search algorithm , 2011, Expert Syst. Appl..

[9]  Panos M. Pardalos,et al.  An improved adaptive binary Harmony Search algorithm , 2013, Inf. Sci..

[10]  William H. Press,et al.  Numerical recipes in C , 2002 .

[11]  Sophie Schbath,et al.  Separating Significant Matches from Spurious Matches in DNA Sequences , 2012, J. Comput. Biol..

[12]  El-Ghazali Talbi,et al.  Comparison of population based metaheuristics for feature selection: Application to microarray data classification , 2008, 2008 IEEE/ACS International Conference on Computer Systems and Applications.

[13]  Jin-Kao Hao,et al.  A memetic algorithm for gene selection and molecular classification of cancer , 2009, GECCO.

[14]  Francisco Herrera,et al.  Enhancing evolutionary instance selection algorithms by means of fuzzy rough set based feature selection , 2012, Inf. Sci..

[15]  Ron Kohavi,et al.  Wrappers for Feature Subset Selection , 1997, Artif. Intell..

[16]  Zong Woo Geem,et al.  Harmony Search Algorithms for Structural Design Optimization , 2009 .

[17]  Morteza Haghir Chehreghani,et al.  Novel meta-heuristic algorithms for clustering web documents , 2008, Appl. Math. Comput..

[18]  Jesús S. Aguilar-Ruiz,et al.  Incremental wrapper-based gene selection from microarray data for cancer classification , 2006, Pattern Recognit..

[19]  Mohammad Reza Meybodi,et al.  Efficient stochastic algorithms for document clustering , 2013, Inf. Sci..

[20]  Huan Liu,et al.  Redundancy based feature selection for microarray data , 2004, KDD.

[21]  Daphne Koller,et al.  Toward Optimal Feature Selection , 1996, ICML.

[22]  Zong Woo Geem,et al.  A New Heuristic Optimization Algorithm: Harmony Search , 2001, Simul..

[23]  Witold Pedrycz,et al.  Feature selection using structural similarity , 2012, Inf. Sci..

[24]  Aditya Panchal,et al.  Harmony Search in Therapeutic Medical Physics , 2009 .

[25]  Shih-Wei Lin,et al.  Parameter determination and feature selection for C4.5 algorithm using scatter search approach , 2012, Soft Comput..

[26]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[27]  Silvia Casado Yusta,et al.  Different metaheuristic strategies to solve the feature selection problem , 2009, Pattern Recognit. Lett..

[28]  Richard Weber,et al.  A wrapper method for feature selection using Support Vector Machines , 2009, Inf. Sci..

[29]  Driss Aboutajdine,et al.  A two-stage gene selection scheme utilizing MRMR filter and GA wrapper , 2011, Knowledge and Information Systems.

[30]  Mohammed El-Abd,et al.  Performance assessment of foraging algorithms vs. evolutionary algorithms , 2012, Inf. Sci..

[31]  Jose Miguel Puerta,et al.  A GRASP algorithm for fast hybrid (filter-wrapper) feature subset selection in high-dimensional datasets , 2011, Pattern Recognit. Lett..

[32]  M. Fesanghary,et al.  Combined heat and power economic dispatch by harmony search algorithm , 2007 .

[33]  D. Ramachandram,et al.  Harmony search-based cluster initialization for fuzzy c-means segmentation of MR images , 2009, TENCON 2009 - 2009 IEEE Region 10 Conference.

[34]  Mohammed Azmi Al-Betar,et al.  A harmony search algorithm for university course timetabling , 2010, Annals of Operations Research.

[35]  M. Tamer Ayvaz,et al.  Simultaneous determination of aquifer parameters and zone structures with fuzzy c-means clustering and meta-heuristic harmony search algorithm , 2007 .

[36]  J. Ross Quinlan,et al.  Induction of Decision Trees , 1986, Machine Learning.

[37]  Cheng-Lung Huang,et al.  ACO-based hybrid classification system with feature subset selection and model parameters optimization , 2009, Neurocomputing.

[38]  Rajesh Kumar,et al.  An Intelligent Tuned Harmony Search algorithm for optimisation , 2012, Inf. Sci..

[39]  Zexuan Zhu,et al.  Markov blanket-embedded genetic algorithm for gene selection , 2007, Pattern Recognit..

[40]  N. Ramaraj,et al.  A novel hybrid feature selection via Symmetrical Uncertainty ranking based local memetic search algorithm , 2010, Knowl. Based Syst..

[41]  Zexuan Zhu,et al.  Wrapper–Filter Feature Selection Algorithm Using a Memetic Framework , 2007, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[42]  Ali R. Yildiz,et al.  A comparative study of population-based optimization algorithms for turning operations , 2012, Inf. Sci..

[43]  Shuo-Yan Chou,et al.  Enhancing the classification accuracy by scatter-search-based ensemble approach , 2011, Appl. Soft Comput..

[44]  R. Venkata Rao,et al.  Teaching-Learning-Based Optimization: An optimization method for continuous non-linear large scale problems , 2012, Inf. Sci..

[45]  Kazuyuki Murase,et al.  A new hybrid ant colony optimization algorithm for feature selection , 2012, Expert Syst. Appl..

[46]  Mohammed Azmi Al-Betar,et al.  A Harmony Search with Multi-pitch Adjusting Rate for the University Course Timetabling , 2010, Recent Advances In Harmony Search Algorithm.

[47]  Rajni Bala,et al.  A Hybrid Approach for Selection of Relevant Features for Microarray Datasets , 2007 .

[48]  K. Lee,et al.  A new structural optimization method based on the harmony search algorithm , 2004 .

[49]  M. Fesanghary,et al.  Design optimization of shell and tube heat exchangers using global sensitivity analysis and harmony search algorithm , 2009 .

[50]  Ramesh C. Bansal,et al.  Dynamic economic dispatch using harmony search algorithm with modified differential mutation operator , 2012 .

[51]  Carlos Alberto Cobos Lozada,et al.  GHS + LEM: Global-best Harmony Search using learnable evolution models , 2011, Appl. Math. Comput..

[52]  Hossein Nezamabadi-pour,et al.  An Improved Multi-Objective Harmony Search for Optimal Placement of DGs in Distribution Systems , 2013, IEEE Transactions on Smart Grid.

[53]  M. Tamer Ayvaz,et al.  Application of Harmony Search algorithm to the solution of groundwater management models , 2009 .

[54]  Huan Liu,et al.  Efficient Feature Selection via Analysis of Relevance and Redundancy , 2004, J. Mach. Learn. Res..

[55]  Y. M. Cheng,et al.  An improved harmony search minimization algorithm using different slip surface generation methods for slope stability analysis , 2008 .

[56]  Zong Woo Geem,et al.  Harmony Search Algorithm for Solving Sudoku , 2007, KES.