An Adaptive Harmony Search Approach for Gene Selection and Classification of High Dimensional Medical Data

Abstract In Bioinformatics, microarray data analysis has gained substantial attention for disease diagnosis. Microarray data is represented with a huge search space which imposes the foremost difficulties in selection of most relevant facts in terms of genes. In this esteem, we have recommended a hybridised harmony search and Pareto optimization approach for feature selection in high dimensional data classification problem. In the first stage an adaptive harmony search algorithm for gene selection with probability distribution factor for optimal gene ranking is implemented. This selection is further refined applying a bi-objective Pareto based feature selection technique to select optimal minimum number of top ranked genes. The importance and relevance of the selected genes are verified through a few numbers of classifiers. Experimental analysis is conducted over four well known microarray datasets. Finally statistical analysis is conducted to prove the superiority of proposed work with two other nature inspired algorithms. Simulation result reveals that the proposed hybridisation is providing high potentiality in both sample classification and feature subset prediction prospective for high dimensional databases.

[1]  Parham Moradi,et al.  Gene selection for microarray data classification using a novel ant colony optimization , 2015, Neurocomputing.

[2]  Thomas Stützle,et al.  Ant Colony Optimization , 2009, EMO.

[3]  G. Nagarajan,et al.  Hybrid Genetic Algorithm for Medical Image Feature Extraction and Selection , 2016 .

[4]  Salwani Abdullah,et al.  Hybridising harmony search with a Markov blanket for gene selection problems , 2014, Inf. Sci..

[5]  S. Kanmani,et al.  A hybrid algorithm using ant and bee colony optimization for feature selection and classification (AC-ABC Hybrid) , 2017, Swarm Evol. Comput..

[6]  Gregory F. Cooper,et al.  A Bayesian method for the induction of probabilistic networks from data , 1992, Machine-mediated learning.

[7]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[8]  Bijan Bihari Misra,et al.  Pipelining the ranking techniques for microarray data classification: A case study , 2016, Appl. Soft Comput..

[9]  Pradipta Kishore Dash,et al.  A self adaptive differential harmony search based optimized extreme learning machine for financial time series prediction , 2014, Swarm Evol. Comput..

[10]  Lale Özbakir,et al.  Training neural networks with harmony search algorithms for classification problems , 2012, Eng. Appl. Artif. Intell..

[11]  Manuel Laguna,et al.  Tabu Search , 1997 .

[12]  Azuraliza Abu Bakar,et al.  Hybrid feature selection based on enhanced genetic algorithm for text categorization , 2016, Expert Syst. Appl..

[13]  Ali Zakerolhosseini,et al.  Unsupervised probabilistic feature selection using ant colony optimization , 2016, Expert Syst. Appl..

[14]  Concha Bielza,et al.  Machine Learning in Bioinformatics , 2008, Encyclopedia of Database Systems.

[15]  Hao Dong,et al.  An improved particle swarm optimization for feature selection , 2011 .

[16]  Hugo Jair Escalante,et al.  Acute leukemia classification by ensemble particle swarm model selection , 2012, Artif. Intell. Medicine.

[17]  Wei Wang,et al.  Harmony search-based multi-objective optimization model for multi-site order planning with multiple uncertainties and learning effects , 2015, Comput. Ind. Eng..

[18]  Adel Al-Jumaily,et al.  Feature subset selection using differential evolution and a statistical repair mechanism , 2011, Expert Syst. Appl..

[19]  Longquan Yong,et al.  A harmony search algorithm for high-dimensional multimodal optimization problems , 2015, Digit. Signal Process..

[20]  João Paulo Papa,et al.  A novel algorithm for feature selection using Harmony Search and its application for non-technical losses detection , 2011, Comput. Electr. Eng..

[21]  Zarita Zainuddin,et al.  An enhanced harmony search based algorithm for feature selection: Applications in epileptic seizure detection and prediction , 2016, Comput. Electr. Eng..

[22]  Madhubanti Maitra,et al.  Gene selection from microarray gene expression data for classification of cancer subgroups employing PSO and adaptive K-nearest neighborhood technique , 2015, Expert Syst. Appl..

[23]  Himansu Sekhar Behera,et al.  A Global-best Harmony Search based Gradient Descent Learning FLANN (GbHS-GDL-FLANN) for data classification , 2016 .

[24]  Yin-Fu Huang,et al.  Music genre classification based on local feature selection using a self-adaptive harmony search algorithm , 2014, Data Knowl. Eng..

[25]  Bijan Bihari Misra,et al.  Gene selection and classification of microarray data: A Pareto DE approach , 2017, Intell. Decis. Technol..

[26]  S. Gunasundari,et al.  Velocity Bounded Boolean Particle Swarm Optimization for improved feature selection in liver and kidney disease diagnosis , 2016, Expert Syst. Appl..

[27]  Hugues Bersini,et al.  A Survey on Filter Techniques for Feature Selection in Gene Expression Microarray Analysis , 2012, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[28]  Sanghamitra Bandyopadhyay,et al.  Unsupervised feature selection using an improved version of Differential Evolution , 2015, Expert Syst. Appl..

[29]  B. B. Misra,et al.  Performance analysis of clustering techniques over microarray data: A case study , 2018 .

[30]  Lei Lu,et al.  Dynamic Genetic Algorithm-based Feature Selection Scheme for Machine Health Prognostics , 2016 .

[31]  Selma Ayse Özel,et al.  A hybrid approach of differential evolution and artificial bee colony for feature selection , 2016, Expert Syst. Appl..

[32]  Pedro Larrañaga,et al.  A review of feature selection techniques in bioinformatics , 2007, Bioinform..

[33]  Kok-Leong Ong,et al.  Feature selection for high dimensional imbalanced class data using harmony search , 2017, Eng. Appl. Artif. Intell..

[34]  Lichao Cao,et al.  Improved particle swarm optimization algorithm and its application in text feature selection , 2015, Appl. Soft Comput..

[35]  Ferat Sahin,et al.  A survey on feature selection methods , 2014, Comput. Electr. Eng..

[36]  Laizhong Cui,et al.  Artificial bee colony algorithm with gene recombination for numerical function optimization , 2017, Appl. Soft Comput..

[37]  Li-Yeh Chuang,et al.  Chaotic maps based on binary particle swarm optimization for feature selection , 2011, Appl. Soft Comput..

[38]  Ash A. Alizadeh,et al.  Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling , 2000, Nature.

[39]  Dong Jian,et al.  Pattern synthesis of MIMO radar based on chaotic differential evolution algorithm , 2017 .

[40]  Mohammad Hossein Moattar,et al.  A hybrid gene selection approach for microarray data classification using cellular learning automata and ant colony optimization. , 2016, Genomics.