A hybrid evolutionary algorithm for promoter recognition

The aim of this study is to identify the smallest possible set of features and optimal model parameters for promoter recognition. Particularly, we propose a novel hybrid evolutionary algorithm, which integrates Markov blanket-embedded genetic algorithm (MBEGA), comprehensive learning particle swarm optimize (CLPSO), and support vector machine (SVM) as a whole. This method adopts MBEGA for promoter feature selection while employs CLPSO to optimize the parameters of the promoter identification model. Empirical results on the eukaryotic promoter database (EPD) suggest that, our proposed approach is able to obtain better or competitive classification accuracy than other methods and it is effective and efficient in eliminating irrelevant and redundant features in training process.

[1]  Houkuan Huang,et al.  Feature selection for text classification with Naïve Bayes , 2009, Expert Syst. Appl..

[2]  Zne-Jung Lee,et al.  Parameter determination of support vector machine and feature selection using simulated annealing approach , 2008, Appl. Soft Comput..

[3]  Sohail Asghar,et al.  A REVIEW OF FEATURE SELECTION TECHNIQUES IN STRUCTURE LEARNING , 2013 .

[4]  Jing J. Liang,et al.  Comprehensive learning particle swarm optimizer for global optimization of multimodal functions , 2006, IEEE Transactions on Evolutionary Computation.

[5]  Evaluation of Prediction Accuracy of Genefinders Using Mouse Genomic DNA , 2011 .

[6]  Yanqing Zhang,et al.  A genetic algorithm-based method for feature subset selection , 2008, Soft Comput..

[7]  Geoffrey J McLachlan,et al.  Selection bias in gene extraction on the basis of microarray gene-expression data , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Daphne Koller,et al.  Toward Optimal Feature Selection , 1996, ICML.

[9]  Zexuan Zhu,et al.  Markov blanket-embedded genetic algorithm for gene selection , 2007, Pattern Recognit..

[10]  Pedro Larrañaga,et al.  A review of feature selection techniques in bioinformatics , 2007, Bioinform..

[11]  Huan Liu,et al.  Redundancy based feature selection for microarray data , 2004, KDD.

[12]  Erik Lux Feature selection for text classification with Naive Bayes , 2012 .

[14]  Wenqian Shang,et al.  A novel feature selection algorithm for text categorization , 2007, Expert Syst. Appl..

[15]  Cheng-Lung Huang,et al.  A GA-based feature selection and parameters optimizationfor support vector machines , 2006, Expert Syst. Appl..