HColonies: a new hybrid metaheuristic for medical data classification

Medical data feature a number of characteristics that make their classification a complex task. Yet, the societal significance of the subject and the computational challenge it presents has caused the classification of medical datasets to be a popular research area. A new hybrid metaheuristic is presented for the classification task of medical datasets. The hybrid ant–bee colonies (HColonies) consists of two phases: an ant colony optimization (ACO) phase and an artificial bee colony (ABC) phase. The food sources of ABC are initialized into decision lists, constructed during the ACO phase using different subsets of the training data. The task of the ABC is to optimize the obtained decision lists. New variants of the ABC operators are proposed to suit the classification task. Results on a number of benchmark, real-world medical datasets show the usefulness of the proposed approach. Classification models obtained feature good predictive accuracy and relatively small model size.

[1]  William B. Langdon,et al.  Fitness Causes Bloat in Variable Size Representations , 1997 .

[2]  James Kennedy,et al.  Particle swarm optimization , 2002, Proceedings of ICNN'95 - International Conference on Neural Networks.

[3]  Luca Maria Gambardella,et al.  Ant colony system: a cooperative learning approach to the traveling salesman problem , 1997, IEEE Trans. Evol. Comput..

[4]  Dervis Karaboga,et al.  A survey: algorithms simulating bee swarm intelligence , 2009, Artificial Intelligence Review.

[5]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[6]  Graham Kendall,et al.  Population based Local Search for university course timetabling problems , 2013, Applied Intelligence.

[7]  Dušan Teodorović,et al.  Bee Colony Optimization – a Cooperative Learning Approach to Complex Transportation Problems , 2005 .

[8]  Monique Snoeck,et al.  Classification With Ant Colony Optimization , 2007, IEEE Transactions on Evolutionary Computation.

[9]  Mohd Afizi Mohd Shukran,et al.  Artificial bee colony based data mining algorithms for classification tasks , 2011 .

[10]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[11]  Wan Chul Yoon,et al.  A classification method using a hybrid genetic algorithm combined with an adaptive procedure for the pool of ellipsoids , 2006, Applied Intelligence.

[12]  Tiago Ferra de Sousa,et al.  Particle Swarm based Data Mining Algorithms for classification tasks , 2004, Parallel Comput..

[13]  Blaise Hanczar,et al.  The reliability of estimated confidence intervals for classification error rates when only a single sample is available , 2013, Pattern Recognit..

[14]  Chidchanok Lursinsap,et al.  Optimizing the modified fuzzy ant-miner for efficient medical diagnosis , 2011, Applied Intelligence.

[15]  Graham Kendall,et al.  A graph coloring constructive hyper-heuristic for examination timetabling problems , 2012, Applied Intelligence.

[16]  Panos Liatsis,et al.  A robust missing value imputation method for noisy data , 2010, Applied Intelligence.

[17]  Francisco Herrera,et al.  A study of statistical techniques and performance measures for genetics-based machine learning: accuracy and interpretability , 2009, Soft Comput..

[18]  Thomas Stützle,et al.  MAX-MIN Ant System , 2000, Future Gener. Comput. Syst..

[19]  William W. Cohen Fast Effective Rule Induction , 1995, ICML.

[20]  Dervis Karaboga,et al.  AN IDEA BASED ON HONEY BEE SWARM FOR NUMERICAL OPTIMIZATION , 2005 .

[21]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .

[22]  Ester Bernadó-Mansilla,et al.  A Comparative Study of Several Genetic-Based Supervised Learning Systems , 2008, Learning Classifier Systems in Data Mining.

[23]  Moshe Sipper,et al.  Evolutionary computation in medicine: an overview , 2000, Artif. Intell. Medicine.

[24]  Nurhan Karaboga,et al.  The parameter extraction of the thermally annealed Schottky barrier diode using the modified artificial bee colony , 2012, Applied Intelligence.

[25]  Alex Alves Freitas,et al.  cAnt-Miner: An Ant Colony Classification Algorithm to Cope with Continuous Attributes , 2008, ANTS Conference.

[26]  Aníbal R. Figueiras-Vidal,et al.  Pattern classification with missing data: a review , 2010, Neural Computing and Applications.

[27]  Bing Yu,et al.  Missing data analyses: a hybrid multiple imputation algorithm using Gray System Theory and entropy based on clustering , 2013, Applied Intelligence.

[28]  Dervis Karaboga,et al.  Artificial Bee Colony (ABC) Optimization Algorithm for Training Feed-Forward Neural Networks , 2007, MDAI.

[29]  Vidroha Debroy,et al.  Genetic Programming , 1998, Lecture Notes in Computer Science.

[30]  M. Stone Cross‐Validatory Choice and Assessment of Statistical Predictions , 1976 .

[31]  Alex Alves Freitas,et al.  Data mining with an ant colony optimization algorithm , 2002, IEEE Trans. Evol. Comput..

[32]  Antonio González Muñoz,et al.  SLAVE: a genetic learning system based on an iterative approach , 1999, IEEE Trans. Fuzzy Syst..

[33]  Marco Dorigo,et al.  Optimization, Learning and Natural Algorithms , 1992 .

[34]  Pedro Larrañaga,et al.  Machine learning: an indispensable tool in bioinformatics. , 2010, Methods in molecular biology.

[35]  Stewart W. Wilson Classifier Fitness Based on Accuracy , 1995, Evolutionary Computation.

[36]  Bart Baesens,et al.  To tune or not to tune: rule evaluation for metaheuristic-based sequential covering algorithms , 2013, Data Mining and Knowledge Discovery.

[37]  C. D. Gelatt,et al.  Optimization by Simulated Annealing , 1983, Science.

[38]  María José del Jesús,et al.  KEEL: a software tool to assess evolutionary algorithms for data mining problems , 2008, Soft Comput..

[39]  D. Pham,et al.  THE BEES ALGORITHM, A NOVEL TOOL FOR COMPLEX OPTIMISATION PROBLEMS , 2006 .

[40]  Brijesh Verma,et al.  Hybrid ensemble approach for classification , 2011, Applied Intelligence.

[41]  Erik Valdemar Cuevas Jiménez,et al.  A multi-threshold segmentation approach based on Artificial Bee Colony optimization , 2012, Applied Intelligence.

[42]  K. S. Chaudhuri,et al.  genetic algorithm-based rule extraction system ikash , 2011 .

[43]  Ron Kohavi,et al.  A Study of Cross-Validation and Bootstrap for Accuracy Estimation and Model Selection , 1995, IJCAI.

[44]  Dervis Karaboga,et al.  A powerful and efficient algorithm for numerical function optimization: artificial bee colony (ABC) algorithm , 2007, J. Glob. Optim..

[45]  Ian H. Witten,et al.  Generating Accurate Rule Sets Without Global Optimization , 1998, ICML.

[46]  M. Beekman,et al.  Honeybee Optimisation – An Overview and a New Bee Inspired Optimisation Scheme , 2011 .

[47]  Ester Bernadó-Mansilla,et al.  Accuracy-Based Learning Classifier Systems: Models, Analysis and Applications to Classification Tasks , 2003, Evolutionary Computation.

[48]  Christian Blum,et al.  Hybrid Metaheuristics, An Emerging Approach to Optimization , 2008, Hybrid Metaheuristics.

[49]  K. Frisch The dance language and orientation of bees , 1967 .

[50]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[51]  Masafumi Hagiwara,et al.  Bee System: Finding Solution by a Concentrated Search , 1998 .

[52]  Nada Lavrac,et al.  Selected techniques for data mining in medicine , 1999, Artif. Intell. Medicine.

[53]  Tong Heng Lee,et al.  Evolutionary computing for knowledge discovery in medical diagnosis , 2003, Artif. Intell. Medicine.

[54]  金鹏,et al.  Classification rule mining based on ant colony optimization algorithm , 2006 .

[55]  Alan F. Murray,et al.  IEEE International Conference on Neural Networks , 1997 .

[56]  John R. Koza,et al.  Genetic programming (videotape): the movie , 1992 .