GENETIC ALGORITHMS WITH DIVERSITY MEASURES TO BUILD CLASSIFIER SYSTEMS

The combination of classifiers is an active research area of the machine learning and pattern recognition communities. Many t heoretical and empirical studies have been published demonstrating the advantages of the paradigm of combination of classifiers over the individual classifiers. When combining classifiers it is important to guarantee the diversity among them [16]. Some statistical measures can be used to estimate how diverse the ensembles of classifiers are, they are called diversity measures. On the other hand the number of individual classifiers are very big and just a litter group of classifiers can be generated a large number of combinations, so emerge the idea to use one of the meta-heuristics: the genetic algorithms (gas). Genetic algorithms play a significant role as search technique for handling complex spaces in many fields. They are based on the underlying genetic process in biological organisms and on the natural evolution principles of populations. These algorithms process a population of chromosomes, which represent search space solutions, with three operations: selection, crossover and mutation. In this paper some diversity measures are presented and one modulation of genetic algorithm using diversity measures is enunciated and implemented in order to obtain, from all possible combinations of a large number of base classifiers, a combination that ensures greater diversity among the chosen classifiers and the best accuracy in multi classifier system. We also present and discuss the results of applying the implemented system in two fields of application. Finally, the general conclusions are exposed.

[1]  David B. Skalak,et al.  The Sources of Increased Accuracy for Two Proposed Boosting Algorithms , 1996, AAAI 1996.

[2]  Ludmila I. Kuncheva,et al.  Combining Pattern Classifiers: Methods and Algorithms , 2004 .

[3]  Ron Kohavi,et al.  Bias Plus Variance Decomposition for Zero-One Loss Functions , 1996, ICML.

[4]  R. Polikar,et al.  Ensemble based systems in decision making , 2006, IEEE Circuits and Systems Magazine.

[5]  Hai Huang,et al.  Improved genetic algorithm with two-level approximation for truss topology optimization , 2014 .

[6]  D. E. Goldberg,et al.  Genetic Algorithms in Search , 1989 .

[7]  Fabio Roli,et al.  Design of effective neural network ensembles for image classification purposes , 2001, Image Vis. Comput..

[8]  Deepti Rani,et al.  Genetic Algorithms and Their Applications to Water Resources Systems , 2013 .

[9]  Angel Valdéz Posada Hipertensión arterial sistémica , 2010 .

[10]  Bungon Kumphon Genetic Algorithms for Multi-objective Optimization: Application to a Multi-reservoir System in the Chi River Basin, Thailand , 2013, Water Resources Management.

[11]  David E. Goldberg,et al.  Real-coded Genetic Algorithms, Virtual Alphabets, and Blocking , 1991, Complex Syst..

[12]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[13]  Isis Bonet Cruz Modelo para la clasificación de secuencias, en problemas de la bioinformática, usando técnicas de inteligencia artificial , 2008 .

[14]  B. Everitt,et al.  Statistical methods for rates and proportions , 1973 .

[15]  Richard Frans,et al.  Sizing, Shape, and Topology Optimizations of Roof Trusses Using Hybrid Genetic Algorithms , 2014 .

[16]  Padraig Cunningham,et al.  Diversity versus Quality in Classification Ensembles Based on Feature Selection , 2000, ECML.

[17]  Sang-Youl Lee,et al.  An Advanced Coupled Genetic Algorithm for Identifying Unknown Moving Loads on Bridge Decks , 2014 .

[18]  Jim Antonisse,et al.  A New Interpretation of Schema Notation that Overtums the Binary Encoding Constraint , 1989, ICGA.

[19]  Nicholas J. Radcliffe,et al.  Non-Linear Genetic Representations , 1992, PPSN.

[20]  M. T. Aboul-Ela,et al.  Optimal Removal of Heavy Metals Pollutants from Groundwater Using a Real Genetic Algorithm and Finite-Difference Method , 2013 .

[21]  Bartosz Krawczyk,et al.  Diversity measures for one-class classifier ensembles , 2014, Neurocomputing.

[22]  Ludmila I. Kuncheva,et al.  Measures of Diversity in Classifier Ensembles and Their Relationship with the Ensemble Accuracy , 2003, Machine Learning.

[23]  Piotr Ponikowski,et al.  2007 Guidelines for the management of arterial hypertension: The Task Force for the Management of Arterial Hypertension of the European Society of Hypertension (ESH) and of the European Society of Cardiology (ESC). , 2007, European heart journal.

[24]  M. Basu,et al.  Fuel constrained economic emission dispatch using nondominated sorting genetic algorithm-II , 2014 .

[25]  D. Partridge Distinct Failure Diversity in Multiversion SoftwareDerek Partridge , 1997 .

[26]  Siripen Wikaisuksakul,et al.  A multi-objective genetic algorithm with fuzzy c-means for automatic data clustering , 2014, Appl. Soft Comput..

[27]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[28]  Tariq Rana,et al.  A SIMPLIFIED MULTI‐OBJECTIVE GENETIC ALGORITHM OPTIMIZATION MODEL FOR CANAL SCHEDULING , 2012 .

[29]  Derek Partridge,et al.  Software Diversity: Practical Statistics for Its Measurement and Exploitation | Draft Currently under Revision , 1996 .

[30]  A. F. Utset,et al.  Hipertensión arterial sistémica en 3 gatos , 2004 .

[31]  Thomas G. Dietterich Multiple Classifier Systems , 2000, Lecture Notes in Computer Science.