ClusterMI: Building Probabilistic Models Using Hierarchical Clustering and Mutual Information

Genetic Algorithms are a class of metaheuristics with applications in several fields including biology, engineering and even arts. However, simple Genetic Algorithms may suffer from exponential scalability on hard problems. Estimation of Distribution Algorithms, a special class of Genetic Algorithms, can build complex models of the iterations among variables in the problem, solving several intractable problems in tractable polynomial time. However, the model building process can be computationally expensive and efficiency enhancements are oftentimes necessary to make tractable problems practical. This paper presents a new model building approach, called ClusterMI, inspired both by the Extended Compact Genetic Algorithm and the Dependency Structure Matrix Genetic Algorithm. The new approach has a more efficient model building process, resulting in speed ups of 10 times for moderate size problems and potentially hundreds of times for large problems. Moreover, the new approach may be easily extended to perform incremental evolution, eliminating the burden of representing the population explicitly.

[1]  John H. Holland,et al.  Adaptation in Natural and Artificial Systems: An Introductory Analysis with Applications to Biology, Control, and Artificial Intelligence , 1992 .

[2]  David E. Goldberg,et al.  Genetic Algorithms in Search Optimization and Machine Learning , 1988 .

[3]  David E. Goldberg,et al.  Genetic Algorithms and Walsh Functions: Part II, Deception and Its Analysis , 1989, Complex Syst..

[4]  Erick Cantu-paz,et al.  Implementing Fast and Flexible Parallel Genetic Algorithms , 1998, Practical Handbook of Genetic Algorithms.

[5]  David E. Goldberg,et al.  Optimizing Global-Local Search Hybrids , 1999, GECCO.

[6]  David E. Goldberg,et al.  The compact genetic algorithm , 1999, IEEE Trans. Evol. Comput..

[7]  Erick Cant,et al.  Designing Efficient And Accurate Parallel Genetic Algorithms , 1999 .

[8]  Fernando G. Lobo,et al.  Extended Compact Genetic Algorithm in C , 1999 .

[9]  Erick Cantú-Paz Designing Efficient and Accurate Parallel Genetic Algorithms , 1999 .

[10]  G. Harik Linkage Learning via Probabilistic Modeling in the ECGA , 1999 .

[11]  J. A. Lozano,et al.  Estimation of Distribution Algorithms: A New Tool for Evolutionary Computation , 2001 .

[12]  David E. Goldberg,et al.  The Design of Innovation: Lessons from and for Competent Genetic Algorithms , 2002 .

[13]  Goldberg,et al.  Genetic algorithms , 1993, Robust Control Systems with Genetic Algorithms.

[14]  David E. Goldberg,et al.  Efficiency Enhancement of Probabilistic Model Building Genetic Algorithms , 2004, ArXiv.

[15]  Riccardo Poli,et al.  Genetic and Evolutionary Computation – GECCO 2004 , 2004, Lecture Notes in Computer Science.

[16]  David E. Goldberg,et al.  Let's Get Ready to Rumble: Crossover Versus Mutation Head to Head , 2004, GECCO.

[17]  Franz Rothlauf,et al.  Evaluation-Relaxation Schemes for Genetic and Evolutionary Algorithms , 2004 .

[18]  David E. Goldberg,et al.  Designing Efficient Genetic and Evolutionary Algorithm Hybrids , 2005 .

[19]  David E. Goldberg,et al.  Sporadic model building for efficiency enhancement of hierarchical BOA , 2006, GECCO '06.

[20]  Sang Joon Kim,et al.  A Mathematical Theory of Communication , 2006 .

[21]  David E. Goldberg,et al.  Conquering hierarchical difficulty by explicit chunking: substructural chromosome compression , 2006, GECCO '06.

[22]  D. Goldberg,et al.  A matrix approach for finding extrema: problems with modularity, hierarchy, and overlap , 2006 .

[23]  David E. Goldberg,et al.  Population sizing for entropy-based model building in discrete estimation of distribution algorithms , 2007, GECCO '07.

[24]  Simon M. Lucas,et al.  Parallel Problem Solving from Nature - PPSN X, 10th International Conference Dortmund, Germany, September 13-17, 2008, Proceedings , 2008, PPSN.

[25]  David E. Goldberg,et al.  iBOA: the incremental bayesian optimization algorithm , 2008, GECCO '08.

[26]  David E. Goldberg,et al.  Enhancing the Efficiency of the ECGA , 2008, PPSN.