Is the island model fault tolerant?

In this paper, we present a study on the fault tolerance nature of the island model when applied to Genetic Algorithms. Parallel and distributed models have been extensively applied to GAs when researchers tackle hard problems. The idea is both to reduce computing time while also improving diversity of populations and therefore quality of solutions. Nevertheless, there are few works dealing with the problem of faults that are usually present when a distributed infrastructure is employed for running the parallel algorithm. This paper studies the behavior of the Island Model when faults appear on a parallel computer or a network of computers. Two benchmark problems have been employed, and good results obtained for each of them allow us to reliably consider Island Model as a fault tolerant parallel algorithm.

[1]  Ewing Lusk,et al.  Fault Tolerance in MPI Programs , 2009 .

[2]  Marco Tomassini,et al.  Spatially Structured Evolutionary Algorithms: Artificial Evolution in Space and Time (Natural Computing Series) , 2005 .

[3]  William Gropp,et al.  Users guide for mpich, a portable implementation of MPI , 1996 .

[4]  James A. Foster,et al.  Fault-tolerant computing with N-version genetic programming , 2001 .

[5]  David E. Goldberg,et al.  Efficient Parallel Genetic Algorithms: Theory and Practice , 2000 .

[6]  Francisco Fernündez de Vega Parallel Genetic Programming , 2005, Handbook of Bioinspired Algorithms and Applications.

[7]  Ben Paechter,et al.  A Framework for Distributed Evolutionary Algorithms , 2002, PPSN.

[8]  L. Darrell Whitley,et al.  Island Model genetic Algorithms and Linearly Separable Problems , 1997, Evolutionary Computing, AISB Workshop.

[9]  Partha Dasgupta,et al.  CALYPSO: a novel software system for fault-tolerant parallel processing on distributed platforms , 1995, Proceedings of the Fourth IEEE International Symposium on High Performance Distributed Computing.

[10]  Daniel Lombraña Gonzalez,et al.  On the Intrinsic Fault-Tolerance Nature of Parallel Genetic Programming , 2007, 15th EUROMICRO International Conference on Parallel, Distributed and Network-Based Processing (PDP'07).

[11]  Marc Parizeau,et al.  Distributed Beagle: An Environment For Parallel And Distributed Evolutionary Computations , 2003 .

[12]  Roy Friedman,et al.  Starfish: Fault-Tolerant Dynamic MPI Programs on Clusters of Workstations , 1999, Proceedings. The Eighth International Symposium on High Performance Distributed Computing (Cat. No.99TH8469).

[13]  T.C. Fogarty,et al.  A Distributed Resource Evolutionary Algorithm Machine (DREAM) , 2000, Proceedings of the 2000 Congress on Evolutionary Computation. CEC00 (Cat. No.00TH8512).

[14]  John J. Grefenstette,et al.  A Parallel Genetic Algorithm , 1987, ICGA.

[15]  David H. Wolpert,et al.  No free lunch theorems for optimization , 1997, IEEE Trans. Evol. Comput..

[16]  William Gropp,et al.  User's Guide for mpich, a Portable Implementation of MPI Version 1.2.2 , 1996 .

[17]  David E. Goldberg,et al.  Are Multiple Runs of Genetic Algorithms Better than One? , 2003, GECCO.

[18]  Zbigniew Michalewicz,et al.  Genetic Algorithms + Data Structures = Evolution Programs , 1996, Springer Berlin Heidelberg.

[19]  Ben Paechter,et al.  Maintaining Connectivity in a Scalable and Robust Distributed Environment , 2002, 2nd IEEE/ACM International Symposium on Cluster Computing and the Grid (CCGRID'02).

[20]  Y. Ho,et al.  Simple Explanation of the No-Free-Lunch Theorem and Its Implications , 2002 .

[21]  David P. Anderson,et al.  BOINC: a system for public-resource computing and storage , 2004, Fifth IEEE/ACM International Workshop on Grid Computing.

[22]  William H. Sanders,et al.  A performability-oriented software rejuvenation framework for distributed applications , 2005, 2005 International Conference on Dependable Systems and Networks (DSN'05).

[23]  Miron Livny,et al.  Checkpoint and Migration of UNIX Processes in the Condor Distributed Processing System , 1997 .