The implementation of dynamite: an environment for migrating PVM tasks

Parallel programming on clusters of workstations is increasingly attractive, but dynamic load balancing is needed to make efficient use of the available resources. Dynamite provides dynamic load balancing for PVM applications running under Linux and Solaris. It supports migration of individual tasks between nodes in a manner transparent both to the application programmer and to the user, implemented entirely in user space. Dynamically linked executables are supported, as are tasks with open files and with direct PVM connections. In this paper, we describe the technical aspects of migrating message-passing tasks.

[1]  Sandra Loosemore,et al.  The GNU C Library Reference Manual , 2001 .

[2]  Henryk Krawczyk,et al.  Dynamic Assignment with Process Migration in Distributed Environments , 1999, PVM/MPI.

[3]  Weng-Fai Wong,et al.  tmPVM-task migratable PVM , 1999, Proceedings 13th International Parallel Processing Symposium and 10th Symposium on Parallel and Distributed Processing. IPPS/SPDP 1999.

[4]  Peter M. A. Sloot,et al.  A dynamic load balancing system for parallel cluster computing , 1996, Future Gener. Comput. Syst..

[5]  Miron Livny,et al.  Checkpoint and Migration of UNIX Processes in the Condor Distributed Processing System , 1997 .

[6]  William Gropp,et al.  Users guide for mpich, a portable implementation of MPI , 1996 .

[7]  Georg Stellner,et al.  CoCheck: checkpointing and process migration for MPI , 1996, Proceedings of International Conference on Parallel Processing.

[8]  Miron Livny,et al.  Managing Checkpoints for Parallel Programs , 1996, JSSPP.

[9]  Dan Pei,et al.  Quasi-asynchronous migration: a novel migration protocol for PVM tasks , 1999, OPSR.

[10]  Amnon Barak,et al.  Scalable Cluster Computing with MOSIX for LINUX , 1999 .

[11]  Peter M. A. Sloot,et al.  Experiments in Dynamic Load Balancing for Parallel Cluster Computing , 1995 .

[12]  Peter M. A. Sloot,et al.  DynamicPVM - Dynamic Load Balancing on Parallel Systems , 1994, HPCN.

[13]  Jonathan Robinson,et al.  A task migration implementation of the Message-Passing Interface , 1996, Proceedings of 5th IEEE International Symposium on High Performance Distributed Computing.

[14]  Jack Dongarra,et al.  PVM: Parallel virtual machine: a users' guide and tutorial for networked parallel computing , 1995 .

[15]  Kai Li,et al.  Libckpt: Transparent Checkpointing under UNIX , 1995, USENIX.

[16]  Jonathan Walpole,et al.  MPVM: A Migration Transparent Version of PVM , 1995, Comput. Syst..

[17]  Peter M. A. Sloot,et al.  Dynamite - Blasting Obstacles to Parallel Cluster Computing , 1999, HPCN Europe.