Breaking the Curse of Dynamics by Task Migration: Pilot Experiments in the Polder Metacomputer

With the advent of high speed networks, distributed cluster computing and metacomputing have assumed an enormous interest. However, software methods and techniques to make the full potential of these distributed environments available, are not yet mature. In this paper, we focus on dynamic load balancing of resources and applications as one of the crucial techniques to optimize performance in distributed environments. Some design and implementation details are described, and early experimental results are presented.

[1]  Andrew S. Grimshaw,et al.  Legion-a view from 50,000 feet , 1996, Proceedings of 5th IEEE International Symposium on High Performance Distributed Computing.

[2]  Andrew S. Grimshaw,et al.  A federated model for scheduling in wide-area systems , 1996, Proceedings of 5th IEEE International Symposium on High Performance Distributed Computing.

[3]  Peter M. A. Sloot,et al.  Load Balancing by Redundant Decomposition and Mapping , 1996, HPCN Europe.

[4]  Alexander Reinefeld,et al.  A Lightweight Communication Interface for Parallel Programming Environments , 1997, HPCN Europe.

[5]  Ian T. Foster,et al.  Overview of the I-Way: Wide-Area Visual Supercomputing , 1996, Int. J. High Perform. Comput. Appl..

[6]  Ian T. Foster,et al.  Globus: a Metacomputing Infrastructure Toolkit , 1997, Int. J. High Perform. Comput. Appl..

[7]  Miron Livny,et al.  Interfacing Condor and PVM to harness the cycles of workstation clusters , 1996, Future Gener. Comput. Syst..

[8]  Peter M. A. Sloot,et al.  A dynamic load balancing system for parallel cluster computing , 1996, Future Gener. Comput. Syst..

[9]  Jonathan Walpole,et al.  MIST: PVM with Transparent Migration and Checkpointing , 1995 .

[10]  Thomas L. Sterling,et al.  Parallel Supercomputing with Commodity Components , 1997, PDPTA.

[11]  Miron Livny,et al.  Condor-a hunter of idle workstations , 1988, [1988] Proceedings. The 8th International Conference on Distributed.

[12]  Amnon Barak,et al.  Performance of the MOSIX Parallel System for a Cluster of PCs , 1997, HPCN Europe.