Messages Scheduling for Data Redistribution Between Heterogeneous Clusters

In this paper, we tackle the problem of redistributing data between clusters connected by a backbone. On distributed environments, communications often take more time [12] and thus lead to worse results than on local clusters. There is therefore a strong need to optimize the time needed by communications.Indeed, when an application composed of several codes running on distant clusters is executing, data are required to be redistributed between the clusters. We propose a general solution to the problem when the platform is fully heterogeneous platforms (each node of each cluster can communicate at different speed) or when some nodes have several network interface cards. We provide an algorithm for scheduling the messages that gives a solution at most twice as long as the optimal one. Simulation results show that it is giving almost optimal schedules on large redistribution patterns, and very good results in the general case.

[1]  André Ribes,et al.  A Parallel Corba Component Model for Numerical Code Coupling , 2002, Int. J. High Perform. Comput. Appl..

[2]  Susan M. Mniszewski,et al.  PAWS: collective interactions and data transfers , 2001, Proceedings 10th IEEE International Symposium on High Performance Distributed Computing.

[3]  Stéphane Genaud,et al.  Seismic Ray-Tracing and Earth Mesh Modeling on Various Parallel Architectures , 2004, The Journal of Supercomputing.

[4]  Hyeong-Ah Choi,et al.  Efficient scheduling of transmissions in optical broadcast networks , 1996, TNET.

[5]  George N. Rouskas,et al.  On the design of optimal TDM schedules for broadcast WDM networks with arbitrary transceiver tuning latencies , 1996, Proceedings of IEEE INFOCOM '96. Conference on Computer Communications.

[6]  Emmanuel Jeannot,et al.  Messages Scheduling for Data Redistribution between Clusters , 2003, PPAM.

[7]  Emmanuel Jeannot,et al.  Two fast and efficient message scheduling algorithms for data redistribution through a backbone , 2004, 18th International Parallel and Distributed Processing Symposium, 2004. Proceedings..

[8]  G. Bongiovanni,et al.  An Optimal Switching Algorithm for Multibeam Satellite Systems with Variable Bandwidth Beams , 1982, IEEE Trans. Commun..

[9]  Aura Ganz,et al.  A time-wavelength assignment algorithm for a WDM star network , 1992, [Proceedings] IEEE INFOCOM '92: The Conference on Computer Communications.

[10]  Jack Dongarra,et al.  ScaLAPACK Users' Guide , 1987 .

[11]  D. C. Cox,et al.  An Optimum Time Slot Assignment Algorithm for an SS/TDMA System with Variable Number of Transponders , 1981 .

[12]  James Arthur Kohl,et al.  Cumulvs: Providing Fault Toler. Ance, Visualization, and Steer Ing of Parallel Applications , 1996, Int. J. High Perform. Comput. Appl..

[13]  Viktor K. Prasanna,et al.  Block‐cyclic redistribution over heterogeneous networks , 2004, Cluster Computing.

[14]  Frédéric Wagner,et al.  Redistribution de données à travers un réseau à haut débit. (Data Redistribution through High Performance Networks) , 2005 .

[15]  Chak-Kuen Wong,et al.  Minimizing the Number of Switchings in an SS/TDMA System , 1985, IEEE Trans. Commun..

[16]  Galen H. Sasaki,et al.  Scheduling transmissions in WDM broadcast-and-select networks , 1994, TNET.

[17]  Yves Robert,et al.  Scheduling Block-Cyclic Array Redistribution , 1998, IEEE Trans. Parallel Distributed Syst..

[18]  Bernard Tourancheau,et al.  Efficient Block Cyclic Data Redistribution , 1996, Euro-Par, Vol. I.