Distributed Scheduling of Unstructured Collective Communication on the CM-5

Parallelization of scientific applications often results in unstructured collective communication. In this paper, we present a distributed algorithm for scheduling such communication on parallel machines. We describe the performance of this algorithm on the CM-5 and show that the scheduling algorithm gives a significant improvement over naive methods.

[1]  R. Walters,et al.  Solution algorithms for the two-dimensional Euler equations on unstructured meshes , 1990 .

[2]  Geoffrey C. Fox,et al.  Solving problems on concurrent processors: vol. 2 , 1990 .

[3]  S. Eisenstat,et al.  An experimental study of methods for parallel preconditioned Krylov methods , 1989, C3P.

[4]  David W. Walker,et al.  Characterizing the Parallel Performance of a Large-scale, Particle-in-cell Plasma Simulation Code , 1990, Concurr. Pract. Exp..

[5]  Sanjay Ranka,et al.  Static and Runtime Scheduling of Unstructured Communication , 1993 .

[6]  Lionel M. Ni,et al.  A survey of wormhole routing techniques in direct networks , 1993, Computer.

[7]  Geoffrey C. Fox,et al.  Benchmarking the CM-5 multicomputer , 1992, [Proceedings 1992] The Fourth Symposium on the Frontiers of Massively Parallel Computation.

[8]  D. Mavriplis Three dimensional unstructured multigrid for the Euler equations , 1991 .

[9]  Y. Saad,et al.  Communication complexity of the Gaussian elimination algorithm on multiprocessors , 1986 .

[10]  Yousef Saad,et al.  Solving Sparse Triangular Linear Systems on Parallel Computers , 1989, Int. J. High Speed Comput..

[11]  William J. Dally,et al.  Deadlock-Free Message Routing in Multiprocessor Interconnection Networks , 1987, IEEE Transactions on Computers.

[12]  Sanjay Ranka,et al.  Personalized Communication Avoiding Node Contention on Distributed Memory Systems , 1993, 1993 International Conference on Parallel Processing - ICPP'93.

[13]  Viktor K. Decyk,et al.  A general concurrent algorithm for plasma particle-in-cell simulation codes , 1989 .

[14]  Geoffrey C. Fox,et al.  Parallel Genetic Algorithms with Application to Load Balancing for Parallel Computing , 1991 .