Paper information - On the Impact of Communication Latencies on Distributed Sparse LU Factorization.

Sparse LU factorization offers some potential for parallelism, but only at a very fine granularity. However, most current distributed-memory MIMD architectures have communication latencies too high to exploit all of the available parallelism. To cope with this, latencies must be avoided by coarsening the granularity and by message fusion. However, both techniques limit concurrency and thereby reduce scalability. In this paper, an implementation of a parallel LU decomposition algorithm for linear programming bases is presented for distributed-memory parallel computers with noticeable communication latencies. Several design decisions due to latencies, including data distribution and load balancing techniques, are discussed. An approximate performance model is set up for the algorithm, which makes it possible to quantify the impact of latencies on its performance. Finally, experimental results for an Intel iPSC/860 parallel computer are reported and discussed.
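
The effect of message fusion mentioned in the abstract can be illustrated with a minimal sketch. It assumes the common linear cost model (startup latency plus size divided by bandwidth), which is not necessarily the performance model used in the paper; the function names and all parameter values are hypothetical and are not measurements of the iPSC/860.

```python
def separate_transfer_time(n_messages, bytes_per_message, latency_s, bandwidth_bytes_per_s):
    """Time to send n_messages one by one: every message pays the startup latency."""
    return n_messages * (latency_s + bytes_per_message / bandwidth_bytes_per_s)


def fused_transfer_time(n_messages, bytes_per_message, latency_s, bandwidth_bytes_per_s):
    """Time to send the same payload as one fused message: the latency is paid once."""
    return latency_s + n_messages * bytes_per_message / bandwidth_bytes_per_s


# Hypothetical parameters chosen only for illustration (assumptions, not measured values).
latency = 100e-6      # startup latency per message, in seconds
bandwidth = 2.5e6     # sustained bandwidth, in bytes per second
n, size = 50, 64      # 50 small messages of 64 bytes each

separate = separate_transfer_time(n, size, latency, bandwidth)
fused = fused_transfer_time(n, size, latency, bandwidth)
print(f"separate: {separate * 1e3:.2f} ms, fused: {fused * 1e3:.2f} ms")
```

Under this model, fusing n messages saves (n - 1) startup latencies, but the receiver cannot start work until the whole fused message has arrived, which is one way to see the loss of concurrency the abstract refers to.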

Hans-Christian Hege | Roland Wunderling | Martin Grammel

[1] P. Sadayappan, et al. Communication reduction for distributed sparse matrix factorization on a processor mesh, 1989, Proceedings of the 1989 ACM/IEEE Conference on Supercomputing (Supercomputing '89).

[2] T. Davis, et al. A nondeterministic parallel algorithm for general unsymmetric sparse LU factorization, 1990.

[3] Michael T. Heath, et al. Sparse Cholesky factorization on a local-memory multiprocessor, 1988.

[4] Andrzej M. Goscinski, et al. Distributed operating systems - the logical design, 1991.

[5] Iain S. Duff, et al. The Multifrontal Solution of Unsymmetric Sets of Linear Equations, 1984.

[6] David W. Krumme, et al. Gossiping in Minimal Time, 1992, SIAM J. Comput.

[7] Michael T. Heath, et al. Parallel Algorithms for Sparse Linear Systems, 1991, SIAM Rev.

[8] Uwe H. Suhl, et al. Computing Sparse LU Factorizations for Large-Scale Linear Programming Bases, 1990, INFORMS J. Comput.

[9] Iain S. Duff, et al. Parallel implementation of multifrontal schemes, 1986, Parallel Comput.

[10] I. Duff, et al. Direct Methods for Sparse Matrices, 1987.

[11] M. Yannakakis. Computing the Minimum Fill-in is NP-Complete, 1981.

[12] J. G. G. Vorst, et al. Parallel Sparse LU Decomposition on a Mesh Network of Transputers, 1993, SIAM J. Matrix Anal. Appl.