Parallel LU Decomposition on a Transputer Network

A parallel algorithm is derived for LU decomposition with partial pivoting on a local-memory multiprocessor. A general Cartesian data distribution scheme is presented which contains many of the existing distribution schemes as special cases. This scheme is used to prove optimality of load balance for the grid distribution. Experimental results of an implementation of the algorithm in occam-2 on a square mesh of 36 transputers show an efficiency of 88% and a speed of 21.5 Mflop/s for a matrix of size n=1000.

[1]  S. Lennart Johnsson,et al.  Communication Efficient Basic Linear Algebra Computations on Hypercube Architectures , 1987, J. Parallel Distributed Comput..

[2]  A. Burns,et al.  Programming in Occam 2 , 1988 .

[3]  Ilse C. F. Ipsen,et al.  Complexity of dense linear system solution on a multiprocessor ring. Research report , 1986 .

[4]  L. D. J. C. Loyens,et al.  The Formal Construction of a Parallel Triangular System Solver , 1989, MPC.

[5]  永田 守男,et al.  Verifying Properties of Parallel Programs : An Axiomatic Approach , 1976 .

[6]  Jack J. Dongarra,et al.  The LINPACK Benchmark: An Explanation , 1988, ICS.

[7]  Jack J. Dongarra,et al.  Solving banded systems on a parallel processor , 1987, Parallel Comput..

[8]  J. M. Ortega,et al.  The ijk forms of factorization methods II. Vector computers , 1988, Parallel Comput..

[9]  Gene H. Golub,et al.  Matrix computations , 1983 .

[10]  Peter R. Cappello,et al.  Gaussian Elimination on a Hypercube Automaton , 1987, J. Parallel Distributed Comput..

[11]  Edsger W. Dijkstra,et al.  A Discipline of Programming , 1976 .

[12]  Charles H. Romine,et al.  $LU$ Factorization Algorithms on Distributed-Memory Multiprocessor Architectures , 1988 .

[13]  Senol Utku,et al.  A parallel Householder tridiagonalization stratagem using scattered square decomposition , 1988, Parallel Comput..

[14]  G. C. Fox,et al.  Solving Problems on Concurrent Processors , 1988 .

[15]  George J Davis,et al.  Column LU factorization with pivoting on a message-passing multiprocessor , 1986 .

[16]  David Gries,et al.  The Science of Programming , 1981, Text and Monographs in Computer Science.

[17]  Alan George,et al.  Gaussian elimination with partial pivoting and load balancing on a multiprocessor , 1987, Parallel Comput..

[18]  Y. Saad,et al.  Parallel Direct Methods for Solving Banded Linear Systems. , 1987 .