A parallel block partitioning method to solve a tri diagonal system of linear equations is adapted to the BBN Butterfly multiprocessor. A performance analysis of the programming experiments on the 32-node Butterfly is presented. An upper bound on the number of pro cessors to achieve the best performance with this method is derived. The computational results verify the theoretical speedup and efficiency results of the parallel algorithm over its serial counterpart. Also included is a study comparing performance runs of the same code on the Butterfly processor with a hardware floating point unit and on one with a software floating point facility. The total parallel time of the given code is considerably reduced by making use of the hardware floating point facility whereas the speedup and efficiency of the par allel program considerably improve on the system with software floating point capability. The achieved results are shown to be within 82% to 90% of the predicted performance.
[1]
David S. Kershaw,et al.
Solution of Single Tridiagonal Linear Systems and Vectorization of the ICCG Algorithm on the Cray-1
,
1982
.
[2]
S. P. Kumar,et al.
Parallel algorithms for recurrence and tridiagonal equations
,
1985
.
[3]
Guang R. Gao,et al.
A Maximally Pipelined Tridiagonal Linear Equation Solver
,
1986,
J. Parallel Distributed Comput..
[4]
H. H. Wang,et al.
A Parallel Method for Tridiagonal Equations
,
1981,
TOMS.
[5]
Jack J. Dongarra,et al.
Solving banded systems on a parallel processor
,
1987,
Parallel Comput..
[6]
Harold S. Stone,et al.
Parallel Tridiagonal Equation Solvers
,
1975,
TOMS.