Piecewise Cubic Interpolation on Distributed Memory Parallel Computers and Clusters of Workstations

This paper presents two new portable, high-performance implementations of routines for piecewise cubic interpolation. The first, a sequential implementation, is based on LAPACK routines; the second, based on ScaLAPACK, is designed for distributed memory parallel computers and clusters. The results of experiments performed on a cluster of twenty Itanium 2 processors and on a Cray X1 are also presented and briefly discussed.
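
For orientation, the following is a minimal sketch of one common form of piecewise cubic interpolation, the natural cubic spline, whose coefficients come from a symmetric tridiagonal linear system. It is an illustration under stated assumptions, not the implementation described in the paper: the abstract does not say which variant of piecewise cubic interpolation or which LAPACK routines are used. The function names `natural_spline_second_derivs` and `spline_eval` are hypothetical, and a plain Thomas elimination stands in for the tridiagonal solve that a LAPACK-based code would presumably delegate to a routine such as DPTSV or DGTSV.

```python
from bisect import bisect_right


def natural_spline_second_derivs(x, y):
    """Return second derivatives m[0..n] of the natural cubic spline.

    Assumes x is strictly increasing and len(x) == len(y) >= 2.
    The natural end conditions m[0] = m[n] = 0 are an assumption of this sketch.
    """
    n = len(x) - 1
    h = [x[i + 1] - x[i] for i in range(n)]
    m = [0.0] * (n + 1)
    if n < 2:
        return m  # a single interval degenerates to a straight line

    # Tridiagonal system for the interior unknowns m[1..n-1].
    sub = [h[i - 1] for i in range(1, n)]
    diag = [2.0 * (h[i - 1] + h[i]) for i in range(1, n)]
    sup = [h[i] for i in range(1, n)]
    rhs = [
        6.0 * ((y[i + 1] - y[i]) / h[i] - (y[i] - y[i - 1]) / h[i - 1])
        for i in range(1, n)
    ]

    # Thomas algorithm: forward elimination. A LAPACK-based code would
    # presumably call a tridiagonal solver here instead (assumption).
    k = n - 1
    for j in range(1, k):
        w = sub[j] / diag[j - 1]
        diag[j] -= w * sup[j - 1]
        rhs[j] -= w * rhs[j - 1]

    # Back substitution.
    sol = [0.0] * k
    sol[k - 1] = rhs[k - 1] / diag[k - 1]
    for j in range(k - 2, -1, -1):
        sol[j] = (rhs[j] - sup[j] * sol[j + 1]) / diag[j]

    m[1:n] = sol
    return m


def spline_eval(x, y, m, t):
    """Evaluate the spline at t, using the cubic piece whose interval holds t."""
    i = min(max(bisect_right(x, t) - 1, 0), len(x) - 2)
    h = x[i + 1] - x[i]
    a = x[i + 1] - t
    b = t - x[i]
    return (
        (m[i] * a**3 + m[i + 1] * b**3) / (6.0 * h)
        + (y[i] / h - m[i] * h / 6.0) * a
        + (y[i + 1] / h - m[i + 1] * h / 6.0) * b
    )
```

A typical use computes the second derivatives once with `m = natural_spline_second_derivs(x, y)` and then calls `spline_eval(x, y, m, t)` for each query point `t`. The tridiagonal solve is the step that a library-based implementation would hand off to a linear algebra package.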