Hierarchical Load Balancing for Parallel Fast Legendre Transforms

We present a parallel Fast Legendre Transform (FLT) based on the Driscol--Healy algorithm with computation complexity O(N log 2 N ). The parallel FLT is load-- balanced in a hierarchical fashion. We use a load--balanced FFT to deduce a load-- balanced parallel fast cosine transform, which in turn serves as a building block for the Legendre transform engine, from which the parallel FLT is constructed. We demonstrate how the arithmetic, memory and communication complexities of the parallel FLT are hierarchically derived via the complexity of its modular blocks. 1 Introduction Legendre transforms are ubiquitous in many disciplines of applied sciences, particularly spectral methods for the solution of partial differential equations [3]. For applications of harmonic analysis on the 2--sphere S 2 , an efficient Legendre transform is as crucial to numeric computation as the fast Fourier transform (FFT) is to classical time--series analysis on R. This stems from the fact that harmonic ana...

[1]  G. Steidl,et al.  A polynomial approach to fast algorithms for discrete Fourier-cosine and Fourier-sine transforms , 1991 .

[2]  D. Healy,et al.  Computing Fourier Transforms and Convolutions on the 2-Sphere , 1994 .

[3]  D. Rockmore,et al.  Generalized FFT's- A survey of some recent results , 1996, Groups and Computation.

[4]  F. A. Seiler,et al.  Numerical Recipes in C: The Art of Scientific Computing , 1989 .

[5]  Paul N. Swarztrauber,et al.  Multiprocessor FFTs , 1987, Parallel Comput..

[6]  S. Lennart Johnsson,et al.  A Vector Space Framework for Parallel Stable Permutations , 1995 .

[7]  Dennis M. Healy,et al.  Fast Discrete Polynomial Transforms with Applications to Data Analysis for Distance Transitive Graphs , 1997, SIAM J. Comput..

[8]  Dennis M. Healy,et al.  Efficiency and Stability Issues in the Numerical Computation of Fourier Transforms and Convolutions on the 2-Sphere , 1994 .

[9]  G. L. Browning,et al.  A comparison of three numerical methods for solving differential equations on the sphere , 1989 .

[10]  Ian T. Foster,et al.  Parallelizing the Spectral Transform Method: A Comparison of Alternative Parallel Algorithms , 1993, PPSC.

[11]  Patrick H. Worley,et al.  Parallelizing the Spectral Transform Method , 1992, The Sixth Distributed Memory Computing Conference, 1991. Proceedings.

[12]  Bradley K. Alpert,et al.  A Fast Algorithm for the Evaluation of Legendre Expansions , 1991, SIAM J. Sci. Comput..

[13]  Richard B. Pelz,et al.  A Balanced, Parallel algorithm for Spectral Global Climate Models , 1993, PPSC.

[14]  Implementation and parallelization of fast matrix multiplication for a fast Legendre transform , 1993 .

[15]  Leslie G. Valiant,et al.  A bridging model for parallel computation , 1990, CACM.