Performance Evaluation of Parallel Fast Fourier Transform on a Multiprocessor Workstation

Abstract The fast Fourier transform (FFT) is very frequently used in various fields such as computer tomography, speech recognition, and image processing. As the area of computer applications has broadened, the quantity of data to be transformed has greatly increased. A parallel FFT is one way of achieving a fast transformation. Up until now, the experimental performance of parallel FFTs has not been sufficiently investigated on real multiprocessor systems. This paper describes an implementation of a parallel FFT on a multiprocessor workstation to investigate its real performance. The multiprocessor workstation provides parallel environments for both a multithread operating system and a multiprocess operating system. The performance of the parallel FFT is discussed with respect to cache protocols, floating-point coprocessors, and operating systems.

[1]  N. Oba,et al.  Top-1: a snoop-cache-based multiprocessor , 1990, Ninth Annual International Phoenix Conference on Computers and Communications. 1990 Conference Proceedings.

[2]  Alan Norton,et al.  Parallelization and Performance Analysis of the Cooley–Tukey FFT Algorithm for Shared-Memory Architectures , 1987, IEEE Transactions on Computers.

[3]  Z. Cvetanovic,et al.  Performance Analysis of the FFT Algorithm on a Shared-Memory Parallel Architecture , 1987, IBM J. Res. Dev..

[4]  James K. Archibald,et al.  Cache coherence protocols: evaluation using a multiprocessor simulation model , 1986, TOCS.

[5]  Lawrence C. Stewart,et al.  Firefly: a multiprocessor workstation , 1987, ASPLOS 1987.

[6]  Kai Hwang,et al.  Computer architecture and parallel processing , 1984, McGraw-Hill Series in computer organization and architecture.

[7]  J. Tukey,et al.  An algorithm for the machine calculation of complex Fourier series , 1965 .