A class of parallel architectures for fast Fourier transform

We propose a new class of parallel architectures called unfolded swapped networks (USN) for fast Fourier transform (FFT) and related problems. The VLSI area of a suitably constructed N(log/sub 2/ N+o(log N))-node USN is no more than N/sup 2/+o(N/sup 2/), which is smaller than the best known result for a log, N-dimensional butterfly network. USNs can be constructed using small butterfly modules, each built on a chip, and requires fewer pins than a similar-sized butterfly network by a factor of /spl Theta/(log N). N-point FFT can be executed on a USN at a speed comparable to a butterfly network, assuming constant link delay; it can be executed on a USN considerably faster than on a butterfly when link delay increases with length and/or when inter-chip data transfers are much slower than intra-chip ones.

[1]  Sajal K. Das,et al.  Book Review: Introduction to Parallel Algorithms and Architectures : Arrays, Trees, Hypercubes by F. T. Leighton (Morgan Kauffman Pub, 1992) , 1992, SIGA.

[2]  Jean Vuillemin,et al.  A combinatorial limit to the computing power of V.L.S.I. circuits , 1980, 21st Annual Symposium on Foundations of Computer Science (sfcs 1980).

[3]  Earl E. Swartzlander,et al.  A radix-8 wafer scale FFT processor , 1992, J. VLSI Signal Process..

[4]  Mark A. Franklin,et al.  Pin Limitations and Partitioning of VLSI Interconnection Networks , 1982, IEEE Transactions on Computers.

[5]  Susumu Horiguchi,et al.  Parallel FFT Algorithms Using Radix 4 Butterfly Computation on an Eight-Neighbor Processor Array , 1995, Parallel Comput..

[6]  Behrooz Parhami,et al.  Swapped networks: unifying the architectures and algorithms of a wide class of hierarchical parallel processors , 1996, Proceedings of 1996 International Conference on Parallel and Distributed Systems.

[7]  C. Loan Computational Frameworks for the Fast Fourier Transform , 1992 .

[8]  Jean Vuillemin,et al.  A Combinatorial Limit to the Computing Power of VLSI Circuits , 1983, IEEE Transactions on Computers.

[9]  Behrooz Parhami,et al.  Recursive hierarchical swapped networks: versatile interconnection architectures for highly parallel systems , 1996, Proceedings of SPDP '96: 8th IEEE Symposium on Parallel and Distributed Processing.

[10]  Thompson Fourier Transforms in VLSI , 1983, IEEE Transactions on Computers.