FFT computation with systolic arrays, a new architecture

The use of the Cooley-Tukey algorithm for computing the 1-d FFT lends itself to a particular matrix factorization which suggests direct implementation by linearly-connected systolic arrays. Here we present a new systolic architecture that embodies this algorithm. This implementation requires a smaller number of processors and a smaller number of memory cells than other recent implementations, as well as having all the advantages of systolic arrays. For the implementation of the decimation-in-frequency case, word-serial data input allows continuous real-time operation without the need of a serial-to-parallel conversion device. No control or data stream switching is necessary. Computer simulation of this architecture was done in the context of a 1024 point DFT with a fixed point processor, and CMOS processor implementation has started. >

[1]  Robert Michael Owens,et al.  An architecture for a VLSI FFT processor , 1983, Integr..

[2]  G. Bergland Fast Fourier transform hardware implementations--A survey , 1969 .

[3]  J. Choi,et al.  A new linear systolic array for FFT computation , 1992 .

[4]  H.L. Groginsky,et al.  A Pipeline Fast Fourier Transform , 1970, IEEE Transactions on Computers.

[5]  W. Steenaart,et al.  Efficient one-dimensional systolic array realization of the discrete Fourier transform , 1989 .

[6]  Howard C. Card,et al.  Dual Systolic Architectures for VLSI Digital Signal Processing Systems , 1986, IEEE Transactions on Computers.

[7]  H. T. Kung,et al.  Warp: A Programmable Systolic Array Processor , 1984, Optics & Photonics.

[8]  Chein-Wei Jen,et al.  A new systolic array algorithm for discrete Fourier transform , 1991, 1991., IEEE International Sympoisum on Circuits and Systems.

[9]  Marshall C. Pease,et al.  An Adaptation of the Fast Fourier Transform for Parallel Processing , 1968, JACM.

[10]  Robert Michael Owens,et al.  A VLSI chip for the winograd/Prime factor algorithm to compute the discrete Fourier transform , 1986, IEEE Trans. Acoust. Speech Signal Process..

[11]  H. T. Kung,et al.  Wafer-scale integration and two-level pipelined implementations of systolic arrays , 1984, J. Parallel Distributed Comput..

[12]  Chein-Wei Jen,et al.  On the design of VLSI arrays for discrete Fourier transform , 1992 .

[13]  H. T. Kung Why systolic architectures? , 1982, Computer.

[14]  R. Jarocki,et al.  Modular architecture for high performance implementation of FFT algorithm , 1986, ISCA 1986.

[15]  Eurípides Montagne,et al.  Optimizing systolic networks by fitting diagonals , 1987, Parallel Comput..