Efficient VLSI architectures for fast computation of the discrete Fourier transform and its inverse

In this paper, we propose two new VLSI architectures for computing the N-point discrete Fourier transform (DFT) and its inverse (IDFT) based on a radix-2 fast algorithm, where N is a power of two. The first part of this work presents a linear systolic array that requires log/sub 2/ N complex multipliers and is able to provide a throughput of one transform sample per clock cycle. Compared with other related systolic designs based on direct computation or a radix-2 fast algorithm, the proposed one has the same throughput performance but involves less hardware complexity. This design is suitable for high-speed real-time applications, but it would not be easily realized in a single chip when N gets large. To balance the chip area and the processing speed, we further present a new reduced-complexity design for the DFT/IDFT computation. The alternative design is a memory-based architecture that consists of one complex multiplier, two complex adders, and some special memory units. The new design has the capability of computing one transform sample every log/sub 2/ N+1 clock cycles on average. In comparison with the first design, the second design reaches a lower throughput with less hardware complexity. As N=512, the chip area required for the memory-based design is about 5742/spl times/5222 /spl mu/m/sup 2/, and the corresponding throughput can attain a rate as high as 4M transform samples per second under 0.6 /spl mu/m CMOS technology. Such area-time performance makes this design very competitive for use in long-length DFT applications, such as asymmetric digital subscriber lines (ADSL) and orthogonal frequency-division multiplexing (OFDM) systems.

[1]  D. J. Skellern,et al.  VLSI for OFDM , 1998 .

[2]  Marshall C. Pease,et al.  An Adaptation of the Fast Fourier Transform for Parallel Processing , 1968, JACM.

[3]  Shen-Fu Hsiao,et al.  Power, speed and area comparison of several new DPT architectures , 1997, Proceedings of 1997 IEEE International Symposium on Circuits and Systems. Circuits and Systems in the Information Age ISCAS '97.

[4]  Shousheng He,et al.  Design and implementation of a 1024-point pipeline FFT processor , 1998, Proceedings of the IEEE 1998 Custom Integrated Circuits Conference (Cat. No.98CH36143).

[5]  N. Rama Murthy,et al.  On the real-time computation of DFT and DCT through systolic architectures , 1994, IEEE Trans. Signal Process..

[6]  Long-Wen Chan,et al.  A new systolic array for discrete Fourier transform , 1988, IEEE Trans. Acoust. Speech Signal Process..

[7]  Alan V. Oppenheim,et al.  Discrete-Time Signal Pro-cessing , 1989 .

[8]  J. Choi,et al.  A new linear systolic array for FFT computation , 1992 .

[9]  Francisco Argüello,et al.  A VLSI Constant Geometry Architecture for the Fast Hartley and Fourier Transforms , 1992, IEEE Trans. Parallel Distributed Syst..

[10]  John M. Cioffi,et al.  Performance Evaluation of a Multichannel Transceiver System for ADSL and VHDSL Services , 1991, IEEE J. Sel. Areas Commun..

[11]  Ramjee Prasad,et al.  Overview of multicarrier CDMA , 1997, IEEE Commun. Mag..

[12]  V. Boriakoff,et al.  FFT computation with systolic arrays, a new architecture , 1994 .

[13]  W. Steenaart,et al.  Efficient one-dimensional systolic array realization of the discrete Fourier transform , 1989 .

[14]  K. Maxwell Asymmetric digital subscriber line: interim technology for the next forty years , 1996 .

[15]  H. T. Kung Why systolic architectures? , 1982, Computer.