Area-Delay Efficient FFT Architecture using Parallel Processing and New Memory Sharing Technique

In this paper we present a novel architecture for FFT implementation on FPGA. The proposed architecture based on radix-4 algorithm presents the advantage of a higher throughput and low area-delay product. In fact, the novelty consists on using a memory sharing and dividing technique along with parallel-in parallel-out Processing Elements (PE). The proposed architecture can perform N-point FFT using only 4/3N delay elements and involves a latency of N/4 cycles. Comparison in terms of hardware complexity and area-delay product with recent works presented in the literature and commercial IPs has been made to show the efficiency of the proposed design. Moreover, from the experimental results obtained from a FPGA prototype we find that the proposed design involves an execution time of 56% lower than that obtained with Xilinx IP core and an increase of 19% in the throughput by area ratio for 256-point FFT.

[1]  Z. Ali A High-Speed FFT Processor , 1978, IEEE Trans. Commun..

[2]  T. Parks,et al.  A prime factor FFT algorithm using high-speed convolution , 1977 .

[3]  Alan V. Oppenheim,et al.  Discrete-Time Signal Pro-cessing , 1989 .

[4]  Guoan Bi,et al.  Pipelined structure based on radix-22 FFT algorithm , 2011, 2011 6th IEEE Conference on Industrial Electronics and Applications.

[5]  Lewis Johnson,et al.  Conflict free memory addressing for dedicated FFT hardware , 1992 .

[6]  Truong Q. Nguyen,et al.  On the Fixed-Point Accuracy Analysis of FFT Algorithms , 2008, IEEE Transactions on Signal Processing.

[7]  Myung Hoon Sunwoo,et al.  New in-place strategy for a mixed-radix FFT processor , 2003, IEEE International [Systems-on-Chip] SOC Conference, 2003. Proceedings..

[8]  C. Sidney Burrus,et al.  On computing the split-radix FFT , 1986, IEEE Trans. Acoust. Speech Signal Process..

[9]  E. V. Jones,et al.  A pipelined FFT processor for word-sequential data , 1989, IEEE Trans. Acoust. Speech Signal Process..

[10]  Peter Liu,et al.  Minimizing the memory requirement for continuous flow FFT implementation: continuous flow mixed mode FFT (CFMM-FFT) , 2000, 2000 IEEE International Symposium on Circuits and Systems. Emerging Technologies for the 21st Century. Proceedings (IEEE Cat No.00CH36353).

[11]  Mats Torkelson,et al.  A new approach to pipeline FFT processor , 1996, Proceedings of International Conference on Parallel Processing.

[12]  Yingning Peng,et al.  Pipeline FFT Architectures Optimized for FPGAs , 2009, Int. J. Reconfigurable Comput..

[13]  Ahmed F. Shalash,et al.  FPGA Implementation of an ASIP for high throughput DFT/DCT 1D/2D engine , 2011, 2011 IEEE International Symposium of Circuits and Systems (ISCAS).

[14]  Hen-Geul Yeh,et al.  Speed and area analysis of memory based FFT processors in a FPGA , 2007, 2007 Wireless Telecommunications Symposium.

[15]  Myung Hoon Sunwoo,et al.  A high-speed FFT processor for OFDM systems , 2002, 2002 IEEE International Symposium on Circuits and Systems. Proceedings (Cat. No.02CH37353).

[16]  Alvin M. Despain,et al.  Pipeline and Parallel-Pipeline FFT Processors for VLSI Implementations , 1984, IEEE Transactions on Computers.

[17]  J. Tukey,et al.  An algorithm for the machine calculation of complex Fourier series , 1965 .

[18]  Yousri Ouerhani,et al.  Implementation techniques of high-order FFT into low-cost FPGA , 2011, 2011 IEEE 54th International Midwest Symposium on Circuits and Systems (MWSCAS).

[19]  Myung Hoon Sunwoo,et al.  New continuous-flow mixed-radix (CFMR) FFT Processor using novel in-place strategy , 2005, IEEE Transactions on Circuits and Systems I: Regular Papers.

[20]  C. K. Yuen,et al.  Theory and Application of Digital Signal Processing , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[21]  Liang Yang,et al.  An efficient locally pipelined FFT processor , 2006, IEEE Transactions on Circuits and Systems II: Express Briefs.

[22]  Li Tan,et al.  Oversampling Technique for Obtaining Higher Order Derivative of Low-Frequency Signals , 2011, IEEE Transactions on Instrumentation and Measurement.

[23]  Alvin M. Despain,et al.  Fourier Transform Computers Using CORDIC Iterations , 1974, IEEE Transactions on Computers.