A comparison of FFT processor designs

ASTRON is the Netherlands Institute for Radio Astronomy. They operate, among others, LOFAR (Low Frequency Array), which is a radio telescope using a concept based on a large array of omni-directional antennas. The signals from these antennas go through various processing units, one of which is an FFT processor. In the current LOFAR design, FPGAs are used for this, since the numbers are too small to a�ord custom chips. For future astronomical applications, especially for the SKA telescope, a more speci�c chip solution is desired. SKA will be much larger than LOFAR and use many more processing elements. As power consumption is a major concern, the FPGAs are unsuitable and they need to be replaced with ASICs. The energy consumption of the FPGAs is compared to the energy comsumption of the same FFT design implemented on an ASIC. For the FPGA synthesis and power calculation, Quartus is used. The ASIC was synthesized with Synopsys Design Compiler using 65nm technology. The energy usage is reduced from 0.84�J per FFT on the FPGA to 0.41�J per FFT on the ASIC. Four new ASIC designs are compared to the existing one, in search of a better solution. An approach that uses the minimal amount of memory (SDF), and one that uses more memory for faster calculation (MDC) are implemented for both radix-2 and radix-4 designs. Di�erent complex multipliers and di�erent methods of storing the twiddle factors are also compared. The fast calculating radix-2 design gives the best results. Combined with a complex multiplier that uses Gauss' complex multiplication algorithm and a twiddle factor component based on registers, the energy comsumption per FFT can be reduced to 0.33�J.

[1]  Jonathan Rose,et al.  Measuring the Gap Between FPGAs and ASICs , 2007, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[2]  J. Tukey,et al.  An algorithm for the machine calculation of complex Fourier series , 1965 .

[3]  Mats Torkelson,et al.  A new approach to pipeline FFT processor , 1996, Proceedings of International Conference on Parallel Processing.

[4]  Chris Dick Computing the Discrete Fourier Transform on FPGA Based Systolic Arrays , 1996, Fourth International ACM Symposium on Field-Programmable Gate Arrays.

[5]  Quanyuan Feng,et al.  ASIC design of low-power reconfigurable FFT processor , 2007, 2007 7th International Conference on ASIC.

[6]  K. Shankar,et al.  Design and implementation of low power FFT/IFFT processor for wireless communication , 2012, International Conference on Pattern Recognition, Informatics and Medical Engineering (PRIME-2012).

[7]  Sabih H. Gerez,et al.  Implementaion of a combined OFDM-demodulation and WCDMA-equalization module , 2006 .

[8]  P. K. Jawahar,et al.  An efficient FFT/IFFT architecture for wireless communication , 2012, 2012 International Conference on Communication and Signal Processing.

[9]  Luca Fanucci,et al.  Design and Comparison of FFT VLSI Architectures for SoC Telecom Applications with Different Flexibility, Speed and Complexity Trade-Offs , 2012, Circuits Syst. Signal Process..

[10]  Akshay Sridharan,et al.  Low Power Hardware Implementation of High Speed FFT Core , 2010, 2010 International Conference on Advances in Computer Engineering.

[11]  Edmund Lai,et al.  Practical digital signal processing for engineers and technicians , 2004 .

[12]  Prabir Saha,et al.  ASIC implementation of high speed processor for calculating discrete Fourier transformation using circular convolution technique , 2011 .

[13]  Indranil Hatai,et al.  ASIC implementation of a 512-point FFT/IFFT processor for 2D CT image reconstruction algorithm , 2011, IEEE Technology Students' Symposium.

[14]  John G. Proakis,et al.  Digital Signal Processing: Principles, Algorithms, and Applications , 1992 .

[15]  B.M. Baas An energy-efficient single-chip FFT processor , 1996, 1996 Symposium on VLSI Circuits. Digest of Technical Papers.

[16]  Shousheng He,et al.  Design and implementation of a 1024-point pipeline FFT processor , 1998, Proceedings of the IEEE 1998 Custom Integrated Circuits Conference (Cat. No.98CH36143).

[17]  C. Burrus Index mappings for multidimensional formulation of the DFT and convolution , 1977 .

[18]  Li Jie,et al.  Low-power digital ASIC for on-chip spectral analysis of low-frequency physiological signals , 2012 .

[19]  Tughrul Arslan,et al.  A low-power and domain-specific reconfigurable FFT fabric for system-on-chip applications , 2005, 19th IEEE International Parallel and Distributed Processing Symposium.

[20]  Bevan M. Baas,et al.  A low-power, high-performance, 1024-point FFT processor , 1999, IEEE J. Solid State Circuits.

[21]  Sy-Yen Kuo,et al.  Adaptive Low-Error Fixed-Width Booth Multipliers , 2007, IEICE Trans. Fundam. Electron. Commun. Comput. Sci..