Pipeline FFT Architectures Optimized for FPGAs

This paper presents optimized implementations of two different pipeline FFT processors on Xilinx Spartan-3 and Virtex-4 FPGAs. Different optimization techniques and rounding schemes were explored. The implementation results achieved better performance with lower resource usage than prior art. The 16-bit 1024-point FFT with the R22SDF architecture had a maximum clock frequency of 95.2 MHz and used 2802 slices on the Spartan-3, a throughput per area ratio of 0.034 Msamples/s/slice. The R4SDC architecture ran at 123.8 MHz and used 4409 slices on the Spartan-3, a throughput per area ratio of 0.028 Msamples/s/slice. On Virtex-4, the 16-bit 1024-point R22SDF architecture ran at 235.6 MHz and used 2256 slice, giving a 0.104 Msamples/s/slice ratio; the 16-bit 1024-point R4SDC architecture ran at 219.2 MHz and used 3064 slices, giving a 0.072 Msamples/s/slice ratio. The R22SDF was more efficient than the R4SDC in terms of throughput per area due to a simpler controller and an easier balanced rounding scheme. This paper also shows that balanced stage rounding is an appropriate rounding scheme for pipeline FFT processors.

[1]  E. V. Jones,et al.  A pipelined FFT processor for word-sequential data , 1989, IEEE Trans. Acoust. Speech Signal Process..

[2]  T. Sansaloni,et al.  Efficient pipeline FFT processors for WLAN MIMO-OFDM systems , 2005 .

[3]  Bin Zhou,et al.  Implementations and Optimizations of Pipeline FFTs on Xilinx FPGAs , 2008, 2008 International Conference on Reconfigurable Computing and FPGAs.

[4]  C. K. Yuen,et al.  Theory and Application of Digital Signal Processing , 1978, IEEE Transactions on Systems, Man, and Cybernetics.

[5]  Mats Torkelson,et al.  A new approach to pipeline FFT processor , 1996, Proceedings of International Conference on Parallel Processing.

[6]  Alvin M. Despain,et al.  Pipeline and Parallel-Pipeline FFT Processors for VLSI Implementations , 1984, IEEE Transactions on Computers.

[7]  Peter Kabal,et al.  Performance of fixed-point FFT's: Rounding and scaling considerations , 1986, ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.

[8]  Khaled Benkrid,et al.  A high-level implementation of a high performance pipeline FFT on Virtex-E FPGAs , 2004, IEEE Computer Society Annual Symposium on VLSI.

[9]  Myoung Seob Lim,et al.  New Radix-2 to the 4th Power Pipeline FFT Processor , 2005, IEICE Trans. Electron..

[10]  Shousheng He,et al.  Wordlength optimization of a pipelined FFT processor , 1999, 42nd Midwest Symposium on Circuits and Systems (Cat. No.99CH36356).