Hardware design of FFT polynomial multipliers

This paper presents the design of two FFT polynomial multipliers using parallel and sequential architectures. Parallel and sequential polynomial multipliers were optimized for throughput and area resources, respectively. The designs are described in generic structural VHDL, synthesized on the Stratix EP4SGX230KF40C2 using Quartus II V. 13, and verified using SignalTap. The hardware synthesis and performance results show that the designed multipliers present a good area-throughput trade-off and they are suitable for high-performance scientific computing applications.

[1]  Tet Hin Yeap,et al.  Efficient FPGA implementation of FFT based multipliers , 2005, Canadian Conference on Electrical and Computer Engineering, 2005..

[2]  Alexander Lopez Parrado,et al.  COFDM baseband processor on FPGA , 2011, 2011 IEEE Second Latin American Symposium on Circuits and Systems (LASCAS).

[3]  Carl Ponder Parallel Multiplication and Powering of Polynomials , 1991, J. Symb. Comput..

[4]  Marc Moreno Maza,et al.  FFT-Based Dense Polynomial Arithmetic on Multi-cores , 2009, HPCS.

[5]  Pavel Emeliyanenko,et al.  Efficient Multiplication of Polynomials on Graphics Hardware , 2009, APPT.

[6]  Richard P. Brent,et al.  Modern Computer Arithmetic , 2010 .

[7]  Mats Torkelson,et al.  A new approach to pipeline FFT processor , 1996, Proceedings of International Conference on Parallel Processing.

[8]  Tim Güneysu,et al.  Towards Efficient Arithmetic for Lattice-Based Cryptography on Reconfigurable Hardware , 2012, LATINCRYPT.

[9]  M. M. Maza,et al.  Fast polynomial multiplication on a GPU , 2010 .

[10]  Christof Paar,et al.  Generalizations of the Karatsuba Algorithm for Efficient Implementations , 2006, IACR Cryptol. ePrint Arch..