Optimization techniques for FPGA-based wave-pipelined DSP blocks

In this paper, techniques for efficient implementation of field-programmable gate-array (FPGA)-based wave-pipelined (WP) multipliers, accumulators, and filters are presented. A comparison of the performance of WP and pipelined systems has been made. Major contributions of this paper are development of an on-chip clock generation scheme which permits finer tuning of the frequency, a synthesis technique that reduces the area and latency by 25%, a placement utility that results in 10%-40% increase in speed and proposal of an interleaving scheme for filters that reduces the number of multipliers required by 50%. WP multipliers of size 2 /spl times/ 6 and the filters using them are found to be 11% faster and require lower power than those using pipelined multipliers. Filters with higher order WP multipliers also operate with lower power at the cost of speed. The delay-register products of such filters are found to be about 60% lower than those using the pipelined multipliers. The paper also outlines applications of these techniques for the Spartan II FPGAs and a self-tuning scheme for optimizing the speed.

[1]  Wentai Liu,et al.  Wave Pipelining: Theory and CMOS Implementation , 1993 .

[2]  H. H. Guild Fully iterative fast array for binary multiplication and addition , 1969 .

[3]  Wonchan Kim,et al.  The design of 16/spl times/16 wave pipelined multiplier using fan-in equalization technique , 1997, Proceedings of 40th Midwest Symposium on Circuits and Systems. Dedicated to the Memory of Professor Mac Van Valkenburg.

[4]  Sorin A. Huss,et al.  Two-phase asynchronous wave-pipelines and their application to a 2D-DCT , 1999, Proceedings. Fifth International Symposium on Advanced Research in Asynchronous Circuits and Systems.

[5]  F. Morin,et al.  A high throughput architecture for channel equalization based on a neural network using a wave pipeline method , 1999, Engineering Solutions for the Next Millennium. 1999 IEEE Canadian Conference on Electrical and Computer Engineering (Cat. No.99TH8411).

[6]  Sorin A. Huss,et al.  VLSI system design using asynchronous wave pipelines: a 0.35 /spl mu/m CMOS 1.5 GHz elliptic curve public key cryptosystem chip , 2000, Proceedings Sixth International Symposium on Advanced Research in Asynchronous Circuits and Systems (ASYNC 2000) (Cat. No. PR00586).

[7]  Ralph K. Cavin,et al.  A 250-MHz wave pipelined adder in 2-/spl mu/m CMOS , 1994 .

[8]  R. S. Parthasarathy,et al.  Double pass-transistor logic for high performance wave pipeline circuits , 1998, Proceedings Eleventh International Conference on VLSI Design.

[9]  B. Venkataramani,et al.  Design and implementation of FPGA based wavepipelined fast convolver , 2000, 2000 TENCON Proceedings. Intelligent Systems and Technologies for the New Millennium (Cat. No.00CH37119).

[10]  Jr. Earl E. Swartzlander,et al.  VLSI Signal Processing Systems , 1985 .

[11]  Daniel Massicotte,et al.  A systolic architecture for Kalman-filter-based signal reconstruction using the wave pipeline method , 1998, 1998 IEEE International Conference on Electronics, Circuits and Systems. Surfing the Waves of Science and Technology (Cat. No.98EX196).

[12]  S. K. Nandy,et al.  Design and realization of high-performance wave-pipelined 8×8 b multiplier in CMOS technology , 1995, IEEE Trans. Very Large Scale Integr. Syst..

[13]  W. Liu,et al.  Wave-pipelining: a tutorial and research survey , 1998, IEEE Trans. Very Large Scale Integr. Syst..

[14]  José G. Delgado-Frias,et al.  A hybrid wave pipelined network router , 2002 .

[15]  E.I. Boemo,et al.  Wave pipelines via look-up tables , 1996, 1996 IEEE International Symposium on Circuits and Systems. Circuits and Systems Connecting the World. ISCAS 96.

[16]  Giovanni De Micheli,et al.  Designing high-performance digital circuits using wave pipelining: algorithms and practical experiences , 1993, IEEE Trans. Comput. Aided Des. Integr. Circuits Syst..

[17]  L. W. Cotten Maximum-rate pipeline systems , 1969, AFIPS '69 (Spring).

[18]  Ram Krishnamurthy,et al.  A CMOS wave-pipelined image processor for real-time morphology , 1995, Proceedings of ICCD '95 International Conference on Computer Design. VLSI in Computers and Processors.

[19]  Jr. W.A. Chren,et al.  One-hot residue coding for low delay-power product CMOS design , 1998 .