Automation techniques for implementation of hybrid wave-pipelined 2D DWT

In the literature, techniques such as pipelining and wave-pipelining (WP) are proposed for increasing the operating frequency of a digital circuit. In general, use of pipelining results in higher speed at the cost of increase in the area and clock routing complexity. On the other hand, use of WP results in less clock routing complexity and less area but enables the digital circuit to be operated only at moderate speeds. In this paper, a hybrid wave-pipelining scheme is proposed to get the benefits of both pipelining and WP techniques. Major contributions of this paper are: proposal for the implementation of 2D DWT using lifting scheme by adopting the hybrid wave-pipelining and proposal for the automation of the choice of clock frequency and clock skew between the input and output registers of wave-pipelined circuit using built in self test (BIST) and system-on-chip (SOC) approaches. In the hybrid scheme, different lifting blocks are interconnected using pipelining registers and the individual blocks are implemented using WP. For the purpose of evaluating the superiority of the schemes proposed in this paper, the system for the computation of one level 2D DWT is implemented using the following techniques: pipelining, non-pipelining and hybrid wave-pipelining. The BIST approach is used for the implementation on Xilinx Spartan-II device. The SOC approach is adopted for implementation on Altera and Xilinx field programmable gate arrays (FPGAs) based SOC kits with Nios II or Micro blaze soft-core processors. From the implementation results, it is verified that the hybrid WP circuit is faster than non-pipelined circuit by a factor of 1.25–1.39. The pipelined circuit is in turn faster than the hybrid wave-pipelined circuit by a factor of 1.15–1.38 and this is achieved with the increase in the number of registers by a factor of 1.79–3.15 and increase in the number of LEs by a factor of 1.11–1.65. The soft-core processor based automation scheme has considerably reduced the effort required for the design and testing of the hybrid wave-pipelined circuit. The techniques proposed in this paper, are also applicable for ASICs. The optimization schemes proposed in this paper are also applicable for the computation of other image transforms such as DCT, DHT.

[1]  B. Venkataramani,et al.  Optimization techniques for FPGA-based wave-pipelined DSP blocks , 2005, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[2]  R. Kumar,et al.  Conjoining Soft-Core FPGA Processors , 2006, 2006 IEEE/ACM International Conference on Computer Aided Design.

[3]  Jr. Earl E. Swartzlander,et al.  VLSI Signal Processing Systems , 1985 .

[4]  B. Venkataramani,et al.  SOC Implementation of HMM Based Speaker Independent Isolated Digit Recognition System , 2007, 20th International Conference on VLSI Design held jointly with 6th International Conference on Embedded Systems (VLSID'07).

[5]  G. Lakshminarayanan,et al.  Design and FPGA Implementation of Self-Tuned Wave-Pipelined Filters with Distributed Arithmetic Algorithm , 2008 .

[6]  W. Liu,et al.  Wave-pipelining: a tutorial and research survey , 1998, IEEE Trans. Very Large Scale Integr. Syst..

[7]  I. Daubechies,et al.  Factoring wavelet transforms into lifting steps , 1998 .

[8]  Paul Molitor,et al.  A pipelined architecture for partitioned DWT based lossy image compression using FPGA's , 2001, FPGA '01.

[9]  Michael John Sebastian Smith,et al.  Application-specific integrated circuits , 1997 .

[10]  B. Venkataramani,et al.  SOC implementation of wave-pipelined circuits , 2007, 2007 International Conference on Field-Programmable Technology.

[11]  Sorin A. Huss,et al.  VLSI system design using asynchronous wave pipelines: a 0.35 /spl mu/m CMOS 1.5 GHz elliptic curve public key cryptosystem chip , 2000, Proceedings Sixth International Symposium on Advanced Research in Asynchronous Circuits and Systems (ASYNC 2000) (Cat. No. PR00586).

[12]  José G. Delgado-Frias,et al.  A hybrid wave pipelined network router , 2002 .

[13]  Bruce A. Draper,et al.  Accelerated image processing on FPGAs , 2003, IEEE Trans. Image Process..

[14]  E.I. Boemo,et al.  Wave pipelines via look-up tables , 1996, 1996 IEEE International Symposium on Circuits and Systems. Circuits and Systems Connecting the World. ISCAS 96.

[15]  Wentai Liu,et al.  Wave Pipelining: Theory and CMOS Implementation , 1993 .

[16]  Ping-Sing Tsai,et al.  JPEG2000 Standard for Image Compression: Concepts, Algorithms and VLSI Architectures , 2004 .

[17]  Khalid Sayood,et al.  Introduction to Data Compression , 1996 .

[18]  G. Lakshminarayanan,et al.  Design and FPGA implementation of image block encoders with 2D-DWT , 2003, TENCON 2003. Conference on Convergent Technologies for Asia-Pacific Region.

[19]  G. Lakshminarayanan,et al.  Design and FPGA Implementation of Self Tuned Wavepipelined Filters , 2006 .