Fast RNS-based 2D-DCT computation on field-programmable devices

This paper shows the implementation of an 8/spl times/8 2D-DCT (discrete cosine transform) processor based on the residue number system (RNS). It makes use of a fast cosine transform (FCT) algorithm that requires a single multiplication stage for each signal path, while most other algorithms include paths with more than one multiplication. The row-column decomposition technique is used and each 1D-DCT processor requires only 14 multipliers and 32 adders and subtractors. The proposed RNS-based 2D-DCT processor provides a throughput improvement over the equivalent 2's complement system of up to 147% when 8-bit moduli are used. This is achieved due to the synergy between RNS and modern FPL device families.

[1]  P. Landman,et al.  Compact inverse discrete cosine transform circuit for MPEG video decoding , 1997, 1997 IEEE Workshop on Signal Processing Systems. SiPS 97 Design and Implementation formerly VLSI Signal Processing.

[2]  Ephraim Feig,et al.  Fast algorithms for the discrete cosine transform , 1992, IEEE Trans. Signal Process..

[3]  P. Yip,et al.  Discrete Cosine Transform: Algorithms, Advantages, Applications , 1990 .

[4]  B. Lee A new algorithm to compute the discrete cosine Transform , 1984 .

[5]  Michael A. Soderstrand,et al.  Residue number system arithmetic: modern applications in digital signal processing , 1986 .

[6]  Javier Ramírez,et al.  RNS-FPL merged architectures for orthogonal DWT , 2000 .

[7]  P. C. Jain,et al.  VLSI implementation of two-dimensional DCT processor in real time for video codec , 1992 .

[8]  N. Ahmed,et al.  Discrete Cosine Transform , 1996 .

[9]  Wen-Hsiung Chen,et al.  A Fast Computational Algorithm for the Discrete Cosine Transform , 1977, IEEE Trans. Commun..

[10]  G.S. Moschytz,et al.  Practical fast 1-D DCT algorithms with 11 multiplications , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[11]  A. Lloris,et al.  A new implementation of the discrete cosine transform in the residue number system , 1999, Conference Record of the Thirty-Third Asilomar Conference on Signals, Systems, and Computers (Cat. No.CH37020).

[12]  R.F. Woods,et al.  Novel VLSI implementation of (8×8) point 2-D DCT , 1994 .

[13]  Fred J. Taylor,et al.  Pipelined Hogenauer CIC filters using field-programmable logic and residue number system , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[14]  F. Taylor,et al.  New Scaling Algorithms For The Chinese Remainder Theorem , 1988, Twenty-Second Asilomar Conference on Signals, Systems and Computers.