Discrete cosine and sine transforms - regular algorithms and pipeline architectures

In this paper, regular fast algorithms for discrete cosine transform (DCT) and discrete sine transform (DST) of types II-IV are proposed and mapped onto pipeline architectures. The algorithms are based on the factorization of transform matrices described earlier by Wang. The regular structures of the algorithms are advantageous when mapping them onto hardware although such algorithms do not reach the theoretical lower bound on multiplicative complexity. Instead, the algorithms lend themselves for vertical mapping resulting in area-efficient pipeline structures. A unified pipeline architecture supporting both the DCT-II and its inverse is implemented with data path synthesis for proving the feasibility and estimating the performance. The latency of an ASIC implementation is 94 cycles while operating at 250 MHz frequency.

[1]  Marc Davio,et al.  Kronecker products and shuffle algebra , 1981, IEEE Transactions on Computers.

[2]  Paul H. Siegel,et al.  Area-efficient architectures for the Viterbi algorithm. I. Theory , 1993, IEEE Trans. Commun..

[3]  Hyesook Lim,et al.  A Serial-Parallel Architecture for Two-Dimensional Discrete Cosine and Inverse Discrete Cosine Transforms , 2000, IEEE Trans. Computers.

[4]  Jaakko Astola,et al.  Architecture-oriented regular algorithms for discrete sine and cosine transforms , 1999, IEEE Trans. Signal Process..

[5]  Jari Nikara,et al.  Pipeline architecture for two-dimensional discrete cosine transform and its inverse , 2002, 9th International Conference on Electronics, Circuits and Systems.

[6]  G.S. Moschytz,et al.  Practical fast 1-D DCT algorithms with 11 multiplications , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[7]  Liang-Gee Chen,et al.  A cost-effective architecture for 8×8 two-dimensional DCT/IDCT using direct method , 1997, IEEE Trans. Circuits Syst. Video Technol..

[8]  David Akopian,et al.  Pipeline architecture for DCT/IDCT , 2001, ISCAS 2001. The 2001 IEEE International Symposium on Circuits and Systems (Cat. No.01CH37196).

[9]  Jaehee You,et al.  One- and two-dimensional constant geometry fast cosine transform algorithms and architectures , 1999, IEEE Trans. Signal Process..

[10]  Jarmo Takala,et al.  Stride Permutation Access In Interleaved Memory Systems , 2003 .

[11]  Alan N. Willson,et al.  A 100 MHz 2-D 8×8 DCT/IDCT processor for HDTV applications , 1995, IEEE Trans. Circuits Syst. Video Technol..

[12]  Zhongde Wang Fast discrete sine transform algorithms , 1990 .

[13]  S.-F. Hsiao,et al.  New matrix formulation for two-dimensional DCT/IDCT computation and its distributed-memory VLSI implementation , 2002 .

[14]  Paul H. Siegel,et al.  Area-efficient architectures for the Viterbi algorithm II. Applications , 1993, IEEE Trans. Commun..

[15]  Zhongde Wang Fast algorithms for the discrete W transform and for the discrete Fourier transform , 1984 .

[16]  Paul H. Siegel,et al.  Area-efficient architectures for the Viterbi algorithm , 1990, [Proceedings] GLOBECOM '90: IEEE Global Telecommunications Conference and Exhibition.

[17]  Michael Conner,et al.  Recursive fast algorithm and the role of the tensor product , 1992, IEEE Trans. Signal Process..

[18]  P. Laguna,et al.  Signal Processing , 2002, Yearbook of Medical Informatics.

[19]  David Akopian,et al.  Constant geometry algorithm for discrete cosine transform , 2000, IEEE Trans. Signal Process..

[20]  Zhongde Wang,et al.  Pruning the fast discrete cosine transform , 1991, IEEE Trans. Commun..

[21]  Markku Renfors,et al.  Implementation of parallel cosine and sine modulated filter banks for equalized transmultiplexer systems , 2001, 2001 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.01CH37221).

[22]  PeiZong Lee,et al.  Restructured recursive DCT and DST algorithms , 1994, IEEE Trans. Signal Process..

[23]  Jaakko Astola,et al.  Pipeline architecture for 8/spl times/8 discrete cosine transform , 2000, 2000 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings (Cat. No.00CH37100).

[24]  Yasushi Ooi,et al.  A Block Processing Unit in a Single-Chip MPEG-2 Video Encoder LSI , 1999, J. VLSI Signal Process..