An efficient CORDIC array structure for the implementation of discrete cosine transform

We propose a novel implementation of the discrete cosine transform (DCT) and the inverse DCT (IDCT) algorithms using a CORDIC (coordinate rotation digital computer)-based systolic processor array structure. First, we reformulate an N-point DCT or IDCT algorithm into a rotation formulation which makes it suitable for CORDIC processor implementation. We then propose to use a pipelined CORDIC processor as the basic building block to construct l-D and 2-D systolic-type processor arrays to speed up the DCT and IDCT computation. Due to the proposed novel rotation formulation, we achieve 100% processor utilization in both 1-D and 2-D configurations. Furthermore, we show that for the 2-D configurations, the same data processing throughput rate ran be maintained as long as the processor array dimensions are increased linearly with N. Neither the algorithm formulation or the array configuration need to be modified. Hence, the proposed parallel architecture is scalable to the problem size. These desirable features make this novel implementation compare favorably to previously proposed DCT implementations. >

[1]  Ephraim Feig,et al.  Fast algorithms for the discrete cosine transform , 1992, IEEE Trans. Signal Process..

[2]  Y.H. Hu,et al.  CORDIC-based VLSI architectures for digital signal processing , 1992, IEEE Signal Processing Magazine.

[3]  Long-Wen Chang,et al.  Systolic arrays for the discrete Hartley transform , 1991, IEEE Trans. Signal Process..

[4]  Weiping Li,et al.  A new algorithm to compute the DCT and its inverse , 1991, IEEE Trans. Signal Process..

[5]  Ming-Chang Wu,et al.  A unified systolic array for discrete cosine and sine transforms , 1991, IEEE Trans. Signal Process..

[6]  Chaitali Chakrabarti,et al.  Systolic Architectures for the Computation of the Discrete Hartley and the Discrete Cosine Transforms Based on Prime Factor Decomposition , 1990, IEEE Trans. Computers.

[7]  Moon Ho Lee On computing 2-D systolic algorithm for discrete cosine transform , 1990 .

[8]  Ja-Ling Wu,et al.  Novel concurrent architecture to implement the discrete cosine transform based on index partitions , 1990 .

[9]  P. Yip,et al.  FAST ALGORITHMS FOR DCT-II , 1990 .

[10]  J.W. Mark,et al.  A new look at DCT-type transforms , 1989, IEEE Trans. Acoust. Speech Signal Process..

[11]  Hsieh Hou,et al.  A Fast Recursive Algorithm For Computing The Discrete Cosine Transform , 1986, Optics & Photonics.

[12]  S. Kung,et al.  VLSI Array processors , 1985, IEEE ASSP Magazine.

[13]  B. Lee A new algorithm to compute the discrete cosine Transform , 1984 .

[14]  Ed F. Deprettere,et al.  Pipelined cordic architectures for fast VLSI filtering and array processing , 1984, ICASSP.

[15]  Alvin M. Despain,et al.  Very Fast Fourier Transform Algorithms Hardware for Implementation , 1979, IEEE Transactions on Computers.

[16]  Wen-Hsiung Chen,et al.  A Fast Computational Algorithm for the Discrete Cosine Transform , 1977, IEEE Trans. Commun..

[17]  N. Ahmed,et al.  Discrete Cosine Transform , 1996 .

[18]  Alvin M. Despain,et al.  Fourier Transform Computers Using CORDIC Iterations , 1974, IEEE Transactions on Computers.