Among various transform coding techniques for image compression the Discrete Cosine Transform (DCT) is considered to be the most effective method and has been widely used in the laboratory as well as in the market, place. DCT is computationally intensive. For video application at 14.3 MHz sample rate, a direct implementation of a 16x16 DCT requires a throughput, rate of approximately half a billion multiplications per second. In order to reduce the cost of hardware implementation, a single chip DCT implementation is highly desirable. In this paper, the implementation of a 16x16 DCT chip using a concurrent architecture will be presented. The chip is designed for real-time processing of 14.3 MHz sampled video data. It uses row-column decomposition to implement the two-dimensional transform. Distributed arithmetic combined with hit-serial and hit-parallel structures is used to implement the required vector inner products concurrently. Several schemes are utilized to reduce the size of required memory. The resultant circuit only uses memory, shift registers, and adders. No multipliers are required. It achieves high speed performance with a very regular and efficient integrated circuit realization. The chip accepts 0-bit input and produces 14-bit DCT coefficients. 12 bits are maintained after the first one-dimensional transform. The circuit has been laid out using a 2-μm CMOS technology with a symbolic design tool MULGA. The core contains approximately 73,000 transistors in an area of 7.2 x 7.0
[1]
M. Vetterli,et al.
Simple FFT and DCT algorithms with reduced number of operations
,
1984
.
[2]
Martin Vetterli,et al.
A Discrete Fourier-Cosine Transform Chip
,
1986,
IEEE J. Sel. Areas Commun..
[3]
F. Jutand,et al.
A single chip video rate 16×16 discrete cosine transform
,
1986,
ICASSP '86. IEEE International Conference on Acoustics, Speech, and Signal Processing.
[4]
N. Ahmed,et al.
Discrete Cosine Transform
,
1996
.
[5]
N. H. E. Weste.
Mulga — an interactive symbolic layout system for the design of integrated circuits
,
1981,
The Bell System Technical Journal.