The discrete cosine transform (DCT) is one of the major components in most of image and video compression systems. The variable complexity algorithm framework has been applied successfully to achieve complexity savings in the computation of the inverse DCT in decoders. These gains can be achieved due to the highly predictable sparseness of the quantized DCT coefficients in natural image/video data. With the increasing demand for instant video messaging and two-way video transmission over mobile communication systems running on general-purpose embedded processors, the encoding complexity needs to be optimized. In this paper, we focus on complexity reduction techniques for the forward DCT, which is one of the more computationally intensive tasks in the encoder. Unlike the inverse DCT, the forward DCT does not operate on sparse input data, but rather generates sparse output data. Thus, complexity reduction must be obtained using different methods from those used for the inverse DCT. In the literature, two major approaches have been applied to speed up the forward DCT computation, namely, frequency selection, in which only a subset of DCT coefficients is computed, and accuracy selection, in which all the DCT coefficients are computed with reduced accuracy. These two approaches can achieve significant computation savings with minor output quality degradation, as long as the coding parameters are such that the quantization error is larger than the error due to the approximate DCT computation. Thus, in order to be useful, these algorithms have to be combined using an efficient mechanism that can select the "right" level of approximation as a function of the characteristics of the input and the target rate, a selection that is often based on heuristic criteria. In this paper, we consider two previously proposed fast, variable complexity, forward DCT algorithms, one based on frequency selection, the other based on accuracy selection. We provide an explicit analysis of the additional distortion that each scheme introduces as a function of the quantization parameter and the variance of the input block. This analysis then allows us to improve the performance of these algorithms by making it possible to select the best approximation level for each block and a target quantization parameter. We also propose a hybrid algorithm that combines both forms of complexity reduction in order to achieve overall better performance over a broader range of operating rates. We show how our techniques lead to scalable implementations where complexity can be reduced if needed, at the cost of small reductions in video quality. Our hybrid algorithm can speed up the DCT and quantization process by close to a factor of 4 as compared to fixed-complexity forward DCT implementations, with only a slight quality degradation in PSNR.
[1]
Martin Vetterli,et al.
A Discrete Fourier-Cosine Transform Chip
,
1986,
IEEE J. Sel. Areas Commun..
[2]
Antonio Ortega,et al.
Complexity-distortion tradeoffs in image and video compression
,
2000
.
[3]
Ming-Ting Sun,et al.
Modeling DCT coefficients for fast video encoding
,
1999,
IEEE Trans. Circuits Syst. Video Technol..
[4]
Antonio Ortega,et al.
DCT computation based on variable complexity fast approximations
,
1998,
Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).
[5]
Sanjit K. Mitra,et al.
Subband DCT: definition, analysis, and applications
,
1996,
IEEE Trans. Circuits Syst. Video Technol..
[6]
Zhongde Wang.
Fast algorithms for the discrete W transform and for the discrete Fourier transform
,
1984
.
[7]
Zhihong Man,et al.
Comments on "Fast algorithms and implementation of 2-D discrete cosine transform"
,
1998,
IEEE Trans. Circuits Syst. Video Technol..
[8]
Y. Arai,et al.
A Fast DCT-SQ Scheme for Images
,
1988
.
[9]
P. Duhamel,et al.
New 2nDCT algorithms suitable for VLSI implementation
,
1987,
ICASSP '87. IEEE International Conference on Acoustics, Speech, and Signal Processing.
[10]
G.S. Moschytz,et al.
Practical fast 1-D DCT algorithms with 11 multiplications
,
1989,
International Conference on Acoustics, Speech, and Signal Processing,.
[11]
Bernd Girod,et al.
A content-dependent fast DCT for low bit-rate video coding
,
1998,
Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).
[12]
David A. Patterson,et al.
Computer Architecture - A Quantitative Approach, 5th Edition
,
1996
.
[13]
A global decision method for moving picture coding
,
1999
.
[14]
Anil K. Jain.
Fundamentals of Digital Image Processing
,
2018,
Control of Color Imaging Systems.
[15]
Xuelong Zhu,et al.
A global decision method for moving picture coding
,
1999,
IEEE Trans. Consumer Electron..
[16]
Faouzi Kossentini,et al.
The quantized DCT and its application to DCT-based video coding
,
2002,
IEEE Trans. Image Process..
[17]
Michael Joseph Gormish,et al.
Source coding with channel, distortion, and complexity constraints
,
1994
.
[18]
David A. Patterson,et al.
Computer Architecture: A Quantitative Approach
,
1969
.
[19]
Masao Ikekawa,et al.
Fast 2D IDCT implementation with multimedia instructions for a software MPEG2 decoder
,
1998,
Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).
[20]
Wen-Hsiung Chen,et al.
A Fast Computational Algorithm for the Discrete Cosine Transform
,
1977,
IEEE Trans. Commun..
[21]
P. Yip,et al.
Discrete Cosine Transform: Algorithms, Advantages, Applications
,
1990
.