Complexity-distortion tradeoffs in image and video compression

In this thesis we investigate variable complexity algorithms. The complexities of these algorithms are input-dependent, i.e., the type of input determines the complexity required to complete the operation. The key idea is to enable the algorithm to classify the inputs so that unnecessary operations can be pruned. The goal of the design of the variable complexity algorithm is to minimize the average complexity over all possible input types, including the cost of classifying the inputs. We study two of the fundamental operations in standard image/video compression, namely, the discrete cosine transform (DCT) and motion estimation (ME). We first explore variable complexity in inverse DCT by testing for zero inputs. The test structure can also be optimized for minimal total complexity for a given inputs statistics. In this case, the larger the number of zero coefficients, i.e., the coarser the quantization stepsize, the greater the complexity reduction. As a consequence, tradeoffs between complexity and distortion can be achieved. For direct DCT we propose a variable complexity fast approximation algorithm. The variable complexity part computes only DCT coefficients that will not be quantized to zeros according to the classification results (in addition the quantizer can benefit from this information by by-passing its operations for zero coefficients). The classification structure can also be optimized for a given input statistics. On the other hand, the fast approximation part approximates the DCT coefficients with much less complexity. The complexity can be scaled, i.e., it allows more complexity reduction at lower quality coding, and can be made quantization-dependent to keep the distortion degradation at a certain level. In video coding, ME is the part of the encoder that requires the most complexity and therefore achieving significant complexity reduction in ME has always been a goal in video coding research. We propose two fast algorithms based on fast distance metric computation or fast matching approaches. Both of our algorithms allow computational scalability in distance computation with graceful degradation in the overall image quality. The first algorithm exploits hypothesis testing in fast metric computation whereas the second algorithm uses thresholds obtained from partial distances in hierarchical candidate elimination. (Abstract shortened by UMI.)

[1]  Liang-Gee Chen,et al.  A new block-matching criterion for motion estimation and its implementation , 1995, IEEE Trans. Circuits Syst. Video Technol..

[2]  Chok-Kwan Cheung,et al.  Adaptive motion tracking block matching algorithms for video coding , 1999, IEEE Trans. Circuits Syst. Video Technol..

[3]  Allen Gersho,et al.  Vector quantization and signal compression , 1991, The Kluwer international series in engineering and computer science.

[4]  G. G. Stokes "J." , 1890, The New Yale Book of Quotations.

[5]  Kevin Barraclough,et al.  I and i , 2001, BMJ : British Medical Journal.

[6]  Aaas News,et al.  Book Reviews , 1893, Buffalo Medical and Surgical Journal.

[7]  C. Sidney Burrus,et al.  Efficient computation of the DFT with only a subset of input or output points , 1993, IEEE Trans. Signal Process..

[8]  Chi-Ying Tsui,et al.  Low power motion estimation design using adaptive pixel truncation , 1997, Proceedings of 1997 International Symposium on Low Power Electronics and Design.

[9]  Bede Liu,et al.  New fast algorithms for the estimation of block motion vectors , 1993, IEEE Trans. Circuits Syst. Video Technol..

[10]  F. Kossentini,et al.  Computation-constrained fast MPEG-2 encoding , 1997, IEEE Signal Processing Letters.

[11]  Eugene Cheung Optimal Bit Allocation Strategy for Joint Source/Channel Coding of Scalable Video , 2000 .

[12]  Konstantinos Konstantinides,et al.  Low-complexity block-based motion estimation via one-bit transforms , 1997, IEEE Trans. Circuits Syst. Video Technol..

[13]  K. J. Ray Liu,et al.  VLSI design of high-speed time-recursive 2-D DCT/IDCT processor for video applications , 1996, IEEE Trans. Circuits Syst. Video Technol..

[14]  Shen-Chuan Tai,et al.  Fast full-search block-matching algorithm for motion-compensated video compression , 1996, Proceedings of 13th International Conference on Pattern Recognition.

[15]  Ephraim Feig,et al.  New scaled DCT algorithms for fused multiply/add architectures , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[16]  Vivek K. Goyal,et al.  Computation-distortion characteristics of block transform coding , 1997, 1997 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[17]  Joan L. Mitchell,et al.  JPEG: Still Image Data Compression Standard , 1992 .

[18]  Lurng-Kuo Liu,et al.  A block-based gradient descent search algorithm for block motion estimation in video coding , 1996, IEEE Trans. Circuits Syst. Video Technol..

[19]  Jo Yew Tham,et al.  A novel unrestricted center-biased diamond search algorithm for block motion estimation , 1998, IEEE Trans. Circuits Syst. Video Technol..

[20]  Antonio Ortega,et al.  DCT computation with minimal average number of operations , 1997, Electronic Imaging.

[21]  Anil K. Jain Fundamentals of Digital Image Processing , 2018, Control of Color Imaging Systems.

[22]  Zhongde Wang Fast algorithms for the discrete W transform and for the discrete Fourier transform , 1984 .

[23]  Ya-Qin Zhang,et al.  A fast hierarchical motion-compensation scheme for video coding using block feature matching , 1996, IEEE Trans. Circuits Syst. Video Technol..

[24]  Michael Joseph Gormish,et al.  Source coding with channel, distortion, and complexity constraints , 1994 .

[25]  Antonio Ortega,et al.  Bit-rate control using piecewise approximated rate-distortion characteristics , 1998, IEEE Trans. Circuits Syst. Video Technol..

[26]  Kenneth Y. Yun,et al.  A low-power VLSI architecture for full-search block-matching motion estimation , 1998, IEEE Trans. Circuits Syst. Video Technol..

[27]  S. Hamid Nawab,et al.  Incremental refinement of DFT and STFT approximations , 1995, IEEE Signal Processing Letters.

[28]  Anil K. Jain,et al.  Displacement Measurement and Its Application in Interframe Image Coding , 1981, IEEE Trans. Commun..

[29]  Peter Pirsch,et al.  VLSI architectures for video compression-a survey , 1995, Proc. IEEE.

[30]  C.-C. Jay Kuo,et al.  A multi-threshold wavelet coder (MTWC) for high fidelity image compression , 1997, Proceedings of International Conference on Image Processing.

[31]  Gary J. Sullivan,et al.  Efficient quadtree coding of images and video , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[32]  Kannan Ramchandran,et al.  Joint thresholding and quantizer selection for transform image coding: entropy-constrained analysis and applications to baseline JPEG , 1997, IEEE Trans. Image Process..

[33]  John B. Anderson,et al.  Instrumentable tree encoding of information sources (Corresp.) , 1971, IEEE Trans. Inf. Theory.

[34]  C.-T. Chen,et al.  Adaptive transform coding via quadtree-based variable blocksize DCT , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[35]  Antonio Ortega,et al.  Probabilistic partial-distance fast matching algorithms for motion estimation , 2001, IEEE Trans. Circuits Syst. Video Technol..

[36]  K. Rijkse,et al.  H.263: video coding for low-bit-rate communication , 1996, IEEE Commun. Mag..

[37]  Bing Zeng,et al.  A new three-step search algorithm for block motion estimation , 1994, IEEE Trans. Circuits Syst. Video Technol..

[38]  Antonio Ortega,et al.  Complexity-distortion tradeoffs in vector matching based on probabilistic partial distance techniques , 1999, Proceedings DCC'99 Data Compression Conference (Cat. No. PR00096).

[39]  Ming-Ting Sun,et al.  Modeling DCT coefficients for fast video encoding , 1999, IEEE Trans. Circuits Syst. Video Technol..

[40]  Martin Vetterli,et al.  Interpolative multiresolution coding of advance television with compatible subchannels , 1991, IEEE Trans. Circuits Syst. Video Technol..

[41]  Joan L. Mitchell,et al.  MPEG Video Compression Standard , 1996, Springer US.

[42]  W.K. Cham,et al.  On compatibility of order-8 integer cosine transforms and the discrete cosine transform , 1990, IEEE TENCON'90: 1990 IEEE Region 10 Conference on Computer and Communication Systems. Conference Proceedings.

[43]  Jerome M. Shapiro,et al.  Embedded image coding using zerotrees of wavelet coefficients , 1993, IEEE Trans. Signal Process..

[44]  Ronald W. Schafer,et al.  Low complexity matching criteria for image/video applications , 1994, Proceedings of 1st International Conference on Image Processing.

[45]  Y. Arai,et al.  A Fast DCT-SQ Scheme for Images , 1988 .

[46]  Sohail Zafar,et al.  Motion-compensated wavelet transform coding for color video compression , 1992, IEEE Trans. Circuits Syst. Video Technol..

[47]  C.-C. Jay Kuo,et al.  Fast motion vector estimation using multiresolution-spatio-temporal correlations , 1997, IEEE Trans. Circuits Syst. Video Technol..

[48]  Martin Vetterli,et al.  A Discrete Fourier-Cosine Transform Chip , 1986, IEEE J. Sel. Areas Commun..

[49]  Avideh Zakhor,et al.  Multirate 3-D subband coding of video , 1994, IEEE Trans. Image Process..

[50]  R. Srinivasan,et al.  Predictive Coding Based on Efficient Motion Estimation , 1985, IEEE Trans. Commun..

[51]  Konrad Froitzheim,et al.  Knowledge-based approach to JPEG acceleration , 1995, Electronic Imaging.

[52]  Zhihong Man,et al.  Comments on "Fast algorithms and implementation of 2-D discrete cosine transform" , 1998, IEEE Trans. Circuits Syst. Video Technol..

[53]  B. Lee A new algorithm to compute the discrete cosine Transform , 1984 .

[54]  Chi-Wah Kok,et al.  Fast algorithm for computing discrete cosine transform , 1997, IEEE Trans. Signal Process..

[55]  Robert M. Gray,et al.  An Improvement of the Minimum Distortion Encoding Algorithm for Vector Quantization , 1985, IEEE Trans. Commun..

[56]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[57]  Yu Hen Hu,et al.  A novel modular systolic array architecture for full-search block matching motion estimation , 1995, IEEE Trans. Circuits Syst. Video Technol..

[58]  Neri Merhav,et al.  Inverse motion compensation of fast DCT , 1997 .

[59]  Jin Li,et al.  An embedded still image coder with rate-distortion optimization , 1999, IEEE Trans. Image Process..

[60]  Yair Shoham,et al.  Efficient bit allocation for an arbitrary set of quantizers [speech coding] , 1988, IEEE Trans. Acoust. Speech Signal Process..

[61]  M. J. Narasimha,et al.  On the Computation of the Discrete Cosine Transform , 1978, IEEE Trans. Commun..

[62]  Boon-Lock Yeo,et al.  Fast extraction of spatially reduced image sequences from MPEG-2 compressed video , 1999, IEEE Trans. Circuits Syst. Video Technol..

[63]  Fang-Hsuan Cheng,et al.  New fast and efficient two-step search algorithm for block motion estimation , 1999, IEEE Trans. Circuits Syst. Video Technol..