Algorithmic and architectural co-design of a motion-estimation engine for low-power video devices

Due to the large amount of data transfers it involves, the motion estimation (ME) engine is one of the most power-consuming components of any predictive video codec. As a consequence, power-optimized video coding primarily relies on a carefully designed motion estimator. This paper first presents a block ME algorithm that meets high-quality inter-frame prediction and low computational complexity requirements. It relies on a set of rules common to all recent fast and adaptive ME algorithms, but is designed in order to allow for easy and prolific data reuse. The adjacent order of the candidate positions during the search increases the locality and maintains a near-regular data flow, which results in a decrease of the data transfers and a low control complexity. Together with the computational complexity reduction, it enables cost-efficient very large scale integration realizations. A pipelined parallel architecture is then proposed and discussed. It is generic in the sense that it is suited both to the full-pel and half-pel ME. It is efficient because it allows for close to 100% hardware utilization and a sharp decrease of the peak memory bandwidth. It is suited to low-power implementation, as it enables larger data reuse factors for the most probable stages of the adaptive algorithm, which reduces the average memory bandwidth and power consumption.

[1]  Liang-Gee Chen,et al.  An efficient and simple VLSI tree architecture for motion estimation algorithms , 1993, IEEE Trans. Signal Process..

[2]  Bing Zeng,et al.  Optimization of fast block motion estimation algorithms , 1997, IEEE Trans. Circuits Syst. Video Technol..

[3]  Jhing-Fa Wang,et al.  Dynamic search-window adjustment and interlaced search for block-matching algorithm , 1993, IEEE Trans. Circuits Syst. Video Technol..

[4]  Francky Catthoor,et al.  Power and Speed-Efficient Code Transformation of Video Compression Algorithms for RISC Processors , 2001, J. VLSI Signal Process..

[5]  Christophe De Vleeschouwer,et al.  Motion estimation for low power video devices , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[6]  Peter A. Ruetz,et al.  A high-performance full-motion video compression chip set , 1992, IEEE Trans. Circuits Syst. Video Technol..

[7]  Oscar C. Au,et al.  A high performance algorithm for fast block based motion estimation , 1999 .

[8]  Ezzatollah Salari,et al.  Successive elimination algorithm for motion estimation , 1995, IEEE Trans. Image Process..

[9]  Liang-Gee Chen,et al.  Accuracy improvement and cost reduction of 3-step search block matching algorithm for video coding , 1994, IEEE Trans. Circuits Syst. Video Technol..

[10]  Kenneth Y. Yun,et al.  A low-power VLSI architecture for full-search block-matching motion estimation , 1998, IEEE Trans. Circuits Syst. Video Technol..

[11]  Jo Yew Tham,et al.  A novel unrestricted center-biased diamond search algorithm for block motion estimation , 1998, IEEE Trans. Circuits Syst. Video Technol..

[12]  R. Srinivasan,et al.  Predictive Coding Based on Efficient Motion Estimation , 1985, IEEE Trans. Commun..

[13]  Hugo De Man,et al.  Low Power Memory Storage and Transfer Organization for the MPEG-4 Full Pel Motion Estimation on a Multimedia Processor , 1999, IEEE Trans. Multim..

[14]  Zhongli He,et al.  A high performance fast search algorithm for block matching motion estimation , 1997, IEEE Trans. Circuits Syst. Video Technol..

[15]  Sujit Dey,et al.  Power management in high-level synthesis , 1999, IEEE Trans. Very Large Scale Integr. Syst..

[16]  Ming Lei Liou,et al.  Design of fast motion estimation algorithm based on hardware consideration , 1997, IEEE Trans. Circuits Syst. Video Technol..

[17]  Oscar C. Au,et al.  Fast motion estimation using modified circular zonal search , 1999, ISCAS'99. Proceedings of the 1999 IEEE International Symposium on Circuits and Systems VLSI (Cat. No.99CH36349).

[18]  Gauthier Lafruit,et al.  Optimal memory organization for scalable texture codecs in MPEG-4 , 1999, IEEE Trans. Circuits Syst. Video Technol..

[19]  Lurng-Kuo Liu,et al.  A block-based gradient descent search algorithm for block motion estimation in video coding , 1996, IEEE Trans. Circuits Syst. Video Technol..

[20]  Liang-Gee Chen,et al.  One-dimensional full search motion estimation algorithm for video coding , 1994, IEEE Trans. Circuits Syst. Video Technol..

[21]  Yu Hen Hu,et al.  A modular high-throughput architecture for logarithmic search block-matching motion estimation , 1998, IEEE Trans. Circuits Syst. Video Technol..

[22]  M. GHANBARI,et al.  The cross-search algorithm for motion estimation [image coding] , 1990, IEEE Trans. Commun..

[23]  Ming-Ting Sun,et al.  Modeling DCT coefficients for fast video encoding , 1999, IEEE Trans. Circuits Syst. Video Technol..

[24]  Ming Lei Liou,et al.  Genetic motion search algorithm for video compression , 1993, IEEE Trans. Circuits Syst. Video Technol..

[25]  Oscar C. Au,et al.  Predictive motion vector field adaptive search technique (PMVFAST): enhancing block-based motion estimation , 2000, IS&T/SPIE Electronic Imaging.

[26]  Alan N. Willson,et al.  A high accuracy predictive logarithmic motion estimation algorithm for video coding , 1995, Proceedings of ISCAS'95 - International Symposium on Circuits and Systems.

[27]  Young Serk Shim,et al.  A fast hierarchical motion vector estimation algorithm using mean pyramid , 1995, IEEE Trans. Circuits Syst. Video Technol..

[28]  Liang-Gee Chen,et al.  Parallel architectures for 3-step hierarchical search block-matching algorithm , 1994, IEEE Trans. Circuits Syst. Video Technol..

[29]  Fang-Hsuan Cheng,et al.  New fast and efficient two-step search algorithm for block motion estimation , 1999, IEEE Trans. Circuits Syst. Video Technol..

[30]  Chaitali Chakrabarti,et al.  Architectures for hierarchical and other block matching algorithms , 1995, IEEE Trans. Circuits Syst. Video Technol..

[31]  Walter Stechele,et al.  Complexity analysis of the emerging MPEG-4 standard as a basis for VLSI implementation , 1998, Electronic Imaging.

[32]  Ken D. Sauer,et al.  Efficient block motion estimation using integral projections , 1996, IEEE Trans. Circuits Syst. Video Technol..

[33]  Alexis M. Tourapis,et al.  Fast motion estimation using circular zonal search , 1998, Electronic Imaging.

[34]  Teresa H. Y. Meng,et al.  Design of a low power video decompression chip set for portable applications , 1996, J. VLSI Signal Process..

[35]  Ja-Ling Wu,et al.  A lightweight genetic block-matching algorithm for video coding , 1998, IEEE Trans. Circuits Syst. Video Technol..

[36]  Lai-Man Po,et al.  A novel four-step search algorithm for fast block motion estimation , 1996, IEEE Trans. Circuits Syst. Video Technol..

[37]  Peter M. Kuhn Fast MPEG-4 Motion Estimation: Processor Based and Flexible VLSI Implementations , 1999, J. VLSI Signal Process..

[38]  Liang-Gee Chen,et al.  A new block-matching criterion for motion estimation and its implementation , 1995, IEEE Trans. Circuits Syst. Video Technol..

[39]  Michael Stegherr,et al.  Parameterizable VLSI architectures for the full-search block-matching algorithm , 1989 .

[40]  Anil K. Jain,et al.  Displacement Measurement and Its Application in Interframe Image Coding , 1981, IEEE Trans. Commun..

[41]  Francky Catthoor,et al.  Low-power data transfer and storage exploration for H.263 video decoder system , 1998, IEEE J. Sel. Areas Commun..

[42]  Bing Zeng,et al.  A new three-step search algorithm for block motion estimation , 1994, IEEE Trans. Circuits Syst. Video Technol..

[43]  Oscar C. Au,et al.  An advanced zonal block based algorithm for motion estimation , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[44]  Peter Pirsch,et al.  Array architectures for block matching algorithms , 1989 .

[45]  Kai-Kuang Ma,et al.  A new diamond search algorithm for fast block-matching motion estimation , 2000, IEEE Trans. Image Process..

[46]  Konstantinos Konstantinides,et al.  Low-complexity block-based motion estimation via one-bit transforms , 1997, IEEE Trans. Circuits Syst. Video Technol..

[47]  Jianhua Lu,et al.  A simple and efficient search algorithm for block-matching motion estimation , 1997, IEEE Trans. Circuits Syst. Video Technol..

[48]  C.-C. Jay Kuo,et al.  Fast motion vector estimation using multiresolution-spatio-temporal correlations , 1997, IEEE Trans. Circuits Syst. Video Technol..

[49]  Oscar C. Au,et al.  An adaptive center (radar) zonal based algorithm for motion estimation , 1999, ICECS'99. Proceedings of ICECS '99. 6th IEEE International Conference on Electronics, Circuits and Systems (Cat. No.99EX357).

[50]  Teresa H. Meng,et al.  Portable video-on-demand in wireless communication , 1995, Proc. IEEE.

[51]  Bede Liu,et al.  New fast algorithms for the estimation of block motion vectors , 1993, IEEE Trans. Circuits Syst. Video Technol..

[52]  Kristof Denolf,et al.  Cost-Efficient C-Level Design of an MPEG-4 Video Decoder , 2000, PATMOS.

[53]  T Koga,et al.  MOTION COMPENSATED INTER-FRAME CODING FOR VIDEO CONFERENCING , 1981 .

[54]  Liang-Gee Chen,et al.  A predictive parallel motion estimation algorithm for digital image processing , 1991, [1991 Proceedings] IEEE International Conference on Computer Design: VLSI in Computers and Processors.