The Motion Estimator Implementation with Efficient Structure for Full Search Algorithm of Variable Block Size

The motion estimation in video encoding system occupies the biggest part. So, we require the motion estimator with efficient structure for real-time operation. And for motion estimator's implementation, it is desired to design hardware module of an exclusive use that perform the encoding process at high speed. This paper proposes motion estimation detection block(MED), 41 SADs(Sum of Absolute Difference) calculation block, minimum SAD calculation and motion vector generation block based on parallel processing. The parallel processing can reduce effectively the amount of the operation. The minimum SAD calculation and MED block uses the pre-computation technique for reducing switching activity of the input signal. It results in high-speed operation. The MED and 41 SADs calculation blocks are composed of adder tree which causes the problem of critical path. So, the structure of adder tree has changed the most commonly used ripple carry adder(RCA) with carry skip adder(CSA). It enables adder tree to operate at high speed. In addition, as we enabled to easily control key variables such as control signal of search range from the outside, the efficiency of hardware structure increased. Simulation and FPGA verification results show that the delay of MED block generating the critical path at the motion estimator is reduced about 19.89% than the conventional strukcture.

[1]  Alaaeldin Amin High-speed self-timed carry-skip adder , 2006 .

[2]  Young Hwan Kim,et al.  Hardware implementation of motion estimation using a sub-sampled block for frame rate up-conversion , 2008, 2008 International SoC Design Conference.

[3]  Liang-Gee Chen,et al.  Analysis and architecture design of variable block-size motion estimation for H.264/AVC , 2006, IEEE Transactions on Circuits and Systems I: Regular Papers.

[4]  Lai-Man Po,et al.  A novel cross-diamond search algorithm for fast block motion estimation , 2002, IEEE Trans. Circuits Syst. Video Technol..

[5]  Amlan Chakrabarti,et al.  Parallel Hardware Design for Motion Estimation , 2009 .

[6]  Liang-Gee Chen,et al.  Low-power parallel tree architecture for full search block-matching motion estimation , 2004, 2004 IEEE International Symposium on Circuits and Systems (IEEE Cat. No.04CH37512).

[7]  John V. McCanny,et al.  A VLSI architecture for advanced video coding motion estimation , 2003, Proceedings IEEE International Conference on Application-Specific Systems, Architectures, and Processors. ASAP 2003.

[8]  Wenjun Zhang,et al.  An improved block size selection method based on macroblock movement characteristic , 2009, Multimedia Tools and Applications.

[9]  Chein-Wei Jen,et al.  On the data reuse and memory bandwidth analysis for full-search block-matching VLSI architecture , 2002, IEEE Trans. Circuits Syst. Video Technol..

[10]  Zhi Zhou,et al.  Fast variable block-size motion estimation algorithm based on merge and slit procedures for H.264/MPEG-4 AVC , 2004, 2004 IEEE International Symposium on Circuits and Systems (IEEE Cat. No.04CH37512).

[11]  R. Srinivasan,et al.  Predictive Coding Based on Efficient Motion Estimation , 1985, IEEE Trans. Commun..

[12]  Shyang Chang,et al.  Zero waiting-cycle hierarchical block matching algorithm and its array architectures , 1994, IEEE Trans. Circuits Syst. Video Technol..

[13]  Hubert Kaeslin,et al.  Cell-Based Multilevel Carry-Increment Adders with Minimal AT- and PT-Products , 1996 .

[14]  오세만,et al.  H.264 움직임 추정을 위한 효율적인 SAD 프로세서 , 2007 .