A Novel Search Area Data Reuse Method for Memory Access Optimization of Fast Search Motion Estimation

Motion estimation (ME) is a time-consuming algorithm to find a matching block in the search area for video applications, such as video compression. Motion estimation algorithm includes full search algorithm and fast search algorithm. If the width of the motion estimation algorithm search area is twice the size of the block, half of the search area for adjacent blocks overlaps. In view of this, this paper proposes a search area data reuse method for fast search motion estimation. With the proposed data reuse implementation, only the first block needs to read the data of the entire search area from the off-chip memory. The other blocks only need to read half the original search area. In this way, time of access to external memory is reduced, and running time of the algorithm is also decreased. Experimental results of diamond search show that the search area data reuse methods can reduce the running time by 40% to 60% compared with the algorithm of no use data reuse, and it can also reduce the power consumption by 62% to 73%. Compared with other methods in the literature, the proposed method also performs better on running time and power consumption.

[1]  Tao Liu,et al.  The Improved Full Search Algorithm for Motion Estimation with GPU Accelection , 2016 .

[2]  Antti Heikkinen,et al.  Parallel implementations of motion estimation algorithms using OpenCL , 2013, 2013 18th International Conference on Digital Signal Processing (DSP).

[3]  Hsueh-Ming Hang,et al.  Multiview encoder parallelized fast search realization on NVIDIA CUDA , 2011, 2011 Visual Communications and Image Processing (VCIP).

[4]  Zhiyong Liu,et al.  High-Performance Motion Estimation for Image Sensors with Video Compression , 2015, Sensors.

[5]  Zhiping Jia,et al.  Hybrid scratchpad and cache memory management for energy-efficient parallel HEVC encoding , 2015, 2015 33rd IEEE International Conference on Computer Design (ICCD).

[6]  Chein-Wei Jen,et al.  On the data reuse and memory bandwidth analysis for full-search block-matching VLSI architecture , 2002, IEEE Trans. Circuits Syst. Video Technol..

[7]  Jo Yew Tham,et al.  A novel unrestricted center-biased diamond search algorithm for block motion estimation , 1998, IEEE Trans. Circuits Syst. Video Technol..

[8]  Lu Fang,et al.  Novel RD-Optimized VBSME With Matching Highly Data Re-Usable Hardware Architecture , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[9]  P. Lakamsani An architecture for enhanced three step search generalized for hierarchical motion estimation algorithms , 1997 .

[10]  Kai-Kuang Ma,et al.  A new diamond search algorithm for fast block-matching motion estimation , 2000, IEEE Trans. Image Process..

[11]  Upena D. Dalal,et al.  Hardware Efficient Double Diamond Search Block Matching Algorithm for Fast Video Motion Estimation , 2016, J. Signal Process. Syst..

[12]  Seoung-Jun Oh,et al.  Fast motion estimation for HEVC with adaptive search range decision on CPU and GPU , 2014, 2014 IEEE China Summit & International Conference on Signal and Information Processing (ChinaSIP).

[13]  Yongfei Zhang,et al.  Motion Classification-Based Fast Motion Estimation for High-Efficiency Video Coding , 2017, IEEE Transactions on Multimedia.

[14]  Dongkyun Park,et al.  A new fast Three Step Search motion estimation algorithm in H.264 , 2007, 2007 International Forum on Strategic Technology.

[15]  Jong-Nam Kim,et al.  Fast full search motion estimation algorithm using early detection of impossible candidate vectors , 2002, IEEE Trans. Signal Process..

[16]  Zhao Zhang,et al.  Thermal Modeling and Management of DRAM Systems , 2013, IEEE Transactions on Computers.

[17]  Hiroshi Sakamoto,et al.  Lazy Management for Frequency Table on Hardware-Based Stream Lossless Data Compression , 2016, Inf..

[18]  Liang-Gee Chen,et al.  Single Reference Frame Multiple Current Macroblocks Scheme for Multiple Reference Frame Motion Estimation in H.264/AVC , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[19]  Jun Luo,et al.  An Unsymmetrical Diamond Search Algorithm for H.264/AVC Motion Estimation , 2013 .

[20]  Angus K. M. Wu,et al.  Four-step genetic search for block motion estimation , 1998, Proceedings of the 1998 IEEE International Conference on Acoustics, Speech and Signal Processing, ICASSP '98 (Cat. No.98CH36181).

[21]  Yun Zhou,et al.  Efficient Software HEVC to AVS2 Transcoding , 2016, Inf..

[22]  Satoshi Goto,et al.  OpenCL based high-quality HEVC motion estimation on GPU , 2014, 2014 IEEE International Conference on Image Processing (ICIP).

[23]  Chong-Min Kyung,et al.  Selective Search Area Reuse Algorithm for Low External Memory Access Motion Estimation , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[24]  Chong-Min Kyung,et al.  Search Area Selective Reuse Algorithm in Motion Estimation , 2007, 2007 IEEE International Conference on Multimedia and Expo.

[25]  Liang-Gee Chen,et al.  Level C+ data reuse scheme for motion estimation with corresponding coding orders , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[26]  Saeid Nooshabadi,et al.  GPU accelerated motion and disparity estimations for multiview coding , 2013, 2013 IEEE International Conference on Image Processing.

[27]  Myung Hoon Sunwoo,et al.  MESIP: A Configurable and Data Reusable Motion Estimation Specific Instruction-Set Processor , 2013, IEEE Transactions on Circuits and Systems for Video Technology.

[28]  Dianjie Lu,et al.  A Novel Data Reuse Method to Reduce Demand on Memory Bandwidth and Power Consumption For True Motion Estimation , 2018, IEEE Access.

[29]  Lap-Pui Chau,et al.  Hexagon-based search pattern for fast block motion estimation , 2002, IEEE Trans. Circuits Syst. Video Technol..