Power efficient and workload balanced tiling for parallelized high efficiency video coding

The increased workload of the High Efficiency Video Coding (HEVC) and processing of high resolution videos require parallelization of the encoding/decoding process. However, to efficiently utilize the hardware resources and power budgets in a many-core processor, workload balanced parallelization of HEVC encoding is of high importance. Further, minimizing the number of active cores for processing the given HEVC encoding workload is required to decrease the power consumption. In order to address the above challenges, this work presents a HEVC parallelization technique to adaptively determine the Tile partitioning while accounting for the compute capabilities of the underlying processing cores. Afterwards, it determines a mapping of Tiled-HEVC processing on different cores such that the number of compute cores is minimized, and hence reducing the power consumption. Experimental results demonstrate that in addition to reducing the total compute cores, our technique provides up to 14.4% power savings compared to state-of-the-art uniform Tile partitioning approach.

[1]  Colin Doutre,et al.  HEVC: The New Gold Standard for Video Compression: How Does HEVC Compare with H.264/AVC? , 2012, IEEE Consumer Electronics Magazine.

[2]  Guifen Tian,et al.  Content adaptive prediction unit size decision algorithm for HEVC intra coding , 2012, 2012 Picture Coding Symposium.

[3]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[5]  Jung Ho Ahn,et al.  McPAT: An integrated power, area, and timing modeling framework for multicore and manycore architectures , 2009, 2009 42nd Annual IEEE/ACM International Symposium on Microarchitecture (MICRO).

[6]  Ben H. H. Juurlink,et al.  Parallel Scalability and Efficiency of HEVC Parallelization Approaches , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Muhammad Usman Karim Khan,et al.  Software architecture of High Efficiency Video Coding for many-core systems with power-efficient workload balancing , 2014, 2014 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[8]  You Zhou,et al.  A dynamic slice control scheme for slice-parallel video encoding , 2012, 2012 19th IEEE International Conference on Image Processing.

[9]  Byeungwoo Jeon,et al.  Adaptive slice-level parallelism for H.264/AVC encoding using pre macroblock mode selection , 2008, J. Vis. Commun. Image Represent..

[10]  Lieven Eeckhout,et al.  Sniper: Exploring the level of abstraction for scalable and accurate parallel multi-core simulation , 2011, 2011 International Conference for High Performance Computing, Networking, Storage and Analysis (SC).

[11]  D. Marpe,et al.  Video coding with H.264/AVC: tools, performance, and complexity , 2004, IEEE Circuits and Systems Magazine.

[12]  Detlev Marpe,et al.  Performance comparison of H.265/MPEG-HEVC, VP9, and H.264/MPEG-AVC encoders , 2013, 2013 Picture Coding Symposium (PCS).

[13]  Muhammad Shafique,et al.  enBudget: A Run-Time Adaptive Predictive Energy-Budgeting scheme for energy-aware Motion Estimation in H.264/MPEG-4 AVC video encoder , 2010, 2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010).

[14]  Muhammad Usman Karim Khan,et al.  Hardware-software collaborative complexity reduction scheme for the emerging HEVC intra encoder , 2013, 2013 Design, Automation & Test in Europe Conference & Exhibition (DATE).

[15]  Muhammad Shafique,et al.  An HVS-based Adaptive Computational Complexity Reduction Scheme for H.264/AVC video encoder using Prognostic Early Mode Exclusion , 2010, 2010 Design, Automation & Test in Europe Conference & Exhibition (DATE 2010).

[16]  J-F Franche,et al.  A multi-frame and multi-slice H.264 parallel video encoding approach with simultaneous encoding of prediction frames , 2012, 2012 2nd International Conference on Consumer Electronics, Communications and Networks (CECNet).

[17]  Muhammad Shafique,et al.  Low power design of the next-generation High Efficiency Video Coding , 2014, 2014 19th Asia and South Pacific Design Automation Conference (ASP-DAC).

[18]  Ben H. H. Juurlink,et al.  Improving the parallelization efficiency of HEVC decoding , 2012, 2012 19th IEEE International Conference on Image Processing.

[19]  Sergio Bampi,et al.  Adaptive content-based Tile partitioning algorithm for the HEVC standard , 2013, 2013 Picture Coding Symposium (PCS).