Highly Parallel and Fully Reused H.264/AVC High Profile Intra Predictor Generation Engine for Super Hi-Vision 4k×4k@60 fps

This paper proposes a high-profile intra predictor generation engine for H.264/AVC. First, a hardware-level algorithm optimization for the intra 8 × 8 (I8MB) mode is introduced: the original candidate pixels used to generate I8MB prediction samples are replaced with the boundary pixels of intra 4 × 4 (I4MB) blocks. With this substitution, full data reuse between the I4MB predictors and the filtered I8MB samples is achieved with almost no quality loss. Second, a lossless parallel predictor generation flow that processes two 4 × 4 blocks at a time is proposed. It shortens the original flow from 16 stages to 10 stages for I4MB and intra 16 × 16 (I16MB), saving 37.5% of the processing cycles. For I8MB, a similar method with a different processing order of the 4 × 4 blocks is used. Third, fully utilized hardwired engines for I4MB, I16MB, and I8MB are proposed. Except for the DC (direct current) and plane modes, full data reuse among all high-profile intra modes is achieved. Fourth, for DC mode, a combined predictor generation process is introduced in which the predictor generation of I16MB's DC mode is merged into that of I4MB's DC mode. Moreover, by configuring the proposed hardwired engines, predictor generation for I16MB's plane mode and the chrominance plane mode is completed in only 50% of the cycles of the original design. Overall, compared with the original full-mode design and the latest dynamic mode-reuse design, the proposed engine saves 89.5% and 73.2% of the processing cycles, respectively. Synthesized in TSMC 0.18 µm technology under worst-case operating conditions (1.62 V, 125 °C), the design requires 37.2k gates and, running at 380 MHz, handles real-time high-profile intra predictor generation for Super Hi-Vision 4k × 4k@60 fps. Its maximum operating frequency under worst-case conditions is 468 MHz.
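To illustrate why the I16MB DC predictor can share work with the I4MB DC process, the following minimal Python sketch reuses 4-pixel boundary sums for both modes. It is not the paper's hardware flow, which the abstract does not detail. It assumes all neighboring pixels are available and uses the standard H.264 DC rounding formulas. The function names (`group_sums`, `dc_i4`, `dc_i16`) and the sample pixel values are hypothetical.

```python
# Illustrative sketch only: shows arithmetic reuse between DC modes,
# not the hardware architecture proposed in the paper.

def group_sums(pixels, size=4):
    """Sum each consecutive group of `size` boundary pixels."""
    return [sum(pixels[i:i + size]) for i in range(0, len(pixels), size)]

def dc_i4(top_sum, left_sum):
    """I4MB DC predictor from one 4-pixel top sum and one 4-pixel left sum."""
    return (top_sum + left_sum + 4) >> 3

def dc_i16(top_sums, left_sums):
    """I16MB DC predictor built from the same 4-pixel partial sums."""
    return (sum(top_sums) + sum(left_sums) + 16) >> 5

# Hypothetical neighbors of one macroblock: 16 pixels above, 16 to the left.
top = list(range(100, 116))
left = [50] * 16

top_sums = group_sums(top)    # needed by the top row of 4x4 blocks
left_sums = group_sums(left)  # needed by the left column of 4x4 blocks

dc_corner = dc_i4(top_sums[0], left_sums[0])  # top-left 4x4 block
dc_mb = dc_i16(top_sums, left_sums)           # whole 16x16 macroblock
print(dc_corner, dc_mb)
```

Only the 4 × 4 blocks on the top row and left column of the macroblock share their neighbors with I16MB, so the reuse in this sketch applies to those partial sums; interior blocks depend on pixels inside the macroblock.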
