A 16-pixel parallel architecture with block-level/mode-level co-reordering approach for intra prediction in 4k×2k H.264/AVC video encoder

Intra prediction is the most important technology in H.264/AVC intra frame encoder. But there is extremely complicated data dependency and an immense amount of computation in intra prediction process. In order to meet the requirements of real-time coding and avoid hardware waste, this paper presents a parallel and high efficient H.264/AVC intra prediction architecture which targets high-resolution (e.g. 4k×2k) video encoding applications. In this architecture, the optimized intra 4×4 prediction engine can process sixteen pixels in parallel at a slightly higher hardware cost (compared to the previous four-pixel parallel architecture). The intra 16×16 prediction engine works in parallel with intra 4×4 prediction engine. It reuses the adder-tree of Sum of Absolute Transformed Difference (SATD) generator. Moreover, in order to reduce the data-dependency in intra 4×4 reconstruction loop, a block-level and mode-level co-reordering strategy is proposed. Therefore, the performance bottleneck of H.264/AVC intra encoding can be alleviated to a great extent. The proposed architecture supports full-mode intra prediction for H.264/AVC baseline, main and extended profiles. It takes only 163 cycles to complete the intra prediction process of one macroblock (MB). This design is synthesized with a SMIC 0.13μm CMOS cell library. The result shows that it takes 61k gates and can run at 215MHz, supporting real-time encoding of 4k×2k@40fps video sequences.

[1]  Youn-Long Lin,et al.  An H.264/AVC full-mode intra-frame encoder for 1080HD video , 2008, 2008 IEEE International Conference on Multimedia and Expo.

[2]  Luciano Volcan Agostini,et al.  Transforms and quantization design targeting the H.264/AVC intra prediction constraints , 2009, SBCCI.

[3]  Liang-Gee Chen,et al.  Analysis, fast algorithm, and VLSI architecture design for H.264/AVC intra frame coder , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Gary J. Sullivan,et al.  Rate-constrained coder control and comparison of video coding standards , 2003, IEEE Trans. Circuits Syst. Video Technol..

[5]  Hyuk-Jae Lee,et al.  An Efficient Pipelined Architecture for H.264/AVC Intra Frame Processing , 2007, 2007 IEEE International Symposium on Circuits and Systems.

[6]  Sergio Bampi,et al.  A real time H.264/AVC intra frame prediction hardware architecture for HDTV 1080P video , 2009, 2009 IEEE International Conference on Multimedia and Expo.

[7]  Muhammad Shafique,et al.  A parallel approach for high performance hardware design of intra prediction in H.264/AVC Video Codec , 2009, 2009 Design, Automation & Test in Europe Conference & Exhibition.

[8]  Tian-Sheuan Chang,et al.  A High-Definition H.264/AVC Intra-Frame Codec IP for Digital Video and Still Camera Applications , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[9]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, SPIE Optics + Photonics.

[10]  Itu-T and Iso Iec Jtc Advanced video coding for generic audiovisual services , 2010 .

[11]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[12]  Youn-Long Lin,et al.  A High-Performance VLSI Architecture for Intra Prediction and Mode Decision in H.264/AVC Video Encoding , 2006, APCCAS 2006 - 2006 IEEE Asia Pacific Conference on Circuits and Systems.