A Highly Parallel and Scalable CABAC Decoder for Next Generation Video Coding

Future video decoders will need to support high resolutions such as Quad Full HD (QFHD, 4096 × 2160) and fast frame rates (e.g., 120 fps). Many of these decoders will also reside in portable devices. Parallel processing can be used to increase the throughput for higher performance (i.e., processing speed), which can be traded-off for lower power with voltage scaling. The next generation standard called High Efficiency Video Coding (HEVC), which is being developed as a successor to H.264/AVC, not only seeks to improve the coding efficiency but also to account for implementation complexity and leverage parallelism to meet future power and performance demands. This paper presents a silicon prototype for a pre-standard algorithm developed for HEVC (“H.265”) called Massively Parallel CABAC (MP-CABAC) that addresses a key video decoder bottleneck. A scalable test chip is implemented in 65-nm and achieves a throughput of 24.11 bins/cycle, which enables it to decode the max H.264/AVC bit-rate (300 Mb/s) with only a 18 MHz clock at 0.7 V, while consuming 12.3 pJ/bin. At 1.0 V, it decodes a peak of 3026 Mbins/s for a bit-rate of 2.3 Gb/s, enough for QFHD at 186 fps. Both architecture and joint algorithm-architecture optimizations used to reduce critical path delay, area cost and memory size are discussed.

[1]  Jiun-In Guo,et al.  High-Throughput H.264/AVC High-Profile CABAC Decoder for HDTV Applications , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  Satoshi Goto,et al.  A 530Mpixels/s 4096×2160@60fps H.264/AVC high profile video decoder chip , 2010, 2010 Symposium on VLSI Circuits.

[3]  Liang-Gee Chen,et al.  A 59.5mW scalable/multi-view video decoder chip for Quad/3D Full HDTV and video streaming applications , 2010, 2010 IEEE International Solid-State Circuits Conference - (ISSCC).

[4]  Anantha Chandrakasan,et al.  Multicore Processing and Efficient On-Chip Caching for H.264 and Future Video Decoders , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Anantha Chandrakasan,et al.  Joint algorithm-architecture optimization of CABAC to increase speed and reduce area cost , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[6]  Detlev Marpe,et al.  A highly efficient multiplication-free binary arithmetic coder and its application in video coding , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[7]  F. Bossen,et al.  Common test conditions and software reference configurations , 2010 .

[8]  Ian H. Witten,et al.  Arithmetic coding for data compression , 1987, CACM.

[9]  Itu-T and Iso Iec Jtc Advanced video coding for generic audiovisual services , 2010 .

[10]  Chen Kong Teh,et al.  A 9.7mW AAC-Decoding, 620mW H.264 720p 60fps Decoding, 8-Core Media Processor with Embedded Forward-Body-Biasing and Power-Gating Circuit in 65nm CMOS Technology , 2008, 2008 IEEE International Solid-State Circuits Conference - Digest of Technical Papers.

[11]  Joan L. Mitchell,et al.  Optimal Hardware and Software Arithmetic Coding Procedures for the Q-Coder , 1988, IBM J. Res. Dev..

[12]  Anantha Chandrakasan,et al.  A Highly Parallel and Scalable CABAC Decoder for Next Generation Video Coding , 2012, IEEE J. Solid State Circuits.

[13]  Anantha Chandrakasan,et al.  A high throughput CABAC algorithm using syntax element partitioning , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[14]  A.P. Chandrakasan,et al.  A low-power 0.7-V H.264 720p video decoder , 2008, 2008 IEEE Asian Solid-State Circuits Conference.

[15]  Wen Gao,et al.  Variable-Bin-Rate CABAC Engine for H.264/AVC High Definition Real-Time Decoding , 2009, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[16]  Chun-Chieh Lin,et al.  H.264 Decoder: A Case Study in Multiple Design Points , 2008, 2008 6th ACM/IEEE International Conference on Formal Methods and Models for Co-Design.

[17]  Youn-Long Lin,et al.  A high-performance hardwired CABAC decoder for ultra-high resolution video , 2009, IEEE Transactions on Consumer Electronics.

[18]  Heiko Schwarz,et al.  Context-based adaptive binary arithmetic coding in the H.264/AVC video compression standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[19]  Vivienne Sze Parallel algorithms and architectures for low power video decoding , 2010 .