A Bin-Based Bitstream Partitioning Approach for Parallel CABAC Decoding in Next Generation Video Coding

Context-based Adaptive Binary Arithmetic Coding (CABAC) is one of the main throughput bottlenecks in video decoding due to its sequential nature and the lack of data-level parallelism. High-level parallelization techniques can be used in most state-of-the-art video codecs, but they usually require a full replication of the decoding hardware and decrease the coding efficiency. We present a Bin-based Bitstream Partitioning (B3P) scheme to enable additional thread-level parallelism in CABAC decoding. Binary symbols are distributed over eight bitstream partitions that can be decoded simultaneously. The implementation and evaluation are based on the High Efficiency Video Coding Standard (HEVC/H.265). Significant speedups up to 8.5x are achieved for CABAC decoding while only 9.2% extra cell area is required and the bitstream overhead remains below 1% for high bitrates. The B3P hardware decoder can process up to 3.94 Gbins/s. Compared to state-of-the-art related work, we achieve higher throughput with slightly lower hardware cost and similar coding efficiency.

[1]  Liang-Gee Chen,et al.  A branch selection multi-symbol high throughput CABAC decoder architecture for H.264/AVC , 2009, 2009 IEEE International Symposium on Circuits and Systems.

[2]  Madhukar Budagavi,et al.  High Throughput CABAC Entropy Coding in HEVC , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  In-Cheol Park,et al.  High speed decoding of context-based adaptive binary arithmetic codes using most probable symbol prediction , 2006, 2006 IEEE International Symposium on Circuits and Systems.

[4]  Shuming Chen,et al.  P3-CABAC: A Nonstandard Tri-Thread Parallel Evolution of CABAC in the Manycore Era , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Vivienne Sze,et al.  A Deeply Pipelined CABAC Decoder for HEVC Supporting Level 6.2 High-Tier Applications , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  F. Bossen,et al.  Common test conditions and software reference configurations , 2010 .

[7]  Vivienne Sze Parallel algorithms and architectures for low power video decoding , 2010 .

[8]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[9]  Ben H. H. Juurlink,et al.  Syntax Element Partitioning for high-throughput HEVC CABAC decoding , 2017, 2017 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).