A Multi-Pass Coding Mode Search Framework For AV1 Encoder Optimization

The AV1 codec recently released by the Alliance of Open Media provides nearly 30% BDrate reduction over its predecessor VP9. It substantially extends the available coding block sizes and supports a wide range of prediction modes. There are also a large variety of transform kernel types and sizes. The combination provides an extremely wide range of flexible coding options. To translate such flexibility into compression efficiency, the encoder needs to conduct an extensive search over the space of coding modes. Optimization of the encoder complexity and compression efficiency trade-off is critical to productionizing AV1. Many research efforts have been devoted to devising feature space based pruning methods ranging from decision rules based on some simple observations to more complex neural network models. A multi-pass coding mode search framework is proposed in this work to provide a structural approach to reduce the search volume. It decomposes the original high dimensional space search into cascaded stages of lower dimensional space searches. To retain a near optimal search result, the scheme departs from conventional dimension reduction approach in which one retains a single winner at each stage, and uses that winner for the next stage (dimension). Instead, this framework retains a subset of the states that are the most likely winners at each stage, which are then fed into the next stage to find the next subset of winners. The subset size at each stage is determined by the likelihood that the optimal route will be captured in the current stage. Changing this likelihood parameter tunes the encoder for speed and compression performance trade-off. This framework can integrate with most existing feature based methods at its various stages. The framework provides 60% encoding time reduction at the expense of 0.6% compression loss in libaom AV1 encoder.

[1]  Chuohao Yeo,et al.  On fast coding tree block and mode decision for high-Efficiency Video Coding (HEVC) , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[2]  King Ngi Ngan,et al.  Fast HEVC Inter CU Decision Based on Latent SAD Estimation , 2015, IEEE Transactions on Multimedia.

[3]  Zulin Wang,et al.  Reducing Complexity of HEVC: A Deep Learning Approach , 2017, IEEE Transactions on Image Processing.

[4]  Peng Jin,et al.  Fast reference frame selection based on content similarity for low complexity HEVC encoder , 2016, J. Vis. Commun. Image Represent..

[5]  Yue Chen,et al.  An Overview of Core Coding Tools in the AV1 Video Codec , 2018, 2018 Picture Coding Symposium (PCS).

[6]  Takao Onoye,et al.  Hierarchical Structure-Based Fast Mode Decision for H.265/HEVC , 2015, IEEE Transactions on Circuits and Systems for Video Technology.

[7]  Xinfeng Zhang,et al.  Fast QTBT Partitioning Decision for Interframe Coding with Convolution Neural Network , 2018, 2018 25th IEEE International Conference on Image Processing (ICIP).

[8]  En-Hui Yang,et al.  Rate Distortion Optimization for H.264 Interframe Coding: A General Framework and Algorithms , 2007, IEEE Transactions on Image Processing.

[9]  Bin Li,et al.  An Efficient Fast Mode Decision Method for Inter Prediction in HEVC , 2016, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  Zhenyu Liu,et al.  CU Partition Mode Decision for HEVC Hardwired Intra Encoder Using Convolution Neural Network , 2016, IEEE Transactions on Image Processing.

[11]  Marko Viitanen,et al.  Efficient Mode Decision Schemes for HEVC Inter Prediction , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[12]  Zhi Liu,et al.  Adaptive Inter-Mode Decision for HEVC Jointly Utilizing Inter-Level and Spatiotemporal Correlations , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[13]  Zhiyong Gao,et al.  Neyman-Pearson-Based Early Mode Decision for HEVC Encoding , 2016, IEEE Transactions on Multimedia.

[14]  Munchurl Kim,et al.  A Novel Fast CU Encoding Scheme Based on Spatiotemporal Encoding Parameters for HEVC Inter Coding , 2015, IEEE Transactions on Circuits and Systems for Video Technology.