Efficient Fixed-Point Implementation Of Matrix-Based Intra Prediction

In the current version of the evolving Versatile Video Coding (VVC) standard, the Matrix-Based Intra Prediction (MIP) tool is included which is based on data-driven training techniques. This paper describes how an efficient fixed point implementation of the MIP prediction modes was designed. First, a method is presented that improves the integer quantization of the floating-point matrices resulting from the training by geometrically transforming the input and output. Second, a soft-clipping function can be used during the training stage to restrict the range of floating-point matrix entries. This enables the quantization of the trained coefficients with a fixed bit depth and precision. The clipping itself has no impact on the compression efficiency. The obtained predictors are incorporated into the Versatile Video Coding Test Model 7. All-Intra bit-rate savings of 0.6 % across different resolutions in terms of the Bjøntegaard-Delta bit rate (BD-rate) are reported.

[1]  Jaehyun Lim,et al.  Low Frequency Non-Separable Transform (LFNST) , 2019, 2019 Picture Coding Symposium (PCS).

[2]  D. Marpe,et al.  Neural network based intra prediction for video coding , 2018, Optical Engineering + Applications.

[3]  Heiko Schwarz,et al.  Intra Picture Prediction for Video Coding with Neural Networks , 2019, 2019 Data Compression Conference (DCC).

[4]  Heiko Schwarz,et al.  A Data-Trained, Affine-Linear Intra-Picture Prediction in the Frequency Domain , 2019, 2019 Picture Coding Symposium (PCS).

[5]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Heiko Schwarz,et al.  An Affine-Linear Intra Prediction With Complexity Constraints , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[7]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[8]  Koo Moonmo,et al.  Low Frequency Non-Separable Transform (LFNST) , 2019 .

[9]  Bin Li,et al.  Fully Connected Network-Based Intra Prediction for Image Coding , 2018, IEEE Transactions on Image Processing.