An LSTM based Rate and Distortion Prediction Method for Low-delay Video Coding

This paper proposes an LSTM-based rate-distortion (R-D) prediction method for low-delay video coding. Unlike traditional rate control algorithms, the method uses a long short-term memory (LSTM) network to learn the latent pattern of the R-D relationship during video coding. Temporal information, hierarchical coding structure information, and the content of the frame to be encoded are used to achieve more accurate prediction. Based on the proposed network, a new method for predicting R-D model parameters is proposed and tested on the test model of Versatile Video Coding (VVC). Experimental results show that the proposed method achieves better performance than the state-of-the-art method used in VVC.
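
As an illustration of the general idea only, the following minimal sketch runs a standard LSTM cell over a short sequence of per-frame features and maps the final hidden state to two R-D model parameters. It is not the network of this paper: the feature layout (previous bits per pixel, previous distortion, temporal layer index, a content complexity measure), the hidden size, the two-parameter output, and the names `lstm_step` and `predict_rd_params` are all assumptions made for illustration, and the weights are random rather than trained.

```python
import math
import random


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def lstm_step(x, h, c, W, b):
    """One standard LSTM step; W maps [x; h] to four stacked gate pre-activations."""
    n = len(h)
    z = x + h  # concatenate current input and previous hidden state
    pre = [sum(w * v for w, v in zip(row, z)) + bias for row, bias in zip(W, b)]
    i = [sigmoid(p) for p in pre[0:n]]              # input gate
    f = [sigmoid(p) for p in pre[n:2 * n]]          # forget gate
    o = [sigmoid(p) for p in pre[2 * n:3 * n]]      # output gate
    g = [math.tanh(p) for p in pre[3 * n:4 * n]]    # candidate cell state
    c_new = [fj * cj + ij * gj for fj, cj, ij, gj in zip(f, c, i, g)]
    h_new = [oj * math.tanh(cj) for oj, cj in zip(o, c_new)]
    return h_new, c_new


def predict_rd_params(frames, W, b, V, d, hidden):
    """Run the LSTM over past frames, then apply a linear output layer."""
    h, c = [0.0] * hidden, [0.0] * hidden
    for x in frames:
        h, c = lstm_step(x, h, c, W, b)
    return [sum(v * hj for v, hj in zip(row, h)) + bias for row, bias in zip(V, d)]


# Hypothetical per-frame features (assumed, not from the paper):
# [previous bits per pixel, previous distortion, temporal layer index, complexity]
frames = [
    [0.08, 0.30, 0.0, 0.55],
    [0.03, 0.42, 2.0, 0.50],
    [0.05, 0.37, 1.0, 0.52],
]

rng = random.Random(0)  # untrained random weights, for illustration only
hidden, n_in = 4, 4
W = [[rng.uniform(-0.5, 0.5) for _ in range(n_in + hidden)] for _ in range(4 * hidden)]
b = [0.0] * (4 * hidden)
V = [[rng.uniform(-0.5, 0.5) for _ in range(hidden)] for _ in range(2)]
d = [0.0, 0.0]

params = predict_rd_params(frames, W, b, V, d, hidden)
print(params)  # two predicted R-D model parameters (values are meaningless here)
```

In a real system the weights would be learned from encoder statistics, and the predicted parameters would feed whichever R-D model the rate control scheme uses; both steps are outside the scope of this sketch.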
