论文信息 - Parametric Graph-Based Separable Transforms For Video Coding

Parametric Graph-Based Separable Transforms For Video Coding

In many video coding systems, separable transforms (such as two-dimensional DCT-2) have been used to code block residual signals obtained after prediction. This paper proposes a parametric approach to build graph-based separable transforms (GBSTs) for video coding. Specifically, a GBST is derived from a pair of line graphs, whose weights are determined based on two non-negative parameters. As certain choices of those parameters correspond to the discrete sine and cosine transform types used in recent video coding standards (including DCT-2, DST-7 and DCT-8), this paper further optimizes these graph parameters to better capture residual block statistics and improve video coding efficiency. The proposed GBSTs are tested on the Versatile Video Coding (VVC) reference software, and the experimental results show that about 0.4% average coding gain is achieved over the existing set of separable transforms constructed based on DCT-2, DST-7 and DCT-8 in VVC.

Marta Karczewicz | Hilmi E. Egilmez | Oguzhan Teke | Amir Said | Vadim Seregin

[1] José M. F. Moura,et al. Algebraic Signal Processing Theory: 1-D Space , 2008, IEEE Transactions on Signal Processing.

[2] Yue Chen,et al. An Overview of Core Coding Tools in the AV1 Video Codec , 2018, 2018 Picture Coding Symposium (PCS).

[3] Antonio Ortega,et al. Graph-Based Transforms for Video Coding , 2020, IEEE Transactions on Image Processing.

[4] Antonio Ortega,et al. Graph-based transforms for inter predicted video coding , 2015, 2015 IEEE International Conference on Image Processing (ICIP).

[5] Vladimir Vapnik,et al. An overview of statistical learning theory , 1999, IEEE Trans. Neural Networks.

[6] Stephen P. Boyd,et al. Convex Optimization , 2004, Algorithms and Theory of Computation Handbook.

[7] Jianqin Zhou,et al. On discrete cosine transform , 2011, ArXiv.

[8] G. Bjontegaard,et al. Calculation of Average PSNR Differences between RD-curves , 2001 .

[9] Humberto de Jesús Ochoa Domínguez,et al. Versatile Video Coding , 2019 .

[10] Kenneth Rose,et al. Jointly Optimized Spatial Prediction and Block Transform for Video and Image Coding , 2012, IEEE Transactions on Image Processing.

[11] Gary J. Sullivan,et al. Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[12] Hilmi E. Egilmez,et al. Low-Complexity Transform Adjustments for Video Coding , 2019, 2019 IEEE International Conference on Image Processing (ICIP).

[13] Fan Chung,et al. Spectral Graph Theory , 1996 .

[14] Antonio Ortega,et al. Graph Learning From Data Under Laplacian and Structural Constraints , 2016, IEEE Journal of Selected Topics in Signal Processing.

[15] Bernhard Schölkopf,et al. Statistical Learning Theory: Models, Concepts, and Results , 2008, Inductive Logic.

[16] Antonio Ortega,et al. GTT: Graph template transforms with applications to image coding , 2015, 2015 Picture Coding Symposium (PCS).

[17] Antonio Ortega,et al. Intra-Prediction and Generalized Graph Fourier Transform for Image Coding , 2015, IEEE Signal Processing Letters.

[18] Antonio Ortega,et al. GBST: Separable transforms based on line graphs for predictive video coding , 2016, 2016 IEEE International Conference on Image Processing (ICIP).