A Fixed Point Approach to Analysis and Optimization of Motion Compensated Predictive Coding

In this paper, we propose a fixed point theoretical analysis of motion compensated predictive coding and demonstrate its potential in encoder optimization. Through viewing the encoder-decoder pair as a nonlinear dynamical system and inquiring about its convergent property, we demonstrate the feasibility of approximating the fixed point through recursive coding both theoretically and experimentally. Such a recursive coding approach to encoder optimization admits an interpretation of finding a more compact representation through local perturbation on the perceptual manifold. Experimental results have shown that our approach can achieve bit savings of 5-40% without sacrificing visual quality when tested on the KTA implementation of H.264 (JM14.2).

[1]  Gary J. Sullivan,et al.  Recent developments in standardization of high efficiency video coding (HEVC) , 2010, Optical Engineering + Applications.

[2]  R. Gray,et al.  Dithered Quantizers , 1993, Proceedings. 1991 IEEE International Symposium on Information Theory.

[3]  Michael T. Orchard,et al.  On the DPCM compression of Gaussian autoregressive sequences , 2001, IEEE Trans. Inf. Theory.

[4]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[5]  Michael W. Marcellin,et al.  JPEG2000 - image compression fundamentals, standards and practice , 2002, The Kluwer International Series in Engineering and Computer Science.

[6]  Tosio Kato,et al.  Perturbation theory for nullity, deficiency and other quantities of linear operators , 1958 .

[7]  Robert M. Gray,et al.  Dithered quantizers , 1993, IEEE Trans. Inf. Theory.

[8]  Xin Li Exemplar-Based EM-like image denoising via manifold reconstruction , 2010, 2010 IEEE International Conference on Image Processing.

[9]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  W. A. Kirk,et al.  Topics in Metric Fixed Point Theory , 1990 .

[11]  Gary J. Sullivan,et al.  Rate-distortion optimization for video compression , 1998, IEEE Signal Process. Mag..

[12]  G. Rees,et al.  The Neural Bases of Multistable Perception , 2022 .

[13]  M. Ledoux The concentration of measure phenomenon , 2001 .

[14]  Michael W. Marcellin,et al.  JPEG2000 - image compression fundamentals, standards and practice , 2013, The Kluwer international series in engineering and computer science.

[15]  C. Lambrecht Perceptual models and architectures for video coding applications , 1996 .

[16]  Scott J. Daly,et al.  Visible differences predictor: an algorithm for the assessment of image fidelity , 1992, Electronic Imaging.

[17]  Stefan Winkler,et al.  Issues in vision modeling for perceptual video quality assessment , 1999, Signal Process..

[18]  Jerry D. Gibson,et al.  Digital coding of waveforms: Principles and applications to speech and video , 1985, Proceedings of the IEEE.

[19]  James H. Johnson,et al.  Just Noticeable Difference , 2010 .

[20]  N. Logothetis,et al.  Multistable phenomena: changing views in perception , 1999, Trends in Cognitive Sciences.

[21]  A. Murat Tekalp,et al.  Digital Video Processing , 1995 .

[22]  Shirley Dex,et al.  JR 旅客販売総合システム(マルス)における運用及び管理について , 1991 .

[23]  William A. Kirk,et al.  A FIXED POINT THEOREM FOR ASYMPTOTICALLY NONEXPANSIVE MAPPINGS , 1972 .