Temporally adaptive motion interpolation exploiting temporal masking in visual perception

In this paper we present a novel technique to dynamically adapt motion interpolation structures by temporal segmentation. The number of reference frames and the intervals between them are adjusted according to the temporal variation of the input video. Bit-rate control for this dynamic group of pictures (GOP) structure is achieved by taking advantage of temporal masking in human vision. Constant picture quality can be obtained by variable-bit-rate coding using this approach. Further improvement can be made when the intervals between reference frames are chosen by minimizing a measure of the coding difficulty of a GOP. Advantages for low bit-rate coding and implications for variable-bit-rate coding are discussed. Simulations on test video are presented for various GOP structures and temporal segmentation methods, and the results compare favorably with those for conventional fixed GOP structures.

[1]  Zigmantas L. Budrikis,et al.  Detail perception after scene changes in television image presentations , 1965, IEEE Trans. Inf. Theory.

[2]  D. Kahneman Time-intensity reciprocity under various conditions of adaptation and backward masking. , 1966, Journal of experimental psychology.

[3]  W. G. Owen Spatio-temporal integration in the human peripheral retina. , 1972, Vision research.

[4]  R. Haber,et al.  The psychology of visual perception , 1973 .

[5]  B. Prasada,et al.  Adaptive quantization of picture signals using spatial masking , 1977, Proceedings of the IEEE.

[6]  Norman C. Griswold Perceptual Coding In The Cosine Transform Domain , 1980 .

[7]  Norman B. Nill,et al.  A Visual Model Weighted Cosine Transform for Image Compression and Quality Assessment , 1985, IEEE Trans. Commun..

[8]  R. Sedgewick,et al.  Algorithms (2nd ed.) , 1988 .

[9]  Gunnar Karlsson,et al.  Three dimensional sub-band coding of video , 1988, ICASSP-88., International Conference on Acoustics, Speech, and Signal Processing.

[10]  Willem Verbiest,et al.  The impact of the ATM concept on video coding , 1988, IEEE J. Sel. Areas Commun..

[11]  Mitsuru Nomura,et al.  Basic characteristics of variable rate video coding in ATM environment , 1989, IEEE J. Sel. Areas Commun..

[12]  Willem Verbiest,et al.  A variable bit rate video codec for asynchronous transfer mode networks , 1989, IEEE J. Sel. Areas Commun..

[13]  R. J. Safranek,et al.  A perceptually tuned sub-band image coder with image dependent quantization and post-quantization data compression , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[14]  Bernd Girod,et al.  The Information Theoretical Significance of Spatial and Temporal Masking in Video Signals , 1989, Photonics West - Lasers and Applications in Science and Engineering.

[15]  Hiroshi Yasuda,et al.  Variable bit-rate coding of video signals for ATM networks , 1989, IEEE J. Sel. Areas Commun..

[16]  Robert J. Safranek,et al.  Perceptually tuned sub-band image coder , 1990, Other Conferences.

[17]  Atul Puri,et al.  Comparing motion-interpolation structures for video coding , 1990, Other Conferences.

[18]  Mitsuru Nomura,et al.  Layered coding for ATM based video distribution systems , 1991, Signal Process. Image Commun..

[19]  V. J. Mathews,et al.  Vector quantization of images using visual masking functions , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[20]  Jean-Yves Le Boudec,et al.  The Asynchronous Transfer Mode: A Tutorial , 1992, Comput. Networks ISDN Syst..