A Theory for the Optimal Bit Allocation Between Displacement Vector Field and Displaced Frame Difference

We address the fundamental problem of optimally splitting a video sequence into two sources of information, the displaced frame difference (DFD) and the displacement vector field (DVF). We first consider the case of a lossless motion-compensated video coder (MCVC) and derive a general dynamic programming (DP) formulation that yields an optimal tradeoff between the bits spent on the DVF and the bits spent on the DFD. We then consider the more important case of a lossy MCVC and present an algorithm that solves the tradeoff between rate and distortion. This algorithm is based on the Lagrange multiplier method and on the DP approach introduced for the lossless MCVC. We then present an H.263-based MCVC that uses the proposed optimal bit allocation and compare its results with those of H.263. As expected, the proposed coder is superior in the rate-distortion sense. It also offers advantages for rate control. The presented theory can be applied to build new optimal coders and to analyze the heuristics employed in existing coders. In particular, whenever an existing coder is modified, the proposed theory can be used to evaluate how the change affects its performance.
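The combination of a Lagrange multiplier with dynamic programming described above can be illustrated with a minimal Python sketch. It is not the coder presented in this work. It assumes that motion vectors are coded differentially along a fixed scan order, so the DVF rate of a block depends only on the previous block's vector, and that the total cost is J = D + lambda * R. All names (`viterbi_allocation`, `solve_for_budget`), the toy rate and distortion tables, and the bisection bracket are hypothetical choices made for illustration.

```python
from typing import Callable, List, Optional, Sequence, Tuple

Vector = Tuple[int, int]
BlockCost = Callable[[int, Vector], Tuple[int, int]]      # -> (DFD rate, distortion)
TransRate = Callable[[Optional[Vector], Vector], int]     # -> DVF rate

def viterbi_allocation(candidates: Sequence[Sequence[Vector]],
                       block_cost: BlockCost,
                       transition_rate: TransRate,
                       lam: float) -> Tuple[List[Vector], int, int]:
    """Return (vectors, total rate, total distortion) minimizing D + lam * R."""
    # best[j]: cheapest partial solution ending in candidate j of the current block
    best = []
    for v in candidates[0]:
        r, d = block_cost(0, v)
        r += transition_rate(None, v)
        best.append((d + lam * r, r, d, [v]))
    for k in range(1, len(candidates)):
        new_best = []
        for v in candidates[k]:
            r_k, d_k = block_cost(k, v)
            options = []
            for j, (c, r, d, path) in enumerate(best):
                t = transition_rate(candidates[k - 1][j], v)
                options.append((c + d_k + lam * (r_k + t), r + r_k + t, d + d_k, path))
            c, r, d, path = min(options, key=lambda o: o[0])
            new_best.append((c, r, d, path + [v]))
        best = new_best
    c, r, d, path = min(best, key=lambda o: o[0])
    return path, r, d

def solve_for_budget(candidates, block_cost, transition_rate,
                     budget: int, iters: int = 50):
    """Bisect on lambda; keep the last solution whose rate fits the budget."""
    lo, hi = 0.0, 1e6  # assumed bracket for the multiplier
    # If even lambda = hi exceeds the budget, this infeasible result is returned.
    result = viterbi_allocation(candidates, block_cost, transition_rate, hi)
    for _ in range(iters):
        mid = 0.5 * (lo + hi)
        trial = viterbi_allocation(candidates, block_cost, transition_rate, mid)
        if trial[1] <= budget:
            hi, result = mid, trial
        else:
            lo = mid
    return result

# Toy data (made up): three blocks, two candidate vectors each.
TOY_TABLE = [
    {(0, 0): (2, 10), (1, 0): (6, 1)},
    {(0, 0): (2, 8), (1, 0): (5, 2)},
    {(0, 0): (3, 9), (1, 0): (4, 3)},
]
toy_candidates = [list(t.keys()) for t in TOY_TABLE]

def toy_block_cost(k: int, v: Vector) -> Tuple[int, int]:
    return TOY_TABLE[k][v]

def toy_transition_rate(prev: Optional[Vector], v: Vector) -> int:
    px, py = prev if prev is not None else (0, 0)
    return 1 + abs(v[0] - px) + abs(v[1] - py)  # crude differential-code length

if __name__ == "__main__":
    print(solve_for_budget(toy_candidates, toy_block_cost, toy_transition_rate, 15))
```

For a fixed lambda the DP finds the exact minimizer of the Lagrangian cost. The outer bisection only reaches operating points on the convex hull of the rate-distortion set, so the returned rate may fall below the budget rather than meet it exactly.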
