Optimal block size for block-based motion-compensated video coders

In block-based video coding, the current frame to be encoded is decomposed into blocks of the same size, and a motion vector is used to improve the prediction for each block. The motion vectors and the difference frame, which contains the blocks' prediction errors, must be encoded with bits. Typically, choosing a smaller block size will improve the prediction and hence decrease the number of difference frame bits, but it will increase the number of motion bits since more motion vectors need to be encoded. Not surprisingly, there must be some value for the block size that optimizes the tradeoff between motion and difference frame bits, and thus minimizes their sum. Despite the widespread experience with block-based video coders, there is little analysis or theory that quantitatively explains the effect of block size on encoding bit rate, and ordinarily the block size for a coder is chosen based on empirical experiments on video sequences of interest. In this work, we derive a procedure to determine the optimal block size that minimizes the encoding rate for a typical block-based video coder. To do this, we analytically model the effect of block size and derive expressions for the encoding rates for both motion vectors and difference frames, as functions of block size. Minimizing these expressions leads to a simple formula that indicates how to choose the block size in these types of coders. This formula also shows that the best block size is a function of the accuracy with which the motion vectors are encoded and several parameters related to key characteristics of the video scene,such as image texture, motion activity, interframe noise, and coding distortion. We implement the video coder and use our analysis to optimize and explain its performance on real video frames.

[1]  Bernd Girod,et al.  Motion-compensating prediction with fractional-pel accuracy , 1993, IEEE Trans. Commun..

[2]  David L. Neuhoff,et al.  Reducing rate/complexity in video coding by motion estimation with block adaptive accuracy , 1996, Other Conferences.

[3]  R. L. Baker,et al.  Rate-distortion optimized motion compensation for video compression using fixed or variable size blocks , 1991, IEEE Global Telecommunications Conference GLOBECOM '91: Countdown to the New Millennium. Conference Record.

[4]  Frederic Dufaux,et al.  Entropy criterion for optimal bit allocation between motion and prediction error information , 1993, Other Conferences.

[5]  Peter Strobach Tree-structured scene adaptive coder , 1990, IEEE Trans. Commun..

[6]  Arun N. Netravali,et al.  Digital Pictures: Representation and Compression , 1988 .

[7]  Mark J. T. Smith,et al.  Application of motion-compensated prediction to coding ultrasound video , 1996, Other Conferences.

[8]  Graham R. Martin,et al.  Variable size block matching motion estimation with minimal error , 1996, Electronic Imaging.

[9]  Didier Le Gall,et al.  MPEG: a video compression standard for multimedia applications , 1991, CACM.

[10]  Vijay K. Madisetti,et al.  Lossy techniques for motion vector encoding , 1996, Other Conferences.

[11]  Philippe Guillotel,et al.  Comparison of motion vector coding techniques , 1994, Other Conferences.

[12]  Aggelos K. Katsaggelos,et al.  A video compression scheme with optimal bit allocation between displacement vector field and displaced frame difference , 1996, 1996 IEEE International Conference on Acoustics, Speech, and Signal Processing Conference Proceedings.