Fast intermode decision in H.264/AVC video coding

The new video coding standard, H.264/MPEG-4 AVC, uses variable block sizes ranging from 4×4 to 16×16 in interframe coding. This feature achieves a significant coding gain compared with coding a macroblock (MB) at a fixed block size. However, it results in extremely high computational complexity when a brute-force rate-distortion optimization (RDO) algorithm is used. This paper proposes a fast intermode decision algorithm to decide the best mode in intercoding. It makes use of the spatial homogeneity and the temporal stationarity characteristics of video objects. Specifically, the spatial homogeneity of an MB is decided based on the MB's edge intensity, and temporal stationarity is decided by the difference between the current MB and its colocated counterpart in the reference frame. Based on the homogeneity and stationarity of the video objects, only a small number of intermodes are selected in the RDO process. The experimental results show that the fast intermode decision algorithm reduces encoding time by 30% on average, with a negligible peak signal-to-noise ratio loss of 0.03 dB or, equivalently, a bit rate increment of 0.6%.
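
The idea of pruning the RDO candidate set can be illustrated with a minimal sketch. It is not the paper's implementation: the edge measure (mean absolute first difference), the thresholds `edge_thr` and `diff_thr`, the function names, and the mapping from each classification to a mode subset are all assumptions chosen for illustration.

```python
# Illustrative sketch only. The edge measure, thresholds, and mode subsets
# below are assumptions, not values taken from the paper.

ALL_INTER_MODES = ["SKIP", "16x16", "16x8", "8x16", "8x8", "8x4", "4x8", "4x4"]


def edge_intensity(mb):
    """Mean absolute horizontal plus vertical first difference over the block."""
    n = len(mb)
    total = 0
    for y in range(n):
        for x in range(n):
            if x + 1 < n:
                total += abs(mb[y][x + 1] - mb[y][x])
            if y + 1 < n:
                total += abs(mb[y + 1][x] - mb[y][x])
    return total / (n * n)


def mean_abs_difference(mb, colocated):
    """Mean absolute difference between the MB and its colocated MB."""
    n = len(mb)
    total = sum(
        abs(mb[y][x] - colocated[y][x]) for y in range(n) for x in range(n)
    )
    return total / (n * n)


def candidate_modes(mb, colocated, edge_thr=2.0, diff_thr=1.0):
    """Return the intermodes that would be passed on to the RDO search."""
    stationary = mean_abs_difference(mb, colocated) < diff_thr
    homogeneous = edge_intensity(mb) < edge_thr
    if stationary:
        # Assumed subset: little temporal change, so only large modes are tried.
        return ["SKIP", "16x16"]
    if homogeneous:
        # Assumed subset: smooth texture, so sub-8x8 partitions are skipped.
        return ["SKIP", "16x16", "16x8", "8x16"]
    # Neither property holds: fall back to the full search.
    return list(ALL_INTER_MODES)
```

Here `mb` and `colocated` are square lists of luma samples (for example 16×16). An encoder would evaluate the RD cost only for the returned modes, which is where the time saving comes from; the thresholds would need tuning against the quantization parameter and content.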
