Block INTER mode decision for fast encoding of H.264

The paper presents a fast block INTER mode decision algorithm to improve significantly the time efficiency of the encoder in H.264. It makes use of the spatial homogeneity of a video object's textures and the temporal stationarity characteristics inherent in video sequences. Specifically, the homogeneity decision of a block is based on edge information, and MB differencing is used to judge whether the MB is time-stationary. Based on the above analysis, only parts of inter prediction modes are chosen for RDO (rate distortion optimization) calculation. Experimental results show that the new scheme is able to achieve a reduction of 30% encoding time on average, with a negligible average PSNR loss of only 0.03 dB and a mere 0.6% bit rate increase compared with the original H.264 reference software.

[1]  Dragutin Petkovic,et al.  Query by Image and Video Content: The QBIC System , 1995, Computer.

[2]  Erkki Oja,et al.  PicSOM-self-organizing image retrieval with MPEG-7 content descriptors , 2002, IEEE Trans. Neural Networks.

[3]  Berthold K. P. Horn,et al.  Determining Optical Flow , 1981, Other Conferences.

[4]  오승준 [서평]「Digital Video Processing」 , 1996 .

[5]  Thomas S. Huang,et al.  Content-based image retrieval with relevance feedback in MARS , 1997, Proceedings of International Conference on Image Processing.

[6]  Ling Guan,et al.  Automatic machine interactions for content-based image retrieval using a self-organizing tree map architecture , 2002, IEEE Trans. Neural Networks.

[7]  Thomas S. Huang,et al.  Relevance feedback: a power tool for interactive content-based image retrieval , 1998, IEEE Trans. Circuits Syst. Video Technol..

[8]  Amarnath Gupta,et al.  Visual information retrieval , 1997, CACM.

[9]  Shih-Fu Chang,et al.  VisualSEEk: a fully automated content-based image query system , 1997, MULTIMEDIA '96.

[10]  Gary Sullivan,et al.  Recommended Simulation Common Conditions for H.26L Coding Efficiency Experiments on Low Resolution Progressive Scan Source Material , 2001 .

[11]  Qi Tian,et al.  Discriminant-EM algorithm with application to image retrieval , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[12]  Nuno Vasconcelos,et al.  Learning from User Feedback in Image Retrieval Systems , 1999, NIPS.

[13]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[14]  DeLiang Wang,et al.  Image segmentation using local spectral histograms , 2001, Proceedings 2001 International Conference on Image Processing (Cat. No.01CH37205).

[15]  Toshio Uchiyama,et al.  Estimation of homogeneous regions for segmentation of textured images , 2000, Proceedings 15th International Conference on Pattern Recognition. ICPR-2000.