MPEG-7 Descriptors Based Shot Detection and Adaptive Initial Quantization Parameter Estimation for the H.264/AVC

Currently no information about different shots is used in the H.264/AVC video coding standard. This kind of information can help us choose more optimally the size of the Group of Pictures (GOPs) used for encoding of video content. In this paper, we initially propose an MPEG-7 descriptor based shot detection technique with low computational cost for H.264/AVC. Then we propose an adaptive initial quantization parameter (QP) estimation method for each shot based on modeling, training according to the content of video sequences and the shot detection method of our previous step. This two step architecture can help us reduce the bit rate and PSNR fluctuation when video sequences have multiple shots. Our proposed scheme outperforms the rate control of the H.264/AVC significantly in terms of reducing the average bit rate fluctuation (variance) by 8.6%-99% and the average PSNR fluctuation (variance) by 5%-99% between shots. Experimental results also demonstrate that the proposed algorithm can achieve similar or even better Rate Distortion (R-D) performance than standard Rate Control algorithms. It is also applicable in computationally and memory restricted devices since it needs maximum 2 frames buffer space for MPEG-7 descriptor calculation, while the average amount of extra processing is only about 5.8% of the total CPU cycles.

[1]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[2]  Yücel Altunbasak,et al.  Frame bit allocation for the H.264/AVC video coder via Cauchy-density-based rate and distortion models , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[3]  Yasuhiro Takishima,et al.  A study on rate distortion optimization scheme for JVT coder , 2003, Visual Communications and Image Processing.

[4]  Joseph W. Goodman,et al.  A mathematical analysis of the DCT coefficient distributions for images , 2000, IEEE Trans. Image Process..

[5]  G. Qiu Indexing chromatic and achromatic patterns for content-based colour image retrieval , 2002, Pattern Recognit..

[6]  E. Kasutani,et al.  Visual program navigation system based on spatial distribution of color , 2000, 2000 Digest of Technical Papers. International Conference on Consumer Electronics. Nineteenth in the Series (Cat. No.00CH37102).

[7]  Do-Kyoung Kwon,et al.  Rate Control for H.264 Video With Enhanced Rate and Distortion Models , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[8]  B. S. Manjunath,et al.  Introduction to MPEG-7: Multimedia Content Description Interface , 2002 .

[9]  B. S. Manjunath,et al.  Color and texture descriptors , 2001, IEEE Trans. Circuits Syst. Video Technol..

[10]  Gary J. Sullivan,et al.  Rate-constrained coder control and comparison of video coding standards , 2003, IEEE Trans. Circuits Syst. Video Technol..

[11]  C. Won,et al.  Efficient Use of MPEG‐7 Edge Histogram Descriptor , 2002 .