A Long-Term Reference Frame for Hierarchical B-Picture-Based Video Coding

Generally, H.264/AVC video coding standard with hierarchical bipredictive picture (HBP) structure outperforms the classical prediction structures such as “IPPP...” and “IBBP...” through better exploitation of data correlation using reference frames and unequal quantization setting among frames. However, multiple reference frames (MRFs) techniques are not fully exploited in the HBP scheme because of the computational requirement for B-frames, unavailability of adjacent reference frames, and with no explicit sorting of the reference frames for foreground or background being used. To exploit MRFs fully and explicitly in background referencing, we observe that not a single frame of a video is appropriate to be the reference frame as no one covers adequate background of a video. To overcome the problems, we propose a new coding scheme with the HBP, which uses the most common frame in scene (McFIS), generated by background modeling, as a long-term reference (LTR) frame for the third unipredictive reference frame, so that foreground and background areas are expected to be referenced from the two frames in the HBP structure and the McFIS, respectively. There are two approaches to generate McFIS under the proposed methodology. In the first approach, we generate a McFIS using a number of original frames of a scene in a video and then encode it as an I-frame with a higher quality. For the rest of the scene, this generated I-frame is used as an LTR frame. In the second approach, we generate an McFIS from the decoded frames and then use it as an LTR frame, without the need to encode the McFIS. The first and the second approaches are suitable for a video with static background and dynamic background, respectively. In general, the second approach requires more computational time than that of the the first approach. The experiments confirm that the proposed scheme outperforms three state-of-the-art algorithms by improving the image quality significantly with reduced computational time.

[1]  Josef Kittler,et al.  Using background memory for efficient video coding , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).

[2]  Dar-Shyang Lee,et al.  Effective Gaussian mixture learning for video background subtraction , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[3]  Thomas Sikora,et al.  Background modeling for video coding: From sprites to Global Motion Temporal filtering , 2010, Proceedings of 2010 IEEE International Symposium on Circuits and Systems.

[4]  Heiko Schwarz,et al.  Overview of the Scalable Video Coding Extension of the H.264/AVC Standard , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Qian Huang,et al.  An efficient coding scheme for surveillance videos captured by stationary cameras , 2010, Visual Communications and Image Processing.

[6]  Satoshi Kondo,et al.  Motion-compensated video coding using sliced blocks , 2007, Systems and Computers in Japan.

[7]  Dietmar Hepper,et al.  Efficiency analysis and application of uncovered background prediction in a low bit rate image coder , 1990, IEEE Trans. Commun..

[8]  Itu-T and Iso Iec Jtc Advanced video coding for generic audiovisual services , 2010 .

[9]  Manoranjan Paul,et al.  Video Coding Focusing on Block Partitioning and Occlusion , 2010, IEEE Transactions on Image Processing.

[10]  Liang-Gee Chen,et al.  Analysis and complexity reduction of multiple reference frames motion estimation in H.264/AVC , 2006, IEEE Transactions on Circuits and Systems for Video Technology.

[11]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[12]  Tien-Ying Kuo,et al.  Efficient Reference Frame Selector for H.264 , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[13]  Ennio Gambi,et al.  An Optimized Dynamic Scene Change Detection Algorithm for H.264/AVC Encoded Video Sequences , 2010, Int. J. Digit. Multim. Broadcast..

[14]  Markus Flierl,et al.  Generalized B pictures and the draft H.264/AVC video-compression standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[15]  Bu-Sung Lee,et al.  Video coding using the most common frame in scene , 2010, 2010 IEEE International Conference on Acoustics, Speech and Signal Processing.

[16]  Zhi Liu,et al.  An Adaptive and Fast Multiframe Selection Algorithm for H.264 Video Coding , 2007, IEEE Signal Processing Letters.

[17]  Ajay Luthra,et al.  Overview of the H.264/AVC video coding standard , 2003, IEEE Trans. Circuits Syst. Video Technol..

[18]  Markus Rupp,et al.  SCENE CHANGE DETECTION FOR H.264 USING DYNAMIC THRESHOLD TECHNIQUES , 2005 .

[19]  Bu-Sung Lee,et al.  Explore and Model Better I-Frames for Video Coding , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[20]  Sergio Saponara,et al.  “The JVT Advanced Video Coding Standard: Complexity and Performance Analysis on a Tool-by-tool Basis” , 2003 .

[21]  Gary J. Sullivan,et al.  Overview of the Stereo and Multiview Video Coding Extensions of the H.264/MPEG-4 AVC Standard , 2011, Proceedings of the IEEE.

[22]  Liang-Gee Chen,et al.  Efficient moving object segmentation algorithm using background registration technique , 2002, IEEE Trans. Circuits Syst. Video Technol..

[23]  Bernd Girod,et al.  Background extraction and long-term memory motion-compensated prediction for spatial-random-access-enabled video coding , 2009, 2009 Picture Coding Symposium.

[24]  Naoki Mukawa,et al.  Uncovered Background Prediction in Interframe Coding , 1985, IEEE Trans. Commun..

[25]  Pamela C. Cosman,et al.  Selection of Long-Term Reference Frames in Dual-Frame Video Coding Using Simulated Annealing , 2008, IEEE Signal Processing Letters.

[26]  Josef Kittler,et al.  A background memory update scheme for H.263 video codec , 1998, 9th European Signal Processing Conference (EUSIPCO 1998).

[27]  G. Bjontegaard,et al.  Calculation of Average PSNR Differences between RD-curves , 2001 .

[28]  Bu-Sung Lee,et al.  Direct Intermode Selection for H.264 Video Coding Using Phase Correlation , 2011, IEEE Transactions on Image Processing.

[29]  Heiko Schwarz,et al.  Analysis of Hierarchical B Pictures and MCTF , 2006, 2006 IEEE International Conference on Multimedia and Expo.

[30]  Qionghai Dai,et al.  Background-frame based motion compensation for video compression , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[31]  Wen Gao,et al.  Dual Frame Motion Compensation With Optimal Long-Term Reference Frame Selection and Bit Allocation , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[32]  Pamela C. Cosman,et al.  Dual Frame Motion Compensation With Uneven Quality Assignment , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[33]  Xianguo Zhang,et al.  Low-complexity and high-efficiency background modeling for surveillance video coding , 2012, 2012 Visual Communications and Image Processing.

[34]  Thomas Wiegand,et al.  Motion-compensating long-term memory prediction , 1997, Proceedings of International Conference on Image Processing.

[35]  Patrick Garda,et al.  Accelerating the multiple reference frames compensation in the H.264 video coder , 2009, Journal of Real-Time Image Processing.

[36]  Bu-Sung Lee,et al.  McFIS: Better I-frame for video coding , 2010, Proceedings of 2010 IEEE International Symposium on Circuits and Systems.

[37]  Jar-Ferr Yang,et al.  Adaptive group-of-pictures and scene change detection methods based on existing H.264 advanced video coding information , 2008 .

[38]  Manoranjan Paul,et al.  Improved Gaussian mixtures for robust object detection by adaptive multi-background generation , 2008, 2008 19th International Conference on Pattern Recognition.