Background Modeling and Referencing for Moving Cameras-Captured Surveillance Video Coding in HEVC

Surveillance video coding is crucial for improving compression efficiency in intelligent video surveillance systems and applications. Plenty of work has been done, which can be roughly divided into two categories: the former mainly focuses on low-complexity background modeling to obtain the clear background, while the latter focuses on an appropriate coding strategy to generate the high-quality background reference picture for effective background prediction. However, almost all existing works focus only on stationary camera scenes, while moving cameras-captured surveillance video coding is left untouched and is still an open problem. In this paper, a background modeling and referencing scheme for moving cameras-captured surveillance video coding in high-efficiency video coding (HEVC) is proposed. First, this paper proposes a low-complexity motion background modeling algorithm for surveillance video coding using the running average based on a global-motion-compensation method. To obtain the global motion vector, we propose a global motion detection method based on character blocks by establishing a low-rank singular value decomposition model for clustering and estimating motion vectors of background character blocks in the cameras movement circumstance. Second, we propose a background referencing coding strategy, in which the motion background coding tree units (MBCTUs) would be selected by anchoring the input video frame on the modeling background frame and coded with the optimized quantization parameter. Then, the reconstructed MBCTU will be used to update the previous coding tree unit in the global compensation location of the background reference picture. Extensive experimental results show that the proposed scheme can achieve significant bit savings of up to 26.6% and, on average, 6.7% with similar subjective quality and negligible encoding complexity, compared to HM12.0. Besides, the proposed scheme consistently outperforms two state-of-the-art surveillance video coding schemes with remarkable bitrate savings.

[1]  Wei Xiong,et al.  Moving Object Extraction with a Hand-held Camera , 2007, 2007 IEEE 11th International Conference on Computer Vision.

[2]  Debi Prosad Dogra,et al.  Autonomous detection and tracking under illumination changes, occlusions and moving camera , 2015, Signal Process..

[3]  Dong Liu,et al.  Block-Composed Background Reference for High Efficiency Video Coding , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[4]  Manoranjan Paul,et al.  An efficient video coding technique using a novel non-parametric background model , 2014, 2014 IEEE International Conference on Multimedia and Expo Workshops (ICMEW).

[5]  Soon Ki Jung,et al.  Spatiotemporal Low-Rank Modeling for Complex Scene Background Initialization , 2018, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Xianguo Zhang,et al.  The IEEE 1857 Standard: Empowering Smart Video Surveillance Systems , 2014, IEEE Intelligent Systems.

[7]  Ivan V. Bajic,et al.  A Joint Approach to Global Motion Estimation and Motion Segmentation From a Coarsely Sampled Motion Vector Field , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[8]  Malay Kumar Kundu,et al.  Efficient Foreground Extraction From HEVC Compressed Video for Application to Real-Time Analysis of Surveillance ‘Big’ Data , 2015, IEEE Transactions on Image Processing.

[9]  Thierry Bouwmans,et al.  Traditional and recent approaches in background modeling for foreground detection: An overview , 2014, Comput. Sci. Rev..

[10]  Ruimin Hu,et al.  Background Subtraction With Video Coding , 2013, IEEE Signal Processing Letters.

[11]  Takeo Kanade,et al.  Background Subtraction for Freely Moving Cameras , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[12]  Chongyu Chen,et al.  Incremental low-rank and sparse decomposition for compressing videos captured by fixed cameras , 2015, J. Vis. Commun. Image Represent..

[13]  Ying Liu,et al.  Compressed-Sensed-Domain L1-PCA Video Surveillance , 2015, IEEE Transactions on Multimedia.

[14]  Yongfei Zhang,et al.  Motion Classification-Based Fast Motion Estimation for High-Efficiency Video Coding , 2017, IEEE Transactions on Multimedia.

[15]  Thomas Wiegand,et al.  Long-term memory motion-compensated prediction , 1999, IEEE Trans. Circuits Syst. Video Technol..

[16]  Alper Yilmaz,et al.  Background subtraction for the moving camera: A geometric approach , 2014, Comput. Vis. Image Underst..

[17]  Guillaume-Alexandre Bilodeau,et al.  Universal Background Subtraction Using Word Consensus Models , 2016, IEEE Transactions on Image Processing.

[18]  Yaser Sheikh,et al.  Bayesian modeling of dynamic scenes for object detection , 2005, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[19]  Gary J. Sullivan,et al.  Rate-distortion optimization for video compression , 1998, IEEE Signal Process. Mag..

[20]  Xianguo Zhang,et al.  A Fast and Performance-Maintained Transcoding Method Based on Background Modeling for Surveillance Video , 2012, 2012 IEEE International Conference on Multimedia and Expo.

[21]  Richard Harrington,et al.  An Editor's Guide to Adobe Premiere Pro , 2011 .

[22]  Gérard G. Medioni,et al.  Detecting Motion Regions in the Presence of a Strong Parallax from a Moving Camera by Multiview Geometric Constraints , 2007, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[23]  Bu-Sung Lee,et al.  A Long-Term Reference Frame for Hierarchical B-Picture-Based Video Coding , 2014, IEEE Transactions on Circuits and Systems for Video Technology.

[24]  Wen Gao,et al.  Dual Frame Motion Compensation With Optimal Long-Term Reference Frame Selection and Bit Allocation , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[25]  Xianguo Zhang,et al.  Background-Modeling-Based Adaptive Prediction for Surveillance Video Coding , 2014, IEEE Transactions on Image Processing.

[26]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[27]  Yongtian Wang,et al.  PTZ camera-based adaptive panoramic and multi-layered background model , 2011, 2011 18th IEEE International Conference on Image Processing.

[28]  Xianguo Zhang,et al.  A background proportion adaptive Lagrange multiplier selection method for surveillance video on HEVC , 2013, 2013 IEEE International Conference on Multimedia and Expo (ICME).

[29]  Alex Pentland,et al.  Pfinder: real-time tracking of the human body , 1996, Proceedings of the Second International Conference on Automatic Face and Gesture Recognition.

[30]  Antoine Vacavant,et al.  A comprehensive review of background subtraction algorithms evaluated with synthetic and real videos , 2014, Comput. Vis. Image Underst..

[31]  Pamela C. Cosman,et al.  Selection of Long-Term Reference Frames in Dual-Frame Video Coding Using Simulated Annealing , 2008, IEEE Signal Processing Letters.

[32]  King Ngi Ngan,et al.  Low-Delay Rate Control for Consistent Quality Using Distortion-Based Lagrange Multiplier , 2016, IEEE Transactions on Image Processing.

[33]  Houqiang Li,et al.  $\lambda $ Domain Rate Control Algorithm for High Efficiency Video Coding , 2014, IEEE Transactions on Image Processing.

[34]  Truong Q. Nguyen,et al.  Moving Object Detection With a Freely Moving Camera via Background Motion Subtraction , 2017, IEEE Transactions on Circuits and Systems for Video Technology.

[35]  Chongyu Chen,et al.  Surveillance video coding via low-rank and sparse decomposition , 2012, ACM Multimedia.

[36]  Zixiang Xiong,et al.  Knowledge-Based Coding of Objects for Multisource Surveillance Video Data , 2016, IEEE Transactions on Multimedia.

[37]  Kongqiao Wang,et al.  Character location in scene images from digital camera , 2003, Pattern Recognit..

[38]  Tiejun Huang,et al.  Background-foreground division based search for motion estimation in surveillance video coding , 2014, 2014 IEEE International Conference on Multimedia and Expo (ICME).

[39]  Xianguo Zhang,et al.  Optimizing the Hierarchical Prediction and Coding in HEVC for Surveillance and Conference Videos With Background Modeling , 2014, IEEE Transactions on Image Processing.

[40]  Guillaume-Alexandre Bilodeau,et al.  SuBSENSE: A Universal Change Detection Method With Local Adaptive Sensitivity , 2015, IEEE Transactions on Image Processing.

[41]  Soon Ki Jung,et al.  Decomposition into Low-rank plus Additive Matrices for Background/Foreground Separation: A Review for a Comparative Evaluation with a Large-Scale Dataset , 2015, Comput. Sci. Rev..

[42]  Zhongyuan Wang,et al.  Smart Monitoring Cameras Driven Intelligent Processing to Big Surveillance Video Data , 2018, IEEE Transactions on Big Data.

[43]  Marc Van Droogenbroeck,et al.  ViBe: A Universal Background Subtraction Algorithm for Video Sequences , 2011, IEEE Transactions on Image Processing.

[44]  Lucia Maddalena,et al.  Neural Background Subtraction for Pan-Tilt-Zoom Cameras , 2014, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[45]  W. Eric L. Grimson,et al.  Learning Patterns of Activity Using Real-Time Tracking , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[46]  Bo Li,et al.  Complexity Correlation-Based CTU-Level Rate Control with Direction Selection for HEVC , 2017, ACM Trans. Multim. Comput. Commun. Appl..

[47]  Paul Rodríguez,et al.  Incremental Principal Component Pursuit for Video Background Modeling , 2016, 2017 IEEE International Conference on Computer Vision Workshops (ICCVW).