Skip Decision and Reference Frame Selection for Low-Complexity H.264/AVC Surveillance Video Coding

H.264/advanced video coding surveillance video encoders use the Skip mode specified by the standard to reduce bandwidth. They also use multiple frames as reference for motion-compensated prediction. In this paper, we propose two techniques to reduce the bandwidth and computational cost of static camera surveillance video encoders without affecting detection and recognition performance. A spatial sampler is proposed to sample pixels that are segmented using a Gaussian mixture model. Modified weight updates are derived for the parameters of the mixture model to reduce floating point computations. A storage pattern of the parameters in memory is also modified to improve cache performance. Skip selection is performed using the segmentation results of the sampled pixels. The second contribution is a low computational cost algorithm to choose the reference frames. The proposed reference frame selection algorithm reduces the cost of coding uncovered background regions. We also study the number of reference frames required to achieve good coding efficiency. Distortion over foreground pixels is measured to quantify the performance of the proposed techniques. Experimental results show bit rate savings of up to 94.5% over methods proposed in literature on video surveillance data sets. The proposed techniques also provide up to 74.5% reduction in compression complexity without increasing the distortion over the foreground regions in the video sequence.

[1]  Bin Li,et al.  Rate-Distortion Optimized Reference Picture Management for High Efficiency Video Coding , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[2]  P. Wayne Power,et al.  Understanding Background Mixture Models for Foreground Segmentation , 2002 .

[3]  Bharadwaj S. Amrutur,et al.  Speeded up Gaussian Mixture Model algorithm for background subtraction , 2011, 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[4]  Kai-Kuang Ma,et al.  Fast Mode Decision for H.264/AVC Based on Macroblock Motion Activity , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Chang-Su Kim,et al.  Fast background subtraction algorithm using two-level sampling and silhouette detection , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[6]  Chong-Min Kyung,et al.  Energy-Aware Video Encoding for Image Quality Improvement in Battery-Operated Surveillance Camera , 2012, IEEE Transactions on Very Large Scale Integration (VLSI) Systems.

[7]  Jordan W. Suchow,et al.  Motion Silences Awareness of Visual Change , 2011, Current Biology.

[8]  Wen Gao,et al.  Dual Frame Motion Compensation With Optimal Long-Term Reference Frame Selection and Bit Allocation , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[9]  Pamela C. Cosman,et al.  Dual Frame Motion Compensation With Uneven Quality Assignment , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[10]  Hyung Jin Chang,et al.  Active attentional sampling for speed-up of background subtraction , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Chih-Hsien Hsia,et al.  Hierarchical Method for Foreground Detection Using Codebook Model , 2011, IEEE Trans. Circuits Syst. Video Technol..

[12]  Satoshi Goto,et al.  Encoder adaptable difference detection for low power video compression in surveillance system , 2010, Signal Process. Image Commun..

[13]  Zoran Zivkovic,et al.  Improved adaptive Gaussian mixture model for background subtraction , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[14]  Christopher M. Bishop,et al.  Pattern Recognition and Machine Learning (Information Science and Statistics) , 2006 .

[15]  Karimulla Shaik,et al.  Application Parameter Settings for TMS320DM365 H.264 Encoder , 2010 .

[16]  Bu-Sung Lee,et al.  Explore and Model Better I-Frames for Video Coding , 2011, IEEE Transactions on Circuits and Systems for Video Technology.

[17]  Jen-Hui Chuang,et al.  Regularized Background Adaptation: A Novel Learning Rate Control Scheme for Gaussian Mixture Modeling , 2011, IEEE Transactions on Image Processing.

[18]  Pacha Shobarani ENERGY-AWARE VIDEO ENCODING FOR IMAGE QUALITY IMPROVEMENT IN BATTERY- OPERATED SURVEILLANCE CAMERA , 2015 .

[19]  Pamela C. Cosman,et al.  Selection of Long-Term Reference Frames in Dual-Frame Video Coding Using Simulated Annealing , 2008, IEEE Signal Processing Letters.

[20]  Dietmar Hepper,et al.  Efficiency analysis and application of uncovered background prediction in a low bit rate image coder , 1990, IEEE Trans. Commun..

[21]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[22]  Paul A. Viola,et al.  Robust Real-Time Face Detection , 2001, Proceedings Eighth IEEE International Conference on Computer Vision. ICCV 2001.

[23]  Bin Li,et al.  Optimized reference frame selection for video coding by cloud , 2011, 2011 IEEE 13th International Workshop on Multimedia Signal Processing.

[24]  Avinash C. Kak,et al.  Hierarchical Data Structure for Real-Time Background Subtraction , 2006, 2006 International Conference on Image Processing.

[25]  Hélène Laurent,et al.  Comparative study of background subtraction algorithms , 2010, J. Electronic Imaging.

[26]  Avishek Saha,et al.  SKIP Prediction for Fast Rate Distortion Optimization in H.264 , 2007, IEEE Transactions on Consumer Electronics.

[27]  Guodong Guo,et al.  Patch-based Image Correlation with Rapid Filtering , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[28]  Edward J. Delp,et al.  Efficient and Low-Complexity Surveillance Video Compression Using Backward-Channel Aware Wyner-Ziv Video Coding , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[29]  Kazuhiko Sumi,et al.  Object-based coding for long-term archive of surveillance video , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[30]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[31]  Liang-Gee Chen,et al.  Efficient moving object segmentation algorithm using background registration technique , 2002, IEEE Trans. Circuits Syst. Video Technol..

[32]  Qionghai Dai,et al.  Background-frame based motion compensation for video compression , 2004, 2004 IEEE International Conference on Multimedia and Expo (ICME) (IEEE Cat. No.04TH8763).

[33]  Mary M Hayhoe,et al.  Task and context determine where you look. , 2016, Journal of vision.

[34]  Rita Cucchiara,et al.  Detecting Moving Objects, Ghosts, and Shadows in Video Streams , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[35]  Iain E. Garden Richardson,et al.  Low-complexity skip prediction for H.264 through Lagrangian cost estimation , 2006, IEEE Transactions on Circuits and Systems for Video Technology.