Scene Video Coding

Scene videos are defined as the kinds of video captured from a scene for a long while. Specifically, surveillance video is a typical kind of scene video, and this chapter will introduce some efficient coding tools developed for surveillance video in AVS, which also work well for other common scene video, e.g., conference video. This chapter consists of five parts. The first part introduces the background redundancy features of surveillance video. The second part gives a brief overview of model-based coding. The third part details the background picture model-based coding method, and typical background models are surveyed. In the fourth part, background picture-based surveillance coding methods in AVS and AVS2 are provided. The last part summarizes this chapter.

[1]  Kentaro Toyama,et al.  Wallflower: principles and practice of background maintenance , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[2]  Gary J. Sullivan,et al.  Motion compensation for video compression using control grid interpolation , 1991, [Proceedings] ICASSP 91: 1991 International Conference on Acoustics, Speech, and Signal Processing.

[3]  Xianguo Zhang,et al.  Optimizing the Hierarchical Prediction and Coding in HEVC for Surveillance and Conference Videos With Background Modeling , 2014, IEEE Transactions on Image Processing.

[4]  Larry S. Davis,et al.  W4: Real-Time Surveillance of People and Their Activities , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Peter Gerken,et al.  Object-based analysis-synthesis coding of image sequences at very low bit rates , 1994, IEEE Trans. Circuits Syst. Video Technol..

[6]  O. J. Morris,et al.  Segmented-image coding: Performance comparison with the discrete cosine transform , 1988 .

[7]  Fan Zhang,et al.  Adaptive Block-size Transform based Just-Noticeable Difference model for images/videos , 2011, Signal Process. Image Commun..

[8]  Gary J. Sullivan,et al.  Overview of the High Efficiency Video Coding (HEVC) Standard , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[9]  W. Eric L. Grimson,et al.  Learning Patterns of Activity Using Real-Time Tracking , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[10]  Rita Cucchiara,et al.  Detecting Moving Objects, Ghosts, and Shadows in Video Streams , 2003, IEEE Trans. Pattern Anal. Mach. Intell..

[11]  A. Murat Tekalp,et al.  Face and 2-D mesh animation in MPEG-4 , 2000, Signal Process. Image Commun..

[12]  Xianguo Zhang,et al.  Fast and Efficient Transcoding Based on Low-Complexity Background Modeling and Adaptive Block Classification , 2013, IEEE Transactions on Multimedia.

[13]  Mubarak Shah,et al.  An object-based video coding framework for video sequences obtained from static cameras , 2005, MULTIMEDIA '05.

[14]  Sergio A. Velastin,et al.  Automatic congestion detection system for underground platforms , 2001, Proceedings of 2001 International Symposium on Intelligent Multimedia, Video and Speech Processing. ISIMP 2001 (IEEE Cat. No.01EX489).

[15]  Zoran Bojkovic,et al.  Audiovisual integration in multimedia communications based on MPEG-4 facial animation , 2001 .

[16]  Kazuhiko Sumi,et al.  Object-based coding for long-term archive of surveillance video , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[17]  R. J. Safranek,et al.  A perceptually tuned sub-band image coder with image dependent quantization and post-quantization data compression , 1989, International Conference on Acoustics, Speech, and Signal Processing,.

[18]  Scott J. Daly,et al.  Visible differences predictor: an algorithm for the assessment of image fidelity , 1992, Electronic Imaging.

[19]  Hans Georg Musmann Object-oriented analysis-synthesis coding based on source models of moving 2D- and 3D-objects , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[20]  Wen Gao,et al.  Nonparametric Background Generation , 2006, 18th International Conference on Pattern Recognition (ICPR'06).

[21]  Yao Wang,et al.  Video Processing and Communications , 2001 .

[22]  Robert J. Safranek,et al.  Signal compression based on models of human perception , 1993, Proc. IEEE.

[23]  Michael Hötter,et al.  Object-oriented analysis-synthesis coding based on moving two-dimensional objects , 1990, Signal Process. Image Commun..

[24]  Qi Tian,et al.  Foreground object detection in changing background based on color co-occurrence statistics , 2002, Sixth IEEE Workshop on Applications of Computer Vision, 2002. (WACV 2002). Proceedings..

[25]  Theodosios Pavlidis,et al.  Picture Segmentation by a Tree Traversal Algorithm , 1976, JACM.

[26]  Larry S. Davis,et al.  Non-parametric Model for Background Subtraction , 2000, ECCV.

[27]  D. Marpe,et al.  The H.264/MPEG4 advanced video coding standard and its applications , 2006, IEEE Communications Magazine.

[28]  Alex Pentland,et al.  A Bayesian Computer Vision System for Modeling Human Interactions , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[29]  Heiko Schwarz,et al.  Improved H.264/AVC coding using texture analysis and synthesis , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[30]  Janelle J. Harms,et al.  Learning-based multiview video coding , 2009, 2009 Picture Coding Symposium.

[31]  David J. Sakrison,et al.  The effects of a visual fidelity criterion of the encoding of images , 1974, IEEE Trans. Inf. Theory.

[32]  M. Kocher,et al.  Adaptive region growing technique using polynomial functions for image approximation , 1986 .

[33]  Michael Elad,et al.  Compression of facial images using the K-SVD algorithm , 2008, J. Vis. Commun. Image Represent..

[34]  Chun-Hsien Chou,et al.  A perceptually tuned subband image coder based on the measure of just-noticeable-distortion profile , 1995, IEEE Trans. Circuits Syst. Video Technol..

[35]  Yang Li,et al.  Incorporating Primal Sketch Based Learning Into Low Bit-Rate Image Compression , 2007, 2007 IEEE International Conference on Image Processing.

[36]  B. Girod,et al.  Facial Expression Analysis for Model-Based Coding of Video Sequences , 1997 .

[37]  Fan Zhang,et al.  Region-based texture modelling for next generation video codecs , 2010, 2010 IEEE International Conference on Image Processing.

[38]  Alan C. Bovik,et al.  Visual quality assessment algorithms: what does the future hold? , 2010, Multimedia Tools and Applications.

[39]  Christine Guillemot,et al.  Mesh-Based Motion-Compensated Interpolation for Side Information Extraction in Distributed Video Coding , 2006, 2006 International Conference on Image Processing.

[40]  Morris Goldberg,et al.  Hierarchy in Picture Segmentation: A Stepwise Optimization Approach , 1989, IEEE Trans. Pattern Anal. Mach. Intell..

[41]  Mohammad Ghanbari,et al.  General approach to block-matching motion estimation , 1993 .

[42]  Manoranjan Paul,et al.  Improved Gaussian mixtures for robust object detection by adaptive multi-background generation , 2008, 2008 19th International Conference on Pattern Recognition.

[43]  Xianguo Zhang,et al.  Background-Modeling-Based Adaptive Prediction for Surveillance Video Coding , 2014, IEEE Transactions on Image Processing.

[44]  M. Hotter,et al.  Optimization and efficiency of an object-oriented analysis-synthesis coder , 1994 .

[45]  D. E. Pearson,et al.  Developments in model-based video coding , 1995, Proc. IEEE.

[46]  Graeme A. Jones,et al.  Hierarchical probabilistic models for video object segmentation and tracking , 2004, ICPR 2004.

[47]  Andrew B. Watson,et al.  DCT quantization matrices visually optimized for individual images , 1993, Electronic Imaging.

[48]  Kiyoharu Aizawa,et al.  Model-based analysis synthesis image coding (MBASIC) system for a person's face , 1989, Signal Process. Image Commun..

[49]  Massimo Piccardi,et al.  Background subtraction techniques: a review , 2004, 2004 IEEE International Conference on Systems, Man and Cybernetics (IEEE Cat. No.04CH37583).

[50]  Jörn Ostermann,et al.  Object-oriented analysis-synthesis coding of moving images , 1989, Signal Process. Image Commun..

[51]  K.N. Ngan,et al.  Foreground/background video coding scheme , 1997, Proceedings of 1997 IEEE International Symposium on Circuits and Systems. Circuits and Systems in the Information Age ISCAS '97.

[52]  John Gantz,et al.  The Digital Universe in 2020: Big Data, Bigger Digital Shadows, and Biggest Growth in the Far East , 2012 .

[53]  Peter Grant,et al.  Reliable tracking of facial features in semantic-based video coding , 1998 .

[54]  Alex Pentland,et al.  Pfinder: Real-Time Tracking of the Human Body , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[55]  Xiaoyan Sun,et al.  Classified patch learning for spatially scalable video coding , 2009, 2009 16th IEEE International Conference on Image Processing (ICIP).

[56]  Edward H. Adelson,et al.  Representing moving images with layers , 1994, IEEE Trans. Image Process..

[57]  Ferran Marqués,et al.  Segmentation-based video coding system allowing the manipulation of objects , 1997, IEEE Trans. Circuits Syst. Video Technol..

[58]  Hiroshi Harashima,et al.  Intelligent Image Coding and Communications with Realistic Sensations --Recent Trends-- , 1991 .

[59]  Aljoscha Smolic,et al.  Coding Algorithms for 3DTV—A Survey , 2007, IEEE Transactions on Circuits and Systems for Video Technology.

[60]  Anamitra Makur,et al.  Object-based Surveillance Video Compression using Foreground Motion Compensation , 2006, 2006 9th International Conference on Control, Automation, Robotics and Vision.

[61]  W.S. Lin,et al.  Just-noticeable-distortion profile with nonlinear additivity model for perceptual masking in color images , 2003, 2003 IEEE International Conference on Acoustics, Speech, and Signal Processing, 2003. Proceedings. (ICASSP '03)..

[62]  Jörn Ostermann Object-based analysis-synthesis coding (OBASC) based on the source model of moving flexible 3-D objects , 1994, IEEE Trans. Image Process..