The Impact of Video Transcoding Parameters on Event Detection for Surveillance Systems

The process of transcoding videos apart from being computationally intensive, can also be a rather complex procedure. The complexity refers to the choice of appropriate parameters for the transcoding engine, with the aim of decreasing video sizes, transcoding times and network bandwidth without degrading video quality beyond some threshold that event detectors lose their accuracy. This paper explains the need for transcoding, and then studies different video quality metrics. Commonly used algorithms for motion and person detection are briefly described, with emphasis in investigating the optimum transcoding configuration parameters. The analysis of the experimental results reveals that the existing video quality metrics are not suitable for automated systems and that the detection of persons is affected by the reduction of bit rate and resolution, while motion detection is more sensitive to frame rate.

[1]  Edward Y. Chang,et al.  Invariant feature extraction and biased statistical inference for video surveillance , 2003, Proceedings of the IEEE Conference on Advanced Video and Signal Based Surveillance, 2003..

[2]  Noel E. O'Connor,et al.  Automatic camera selection for activity monitoring in a multi-camera system for tennis , 2009, 2009 Third ACM/IEEE International Conference on Distributed Smart Cameras (ICDSC).

[3]  Paul S. Fisher,et al.  Image quality measures and their performance , 1995, IEEE Trans. Commun..

[4]  Mohammed Ghanbari,et al.  A multi-metric objective picture-quality measurement model for MPEG video , 2000, IEEE Trans. Circuits Syst. Video Technol..

[5]  Margaret H. Pinson,et al.  Spatial-temporal distortion metric for in-service quality monitoring of any digital video system , 1999, Optics East.

[6]  Arnaud Doucet,et al.  Particle filters for state estimation of jump Markov linear systems , 2001, IEEE Trans. Signal Process..

[7]  Christopher G. Harris,et al.  A Combined Corner and Edge Detector , 1988, Alvey Vision Conference.

[8]  Takeo Kanade,et al.  An Iterative Image Registration Technique with an Application to Stereo Vision , 1981, IJCAI.

[9]  Cordelia Schmid,et al.  Human Detection Using Oriented Histograms of Flow and Appearance , 2006, ECCV.

[10]  Cordelia Schmid,et al.  Learning realistic human actions from movies , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[11]  Fernando Jaureguizar,et al.  An advanced Bayesian model for the visual tracking of multiple interacting objects , 2011, EURASIP J. Adv. Signal Process..

[12]  Cordelia Schmid,et al.  Evaluation of Local Spatio-temporal Features for Action Recognition , 2009, BMVC.

[13]  Cordelia Schmid,et al.  Action recognition by dense trajectories , 2011, CVPR 2011.

[14]  Fernando Pereira,et al.  MPEG-4 video subjective test procedures and results , 1997, IEEE Trans. Circuits Syst. Video Technol..

[15]  Eero P. Simoncelli,et al.  Image quality assessment: from error visibility to structural similarity , 2004, IEEE Transactions on Image Processing.

[16]  Sheila S. Hemami,et al.  Analyzing the role of visual structure in the recognition of natural image content with multi-scale SSIM , 2008, Electronic Imaging.

[17]  Sugato Chakravarty,et al.  Methodology for the subjective assessment of the quality of television pictures , 1995 .

[18]  Paul Over,et al.  Evaluation campaigns and TRECVid , 2006, MIR '06.

[19]  Wei Tsang Ooi,et al.  Video quality for face detection, recognition, and tracking , 2011, TOMCCAP.

[20]  A. G. Amitha Perera,et al.  Multi-Object Tracking Through Simultaneous Long Occlusions and Split-Merge Conditions , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[21]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).