Automatically assessing properties of dynamic cameras for camera selection and rapid deployment of video content analysis tasks in large-scale ad-hoc networks

Video analytics is essential for managing large quantities of raw data that are produced by video surveillance systems (VSS) for the prevention, repression and investigation of crime and terrorism. Analytics is highly sensitive to changes in the scene, and for changes in the optical chain so a VSS with analytics needs careful configuration and prompt maintenance to avoid false alarms. However, there is a trend from static VSS consisting of fixed CCTV cameras towards more dynamic VSS deployments over public/private multi-organization networks, consisting of a wider variety of visual sensors, including pan-tilt-zoom (PTZ) cameras, body-worn cameras and cameras on moving platforms. This trend will lead to more dynamic scenes and more frequent changes in the optical chain, creating structural problems for analytics. If these problems are not adequately addressed, analytics will not be able to continue to meet end users’ developing needs. In this paper, we present a three-part solution for managing the performance of complex analytics deployments. The first part is a register containing meta data describing relevant properties of the optical chain, such as intrinsic and extrinsic calibration, and parameters of the scene such as lighting conditions or measures for scene complexity (e.g. number of people). A second part frequently assesses these parameters in the deployed VSS, stores changes in the register, and signals relevant changes in the setup to the VSS administrator. A third part uses the information in the register to dynamically configure analytics tasks based on VSS operator input. In order to support the feasibility of this solution, we give an overview of related state-of-the-art technologies for autocalibration (self-calibration), scene recognition and lighting estimation in relation to person detection. The presented solution allows for rapid and robust deployment of Video Content Analysis (VCA) tasks in large scale ad-hoc networks.

[1]  Anton Konushin,et al.  CONVOLUTIONAL NEURAL NETWORK FOR CAMERA POSE ESTIMATION FROM OBJECT DETECTIONS , 2017 .

[2]  George W. Quinn,et al.  Face In Video Evaluation (FIVE) Face Recognition of Non-Cooperative Subjects , 2017 .

[3]  Henri Bouma,et al.  Long-term behavior understanding based on the expert-based combination of short-term observations in high-resolution CCTV , 2016, Security + Defence.

[4]  Henri Bouma,et al.  Finding suspects in multiple cameras for improved railway protection , 2014, Security and Defence.

[5]  François Brémond,et al.  Crowd Behavior Recognition for Video Surveillance , 2008, ACIVS.

[6]  Jacomien G. M. de Jong,et al.  Integrated roadmap for the rapid finding and tracking of people at large airports , 2016, Int. J. Crit. Infrastructure Prot..

[7]  Tal Hassner,et al.  Violent flows: Real-time detection of violent crowd behavior , 2012, 2012 IEEE Computer Society Conference on Computer Vision and Pattern Recognition Workshops.

[8]  Henri Bouma,et al.  Automatic inference of geometric camera parameters and inter-camera topology in uncalibrated disjoint surveillance cameras , 2015, SPIE Security + Defence.

[9]  Klamer Schutte,et al.  Linear colour correction for multiple illumination changes and non-overlapping cameras , 2015, IET Image Process..

[10]  Henri Bouma,et al.  Video content analysis on body-worn cameras for retrospective investigation , 2015, SPIE Security + Defence.

[11]  Lex Lemmers,et al.  Alcohol en uitgaansgeweld , 2005 .

[12]  Yong Xu,et al.  Fighting Detection Based on Optical Flow Context Histogram , 2011, 2011 Second International Conference on Innovations in Bio-inspired Computing and Applications.

[13]  Rama Chellappa,et al.  Handbook of Remote Biometrics , 2009, Advances in Pattern Recognition.

[14]  Simson L. Garfinkel,et al.  First Workshop on Video Analytics in Public Safety | NIST , 2017 .

[15]  Alessandro Perina,et al.  Violence detection in crowded scenes using substantial derivative , 2015, 2015 12th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[16]  Tao Zhang,et al.  Robust extrinsic calibration from pedestrians , 2017, Signal Process. Image Commun..

[17]  François Brémond,et al.  Human violence recognition and detection in surveillance videos , 2016, 2016 13th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[18]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[19]  V. E. Machaca Arceda,et al.  Real Time Violence Detection in Video , 2016, ICPR 2016.

[20]  Xiangjian He,et al.  A new method for violence detection in surveillance scenes , 2015, Multimedia Tools and Applications.

[21]  Hongbin Zha,et al.  Coarse-to-fine vision-based localization by indexing scale-Invariant features , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[22]  Pawan Sinha,et al.  Face Recognition by Humans: Nineteen Results All Computer Vision Researchers Should Know About , 2006, Proceedings of the IEEE.

[23]  Wessel Kraaij,et al.  Requirements for multimedia metadata schemes in surveillance applications for security , 2013, Multimedia Tools and Applications.

[24]  Jasper R. van Huis,et al.  Automatic detection of suspicious behavior of pickpockets with track-based features in a shopping mall , 2014, Security and Defence.

[25]  Alessandro Mecocci,et al.  Real-Time Automatic Detection of Violent-Acts by Low-Level Colour Visual Cues , 2007, 2007 IEEE International Conference on Image Processing.

[26]  Klamer Schutte,et al.  Instantaneous threat detection based on a semantic representation of activities, zones and trajectories , 2014, Signal, Image and Video Processing.

[27]  Romaric Audigier,et al.  RIMOC, a feature to discriminate unstructured motions: Application to violence detection for video-surveillance , 2016, Comput. Vis. Image Underst..

[28]  Pietro Perona,et al.  Fast Feature Pyramids for Object Detection , 2014, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Pieter T. Eendebak,et al.  Improved depth perception with three-dimensional auxiliary display and computer generated three-dimensional panoramic overviews in robot-assisted laparoscopy , 2014, Journal of medical imaging.

[30]  J.H.C. van Rest Surveillance and video analytics: factors influencing the performance , 2015 .

[31]  Klamer Schutte,et al.  Selection of negative samples and two-stage combination of multiple features for action detection in thousands of videos , 2013, Machine Vision and Applications.