A Survey on Behavior Analysis in Video Surveillance Applications

There is an increasing desire and need in video surveillance applications for a proposed solution to be able to analyze human behaviors and identify subjects for standoff threat analysis and determination. The main purpose of this survey is to look at current developments and capabilities of visual surveillance systems and assess the feasibility and challenges of using a visual surveillance system to automatically detect abnormal behavior, detect hostile intent, and identify human subject. Visual (or video) surveillance devices have long been in use to gather information and to monitor people, events and activities. Visual surveillance technologies, CCD cameras, thermal cameras and night vision devices, are the three most widely used devices in the visual surveillance market. Visual surveillance in dynamic scenes, especially for humans, is currently one of the most active research topics in computer vision and artificial intelligence. It has a wide spectrum of promising public safety and security applications, including access control, crowd flux statistics and congestion analysis, human behavior detection and analysis, etc. Visual surveillance in dynamic scene with multiple cameras, attempts to detect, recognize and track certain objects from image sequences, and more importantly to understand and describe object behaviors. The main goal of visual surveillance is to develop intelligent visual surveillance to replace the traditional passive video surveillance that is proving ineffective as the number of cameras exceed the capability of human operators to monitor them. The goal of visual surveillance is not only to put cameras in the place of human eyes, but also to accomplish the entire surveillance task as automatically as possible. The capability of being able to analyze human movements and their activities from image sequences is crucial for visual surveillance. In general, the processing framework of an automated visual surveillance system includes the following stages: Motion/object detection, object classification, object tracking, behavior and activity analysis and understanding, person identification, and camera handoff and data fusion. Almost every visual surveillance system starts with motion and object detection. Motion detection aims at segmenting regions corresponding to moving objects from the rest of an image. Subsequent processes such as object tracking and behavior analysis and recognition are greatly dependent on it. The process of motion/object detection usually involves background/environment modeling and motion segmentation, which intersect each other

[1]  Tim J. Ellis,et al.  Learning semantic scene models from observing activity in visual surveillance , 2005, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[2]  Luc Van Gool,et al.  Modeling and Recognition of Human Actions Using a Stochastic Approach , 2002 .

[3]  Takeo Kanade,et al.  A System for Video Surveillance and Monitoring , 2000 .

[4]  Mohan M. Trivedi,et al.  A Survey of Vision-Based Trajectory Learning and Analysis for Surveillance , 2008, IEEE Transactions on Circuits and Systems for Video Technology.

[5]  Peter H. N. de With,et al.  Flexible Human Behavior Analysis Framework for Video Surveillance Applications , 2010, Int. J. Digit. Multim. Broadcast..

[6]  Paulo Peixoto,et al.  A Lidar and Vision-based Approach for Pedestrian and Vehicle Detection and Tracking , 2007, 2007 IEEE Intelligent Transportation Systems Conference.

[7]  Michael Isard,et al.  Contour Tracking by Stochastic Propagation of Conditional Density , 1996, ECCV.

[8]  Andrea Cavallaro,et al.  Video Analytics for Surveillance: Theory and Practice [From the Guest Editors] , 2010 .

[9]  M. Szarvas,et al.  Real-time Pedestrian Detection Using LIDAR and Convolutional Neural Networks , 2006, 2006 IEEE Intelligent Vehicles Symposium.

[10]  Michael J. Brooks,et al.  Issues in Automated Visual Surveillance , 2003, DICTA.

[11]  Tieniu Tan,et al.  Recent developments in human motion analysis , 2003, Pattern Recognit..

[12]  Ramakant Nevatia,et al.  Event Detection and Analysis from Video Streams , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  Matthew Brand,et al.  Discovery and Segmentation of Activities in Video , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Jean-Yves Le Boudec,et al.  An Artificial Immune System for Misbehavior Detection in Mobile Ad-Hoc Networks with Virtual Thymus, Clustering, Danger Signal and Memory Detectors , 2004, Int. J. Unconv. Comput..

[15]  Touradj Ebrahimi,et al.  Tracking video objects in cluttered background , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[16]  Mubarak Shah,et al.  Motion-based recognition a survey , 1995, Image Vis. Comput..

[17]  François Brémond,et al.  Video surveillance for aircraft activity monitoring , 2005, IEEE Conference on Advanced Video and Signal Based Surveillance, 2005..

[18]  Christoph Bregler,et al.  Learning and recognizing human dynamics in video sequences , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[19]  A J Ahumada,et al.  Model of human visual-motion sensing. , 1985, Journal of the Optical Society of America. A, Optics and image science.

[20]  Mubarak Shah,et al.  Tracking and Object Classification for Automated Surveillance , 2002, ECCV.

[21]  Ankush Mittal,et al.  Study of Robust and Intelligent Surveillance in Visible and Multi-modal Framework , 2007, Informatica.

[22]  Francis K. H. Quek,et al.  Agent-based gesture tracking , 2005, IEEE Trans. Syst. Man Cybern. Part A.

[23]  T. Jan,et al.  Neural network based threat assessment for automated visual surveillance , 2004, 2004 IEEE International Joint Conference on Neural Networks (IEEE Cat. No.04CH37541).

[24]  Aaron F. Bobick,et al.  Activity and function recognition for moving and static objects in urban environments from wide-area persistent surveillance inputs , 2010, Defense + Commercial Sensing.

[25]  Paul Over,et al.  High-level feature detection from video in TRECVid: a 5-year retrospective of achievements , 2009 .

[26]  G. Johansson Visual perception of biological motion and a model for its analysis , 1973 .

[27]  Aaron F. Bobick,et al.  Recognition of Visual Activities and Interactions by Stochastic Parsing , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[28]  Teddy Ko,et al.  A survey on behavior analysis in video surveillance for homeland security applications , 2008, 2008 37th IEEE Applied Imagery Pattern Recognition Workshop.

[29]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[30]  J. Montepare,et al.  The identification of emotions from gait information , 1987 .

[31]  Tieniu Tan,et al.  A survey on visual surveillance of object motion and behaviors , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[32]  Aaron F. Bobick,et al.  A State-Based Approach to the Representation and Recognition of Gesture , 1997, IEEE Trans. Pattern Anal. Mach. Intell..

[33]  James W. Davis,et al.  The Recognition of Human Movement Using Temporal Templates , 2001, IEEE Trans. Pattern Anal. Mach. Intell..

[34]  François Brémond,et al.  Video-understanding framework for automatic behavior recognition , 2006, Behavior research methods.