Automatic Human Behaviour Recognition and Explanation for CCTV Video Surveillance

This paper is concerned with producing high-level text reports and explanations of human activity in video from a single, static camera. The motivation is to enable surveillance analysts to maintain situational awareness despite the presence of large volumes of data. The scenario we focus on is urban surveillance where the imaged person is medium/low resolution. The final output is text descriptions that not only describe, in human-readable terms, what is happening but also explain the interactions that take place. The input to the reasoning process is the information obtained from video processing methods that provide an abstraction from the image data to qualitative (i.e. human-readable) descriptions of observed human activity. Explanations of global scene activity, particularly where interesting events have occurred, is achieved using an extensible, rule-based method. The complete system represents a general technique for video understanding, which requires a guided training phase by an experienced analyst.

[1]  Michael E. Bratman,et al.  Intention, Plans, and Practical Reason , 1991 .

[2]  Ian D. Reid,et al.  Estimating Gaze Direction from Low-Resolution Faces in Video , 2006, ECCV.

[3]  David J. Fleet,et al.  Performance of optical flow techniques , 1994, International Journal of Computer Vision.

[4]  Dima Damen,et al.  Recognizing linked events: Searching the space of feasible explanations , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Jitendra Malik,et al.  Recognizing action at a distance , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[6]  Ian D. Reid,et al.  Behaviour understanding in video: a combined method , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[7]  Michael Brady,et al.  Towards a behavioural traffic monitoring system , 2005, AAMAS '05.

[8]  Dorin Comaniciu,et al.  Mean shift analysis and applications , 1999, Proceedings of the Seventh IEEE International Conference on Computer Vision.

[9]  Michael Wooldridge,et al.  The Belief-Desire-Intention Model of Agency , 1998, ATAL.

[10]  Brian J. Scholl,et al.  Innateness and (Bayesian) Visual Perception , 2005 .

[11]  Lawrence Birnbaum,et al.  Sensible Scenes: Visual Understanding of Complex Structures through Causal Analysis , 1993, AAAI.

[12]  Jeffrey Mark Siskind,et al.  Grounding the Lexical Semantics of Verbs in Visual Perception using Force Dynamics and Event Logic , 1999, J. Artif. Intell. Res..

[13]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[14]  David C. Hogg,et al.  Detecting inexplicable behaviour , 2004, BMVC.

[15]  Matthew Brand,et al.  Discovery and Segmentation of Activities in Video , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[16]  E. Jaynes Probability theory : the logic of science , 2003 .

[17]  Shaogang Gong,et al.  Video behaviour profiling and abnormality detection without manual labelling , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[18]  Jianbo Shi,et al.  Detecting unusual activity in video , 2004, CVPR 2004.

[19]  Simona Ronchi Della Rocca,et al.  λ Δ -Models , 2004 .

[20]  Michael J. Black,et al.  Implicit Probabilistic Models of Human Motion for Synthesis and Tracking , 2002, ECCV.

[21]  Ernesto Andrade,et al.  Simulation of Crowd Problems for Computer Vision , 2005 .

[22]  Anthony G. Cohn,et al.  Qualitative Reasoning , 1987, Advanced Topics in Artificial Intelligence.

[23]  Bob Fisher,et al.  First International Workshop on Crowd Simulation (V-CROWDS '05) , 2005 .

[24]  J. Buckley,et al.  Fuzzy expert systems and fuzzy reasoning , 2004 .

[25]  Paul A. Viola,et al.  Detecting Pedestrians Using Patterns of Motion and Appearance , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[26]  Richard Samuels,et al.  The Innate Mind: Structure and Contents , 2005 .

[27]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[28]  Michal Irani,et al.  Detecting Irregularities in Images and in Video , 2005, Tenth IEEE International Conference on Computer Vision (ICCV'05) Volume 1.

[29]  Michal Irani,et al.  Detecting Irregularities in Images and in Video , 2005, ICCV.

[30]  M. Irani,et al.  Event-Based Video Analysis, , 2001 .

[31]  J. Pearl Causality: Models, Reasoning and Inference , 2000 .