Metadata extraction and organization for intelligent video surveillance system

The research for metadata extraction originates from the intelligent video surveillance system, which is widely used in outdoor and indoor environment for the aims of traffic monitor, security guard, and intelligent robot. Various features are extracted from the surveillance image sequences such as target detection, target tracking, object's shape and activities. However, the trend of more and more features being used and shared in video surveillance system calls for more attention to bridge the gap between specific analysis algorithms and end-user's expectation. This paper proposes a three-layer object oriented model to extract the surveillance metadata including shape, motion speed, and trajectory of the object emerging in image sequence. Meanwhile, the high-level semantic metadata including entry/exit point, object duration time is organized and stored which are provided for the further end-user queries. The paper also presents the experiment results in different indoor and outdoor surveillance scenarios. At last, a comparative analysis with another traditional method is presented.

[1]  Ehud Rivlin,et al.  Understanding Video Events: A Survey of Methods for Automatic Interpretation of Semantic Occurrences in Video , 2009, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[2]  Sergio A. Velastin,et al.  A profile of MPEG-7 for visual surveillance , 2007, 2007 IEEE Conference on Advanced Video and Signal Based Surveillance.

[3]  Ramakant Nevatia,et al.  VERL: An Ontology Framework for Representing and Annotating Video Events , 2005, IEEE Multim..

[4]  Ramakant Nevatia,et al.  An Ontology for Video Event Representation , 2004, 2004 Conference on Computer Vision and Pattern Recognition Workshop.

[5]  Monique Thonnat,et al.  Ontologies For Video Events , 2004 .

[6]  Xavier Desurmont,et al.  Adaptive Metadata Management System for Distributed Video Content Analysis , 2008, ACIVS.

[7]  Henrik Plate,et al.  Collaborative Workflow Management for eGovernment , 2007 .

[8]  Andrea Cavallaro,et al.  Target Detection and Tracking With Heterogeneous Sensors , 2008, IEEE Journal of Selected Topics in Signal Processing.

[9]  Walter Stechele,et al.  Toward contextual forensic retrieval for visual surveillance: Challenges and an architectural approach , 2009, 2009 10th Workshop on Image Analysis for Multimedia Interactive Services.