Heterogeneous Information Fusion and Visualization for a Large-Scale Intelligent Video Surveillance System

Wide-area monitoring for a smart community can be challenging in systems engineering because of its large scale and heterogeneity at the sensor, algorithm, and visualization levels. A smart interface to visualize high-level information fused from a diversity of low-level surveillance data, and to facilitate rapid response of events, is critical for the design of the system. This paper presents an event-driven visualization mechanism fusing multimodal information for a large-scale intelligent video surveillance system. The mechanism proactively helps security personnel intuitively be aware of events through close cooperation among visualization, data fusion, and sensor tasking. The visualization not only displays 2-D, 3-D, and geographical information within a condensed form of interface but also automatically shows the only important video streams corresponding to spontaneous alerts and events by a decision process called display switching arbitration. The display switching arbitration decides the importance of cameras by score ranking that considers event urgency and semantic object features. This system has been successfully deployed in a campus to demonstrate its usability and efficiency for an installation with two camera clusters that include dozens of cameras, and with a lot of video analytics to detect alerts and events. A further simulation comparing the display switching arbitration with similar camera selection methods shows that our method improves the visualization by selecting better representative camera views and reducing redundant switchover among multiview videos.

[1]  Henry Medeiros,et al.  Cluster-Based Distributed Face Tracking in Camera Networks , 2010, IEEE Transactions on Image Processing.

[2]  Hamid K. Aghajan,et al.  Architecture for Cluster-Based Automated Surveillance Network for Detecting and Tracking Multiple Persons , 2007, 2007 First ACM/IEEE International Conference on Distributed Smart Cameras.

[3]  Peter Widhalm,et al.  Next-generation 3D visualization for visual surveillance , 2011, 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[4]  Sharath Pankanti,et al.  Video surveillance: past, present, and now the future [DSP Forum] , 2013, IEEE Signal Processing Magazine.

[5]  Sheng-Jyh Wang,et al.  A Hierarchical Bayesian Generation Framework for Vacant Parking Space Detection , 2010, IEEE Transactions on Circuits and Systems for Video Technology.

[6]  Yuan-Kai Wang,et al.  Real-time camera anomaly detection for real-world video surveillance , 2011, 2011 International Conference on Machine Learning and Cybernetics.

[7]  Jiming Chen,et al.  Smart community: an internet of things application , 2011, IEEE Communications Magazine.

[8]  Ming-Hsiu Tsai,et al.  Vehicle Counting without Background Modeling , 2011, MMM.

[9]  Gian Luca Foresti,et al.  Camera Selection for Adaptive Human-Computer Interface , 2014, IEEE Transactions on Systems, Man, and Cybernetics: Systems.

[10]  Yi-Ping Hung,et al.  Multi-Resolution Design for Large-Scale and High-Resolution Monitoring , 2011, IEEE Transactions on Multimedia.

[11]  Lisa M. Brown,et al.  IBM smart surveillance system (S3): event based video surveillance system with an open and extensible framework , 2008, Machine Vision and Applications.

[12]  Li-Chen Fu,et al.  Multitarget Visual Tracking Based Effective Surveillance With Cooperation of Multiple Active Cameras , 2011, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[13]  Henry Medeiros,et al.  Distributed Object Tracking Using a Cluster-Based Kalman Filter in Wireless Camera Networks , 2008, IEEE Journal of Selected Topics in Signal Processing.

[14]  Frank M. Shipman,et al.  DOTS: support for effective video surveillance , 2007, ACM Multimedia.

[15]  Seong-Dae Kim,et al.  Rate-driven key frame selection using temporal variation of visual content , 2002 .

[16]  Daijin Kim,et al.  Probabilistic camera hand-off for visual surveillance , 2008, 2008 Second ACM/IEEE International Conference on Distributed Smart Cameras.

[17]  Chin-Teng Lin,et al.  A Conditional Entropy-Based Independent Component Analysis for Applications in Human Detection and Tracking , 2010, EURASIP J. Adv. Signal Process..

[18]  Li Yu,et al.  Systems issues in distributed multi-modal surveillance , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Ian F. Akyildiz,et al.  A survey on wireless multimedia sensor networks , 2007, Comput. Networks.

[20]  A. Murat Tekalp,et al.  Robust color histogram descriptors for video segment retrieval and identification , 2002, IEEE Trans. Image Process..

[21]  Xiaogang Wang,et al.  Intelligent multi-camera video surveillance: A review , 2013, Pattern Recognit. Lett..

[22]  Yuting Su,et al.  Surveillance video summarization based on moving object detection and trajectory extraction , 2010, 2010 2nd International Conference on Signal Processing Systems.

[23]  Faouzi Kossentini,et al.  Automatic Key Video Object Plane Selection Using the Shape Information in the MPEG-4 Compressed Domain , 2000, IEEE Trans. Multim..

[24]  Yael Pritch,et al.  This article has been accepted for publication in a future issue of this journal, but has not been fully edited. Content may change prior to final publication. IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, 2008 1 Non-Chronological Video , 2022 .

[25]  Kin-Man Lam,et al.  A new key frame representation for video segment retrieval , 2005, IEEE Transactions on Circuits and Systems for Video Technology.

[26]  Shyh-Kang Jeng,et al.  Augmented keyframe , 2010, J. Vis. Commun. Image Represent..

[27]  Yannis Avrithis,et al.  A Region Thesaurus Approach for High-Level Concept Detection in the Natural Disaster Domain , 2007, SAMT.

[28]  Zhi-Hua Zhou,et al.  Multi-View Video Summarization , 2010, IEEE Transactions on Multimedia.

[29]  Yo-Ping Huang,et al.  Ubiquitous Information Transfer across Different Platforms by QR Codes , 2010, J. Mobile Multimedia.

[30]  Tomi Räty,et al.  Survey on Contemporary Remote Surveillance Systems for Public Safety , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[31]  Xavier Savatier,et al.  Face Detection and Recognition based on Fusion of Omnidirectional and PTZ Vision Sensors and Heteregenous Database , 2013 .

[32]  David Lyon,et al.  Eyes Everywhere : The Global Growth of Camera Surveillance , 2012 .

[33]  M.D. Naish,et al.  Active-vision-based multisensor surveillance - an implementation , 2006, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[34]  Jenq-Neng Hwang,et al.  Object-based video abstraction for video surveillance systems , 2002, IEEE Trans. Circuits Syst. Video Technol..

[35]  WeiQi Yan,et al.  A Framework for an Event Driven Video Surveillance System , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[36]  G. J. Smith,et al.  Behind the Screens: Examining Constructions of Deviance and Informal Practices among CCTV Control Room Operators in the UK , 2002 .

[37]  Norimichi Ukita,et al.  Real-time multitarget tracking by a cooperative distributed vision system , 2002, Proc. IEEE.

[38]  Jun-Wei Hsieh,et al.  Segmentation of Human Body Parts Using Deformable Triangulation , 2006, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[39]  P. L. Venetianer,et al.  The evolution of video surveillance: an overview , 2008, Machine Vision and Applications.

[40]  Joseph Ferenbok,et al.  Hidden changes: From CCTV to ‘smart’ video surveillance , 2013 .