Expert video-surveillance system for real-time detection of suspicious behaviors in shopping malls

Tracking-by-detection based on segmentation, Kalman predictions and LSAP association.Occlusion management: SVM kernel metric for GCH+LBP+HOG image features.Overall performance near to 85% while tracking under occlusions in CAVIAR dataset.Human behavior analysis (exits, loitering, etc.) in naturalistic scenes in shops.Real-time multi-camera performance with a processing capacity near to 50fps/camera. Expert video-surveillance systems are a powerful tool applied in varied scenarios with the aim of automatizing the detection of different risk situations and helping human security officers to take appropriate decisions in order to enhance the protection of assets. In this paper, we propose a complete expert system focused on the real-time detection of potentially suspicious behaviors in shopping malls. Our video-surveillance methodology contributes several innovative proposals that compose a robust application which is able to efficiently track the trajectories of people and to discover questionable actions in a shop context. As a first step, our system applies an image segmentation to locate the foreground objects in scene. In this case, the most effective background subtraction algorithms of the state of the art are compared to find the most suitable for our expert video-surveillance application. After the segmentation stage, the detected blobs may represent full or partial people bodies, thus, we have implemented a novel blob fusion technique to group the partial blobs into the final human targets. Then, we contribute an innovative tracking algorithm which is not only based on people trajectories as the most part of state-of-the-art methods, but also on people appearance in occlusion situations. This tracking is carried out employing a new two-step method: (1) the detections-to-tracks association is solved by using Kalman filtering combined with an own-designed cost optimization for the Linear Sum Assignment Problem (LSAP); and (2) the occlusion management is based on SVM kernels to compute distances between appearance features such as GCH, LBP and HOG. The application of these three features for recognizing human appearance provides a great performance compared to other description techniques, because color, texture and gradient information are effectively combined to obtain a robust visual description of people. Finally, the resultant trajectories of people obtained in the tracking stage are processed by our expert video-surveillance system for analyzing human behaviors and identifying potential shopping mall alarm situations, as are shop entry or exit of people, suspicious behaviors such as loitering and unattended cash desk situations. With the aim of evaluating the performance of some of the main contributions of our proposal, we use the publicly available CAVIAR dataset for testing the proposed tracking method with a success near to 85% in occlusion situations. According to this performance, we corroborate in the presented results that the precision and efficiency of our tracking method is comparable and slightly superior to the most recent state-of-the-art works. Furthermore, the alarms given off by our application are evaluated on a naturalistic private dataset, where it is evidenced that our expert video-surveillance system can effectively detect suspicious behaviors with a low computational cost in a shopping mall context.

[1]  Ramakant Nevatia,et al.  Tracking of Multiple, Partially Occluded Humans based on Static Body Part Detection , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[2]  Qi Tian,et al.  An Efficient Sequential Approach to Tracking Multiple Objects Through Crowds for Real-Time Intelligent CCTV Systems , 2008, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[3]  Yizhen Huang,et al.  Multi-pose human head detection and tracking boosted by efficient human head validation using ellipse detection , 2015, Eng. Appl. Artif. Intell..

[4]  Benjamin Höferlin,et al.  Evaluation of background subtraction techniques for video surveillance , 2011, CVPR 2011.

[5]  Yihong Gong,et al.  Multi-target tracking by learning local-to-global trajectory models , 2015, Pattern Recognit..

[6]  Thomas B. Moeslund,et al.  Thermal cameras and applications: a survey , 2013, Machine Vision and Applications.

[7]  Luis Miguel Bergasa,et al.  Bidirectional loop closure detection on panoramas for visual navigation , 2014, 2014 IEEE Intelligent Vehicles Symposium Proceedings.

[8]  Hongxun zhang,et al.  Fusing Color and Texture Features for Background Model , 2006, FSKD.

[9]  Brian C. Lovell,et al.  A Low-Complexity Algorithm for Static Background Estimation from Cluttered Image Sequences in Surveillance Contexts , 2013, EURASIP J. Image Video Process..

[10]  J.A. Besada,et al.  Airport surface surveillance based on video images , 2005, IEEE Transactions on Aerospace and Electronic Systems.

[11]  Duc Phu Chau,et al.  Online parameter tuning for object tracking algorithms , 2014, Image Vis. Comput..

[12]  Tomi Räty,et al.  Survey on Contemporary Remote Surveillance Systems for Public Safety , 2010, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[13]  Thierry Bouwmans,et al.  Fuzzy integral for moving object detection , 2008, 2008 IEEE International Conference on Fuzzy Systems (IEEE World Congress on Computational Intelligence).

[14]  Matthew N. Dailey,et al.  Multiple human tracking in high-density crowds , 2009, Image Vis. Comput..

[15]  Luigi di Stefano,et al.  Advanced Video-Based Surveillance , 2011, EURASIP J. Image Video Process..

[16]  T. E. Easterfield A Combinatorial Algorithm , 1946 .

[17]  Ming-Hsuan Yang,et al.  Gender classification with support vector machines , 2000, Proceedings Fourth IEEE International Conference on Automatic Face and Gesture Recognition (Cat. No. PR00580).

[18]  Ramakant Nevatia,et al.  Learning to associate: HybridBoosted multi-target tracker for crowded scene , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[19]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[20]  Thierry Bouwmans,et al.  Background Modeling using Mixture of Gaussians for Foreground Detection - A Survey , 2008 .

[21]  Tieniu Tan,et al.  A survey on visual surveillance of object motion and behaviors , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part C (Applications and Reviews).

[22]  T. Başar,et al.  A New Approach to Linear Filtering and Prediction Problems , 2001 .

[23]  Thierry Bouwmans,et al.  Type-2 Fuzzy Mixture of Gaussians Model: Application to Background Modeling , 2008, ISVC.

[24]  Matti Pietikäinen,et al.  Performance evaluation of texture measures with classification based on Kullback discrimination of distributions , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[25]  Robert B. Fisher,et al.  The PETS04 Surveillance Ground-Truth Data Sets , 2004 .

[26]  C. Micheloni,et al.  A network of co-operative cameras for visual surveillance , 2005 .

[27]  Marc Friedman,et al.  Video Surveillance for Biometrics: Long-Range Multi-biometric System , 2008, 2008 IEEE Fifth International Conference on Advanced Video and Signal Based Surveillance.

[28]  Ramakant Nevatia,et al.  Tracking multiple humans in crowded environment , 2004, Proceedings of the 2004 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2004. CVPR 2004..

[29]  Mubarak Shah,et al.  Automated Visual Surveillance in Realistic Scenarios , 2007, IEEE MultiMedia.

[30]  Luis Miguel Bergasa,et al.  Fast and effective visual place recognition using binary codes and disparity information , 2014, 2014 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[31]  Luis Miguel Bergasa,et al.  Towards life-long visual localization using an efficient matching of binary sequences from images , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[32]  David Fernández Llorca,et al.  Hierarchical camera auto-calibration for traffic surveillance systems , 2014, Expert Syst. Appl..

[33]  Z. Zivkovic Improved adaptive Gaussian mixture model for background subtraction , 2004, ICPR 2004.

[34]  Li-Qun Xu Issues in video analytics and surveillance systems: Research / prototyping vs. applications / user requirements , 2007, AVSS.

[35]  Lucia Maddalena,et al.  A Self-Organizing Approach to Background Subtraction for Visual Surveillance Applications , 2008, IEEE Transactions on Image Processing.

[36]  François Brémond,et al.  Global tracker: An online evaluation framework to improve tracking quality , 2014, 2014 11th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[37]  Parvaneh Saeedi,et al.  Robust region-based background subtraction and shadow removing using color and gradient information , 2008, 2008 19th International Conference on Pattern Recognition.

[38]  W. Eric L. Grimson,et al.  Adaptive background mixture models for real-time tracking , 1999, Proceedings. 1999 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (Cat. No PR00149).

[39]  Jean-Marc Odobez,et al.  Multi-Layer Background Subtraction Based on Color and Texture , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[40]  Luis Jiménez,et al.  OCULUS surveillance system: Fuzzy on-line speed analysis from 2D images , 2011, Expert Syst. Appl..

[41]  Hervé Mathias,et al.  Three Novell Analog-Domain Algorithms for Motion Detection in Video Surveillance , 2011, EURASIP J. Image Video Process..

[42]  Shihong Lao,et al.  Multi-object tracking through occlusions by local tracklets filtering and global tracklets association with detection responses , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[43]  Ioannis Pavlidis,et al.  Urban surveillance systems: from the laboratory to the commercial world , 2001, Proc. IEEE.

[44]  Chu Kiong Loo,et al.  An Effective Surveillance System Using Thermal Camera , 2009, 2009 International Conference on Signal Acquisition and Processing.

[45]  Steven A. Shafer,et al.  Anatomy of a color histogram , 1992, Proceedings 1992 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[46]  Ramakant Nevatia,et al.  How does person identity recognition help multi-person tracking? , 2011, CVPR 2011.

[47]  Ramakant Nevatia,et al.  Multi-target tracking by on-line learned discriminative appearance models , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[48]  Andrea Cavallaro,et al.  Distributed visual sensing for virtual top-view trajectory generation in football videos , 2008, CIVR '08.

[49]  Zygmunt L. Szpak,et al.  Maritime surveillance: Tracking ships inside a dynamic background using a fast level-set , 2011, Expert Syst. Appl..

[50]  Duc Phu Chau,et al.  Automatic tracker selection w.r.t object detection performance , 2014, IEEE Winter Conference on Applications of Computer Vision.

[51]  Gurjit Singh Walia,et al.  Intelligent video target tracking using an evolutionary particle filter based upon improved cuckoo search , 2014, Expert Syst. Appl..

[52]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[53]  Alessio Del Bue,et al.  Human behavior analysis in video surveillance: A Social Signal Processing perspective , 2013, Neurocomputing.

[54]  Gérard G. Medioni,et al.  Tracking Using Motion Patterns for Very Crowded Scenes , 2012, ECCV.

[55]  Javier Medina,et al.  Intelligent surveillance system with integration of heterogeneous information for intrusion detection , 2011, Expert Syst. Appl..

[56]  Chee Seng Chan,et al.  iSurveillance: Intelligent framework for multiple events detection in surveillance videos , 2014, Expert Syst. Appl..

[57]  Dmitry B. Goldgof,et al.  Understanding Transit Scenes: A Survey on Human Behavior-Recognition Algorithms , 2010, IEEE Transactions on Intelligent Transportation Systems.

[58]  G.L. Foresti,et al.  Active video-based surveillance system: the low-level image and video processing techniques needed for implementation , 2005, IEEE Signal Processing Magazine.

[59]  Xiaogang Wang,et al.  Intelligent multi-camera video surveillance: A review , 2013, Pattern Recognit. Lett..

[60]  Gian Luca Foresti,et al.  Active Video-Based Surveillance System , 2005 .