Spatio-temporal crowd density model in a human detection and tracking framework

Recently significant progress has been made in the field of person detection and tracking. However, crowded scenes remain particularly challenging and can deeply affect the results due to overlapping detections and dynamic occlusions. In this paper, we present a method to enhance human detection and tracking in crowded scenes. It is based on introducing additional information about crowds and integrating it into the state-of-the-art detector. This additional information cue consists of modeling time-varying dynamics of the crowd density using local features as an observation of a probabilistic function. It also involves a feature tracking step which allows excluding feature points attached to the background. This process is favorable for the later density estimation since the influence of features irrelevant to the underlying crowd density is removed. Our proposed approach applies a scene-adaptive dynamic parametrization using this crowd density measure. It also includes a self-adaptive learning of the human aspect ratio and perceived height in order to reduce false positive detections. The resulting improved detections are subsequently used to boost the efficiency of the tracking in a tracking-by-detection framework. Our proposed approach for person detection is evaluated on videos from different datasets, and the results demonstrate the advantages of incorporating crowd density and geometrical constraints into the detection process. Also, its impact on tracking results have been experimentally validated showing good results. HighlightsWe model time-varying dynamics of the crowd density using feature tracks.We incorporate the crowd measure into the state-of-the-art detector.We extend the improved detections to tracking in a tracking-by-detection framework.We obtain better results compared to the baseline methods of detection and tracking.

[1]  S. Godsill,et al.  Monte Carlo filtering for multi target tracking and data association , 2005, IEEE Transactions on Aerospace and Electronic Systems.

[2]  Ivan Laptev,et al.  Density-aware person detection and tracking in crowds , 2011, ICCV.

[3]  Xuran Zhao,et al.  Crowd density analysis using subspace learning on local binary pattern , 2013, 2013 IEEE International Conference on Multimedia and Expo Workshops (ICMEW).

[4]  Branko Ristic,et al.  A Metric for Performance Evaluation of Multi-Target Tracking Algorithms , 2011, IEEE Transactions on Signal Processing.

[5]  Florian Schmidt,et al.  Integrating pedestrian simulation, tracking and event detection for crowd analysis , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[6]  G. Rigoll,et al.  Unified hierarchical multi-object tracking using global data association , 2013, 2013 IEEE International Workshop on Performance Evaluation of Tracking and Surveillance (PETS).

[7]  Mubarak Shah,et al.  A Lagrangian Particle Dynamics Approach for Crowd Flow Segmentation and Stability Analysis , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[8]  Yang Li,et al.  Improved Probabilistic Multi-Hypothesis Tracker for Multiple Target Tracking With Switching Attribute States , 2011, IEEE Transactions on Signal Processing.

[9]  Ba-Ngu Vo,et al.  The Gaussian Mixture Probability Hypothesis Density Filter , 2006, IEEE Transactions on Signal Processing.

[10]  Rainer Stiefelhagen,et al.  Multimodal Technologies for Perception of Humans, First International Evaluation Workshop on Classification of Events, Activities and Relationships, CLEAR 2006, Southampton, UK, April 6-7, 2006, Revised Selected Papers , 2007, CLEAR.

[11]  Mubarak Shah,et al.  Floor Fields for Tracking in High Density Crowd Scenes , 2008, ECCV.

[12]  Takeo Kanade,et al.  Tracking in unstructured crowded scenes , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[13]  Rubén Heras Evangelio,et al.  Robust modified L2 local optical flow estimation and feature tracking , 2011, 2011 IEEE Workshop on Applications of Computer Vision (WACV).

[14]  Alexei A. Efros,et al.  Putting Objects in Perspective , 2006, CVPR.

[15]  Tom Drummond,et al.  Faster and Better: A Machine Learning Approach to Corner Detection , 2008, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  Gérard G. Medioni,et al.  Tracking Using Motion Patterns for Very Crowded Scenes , 2012, ECCV.

[17]  Hua Yang,et al.  Crowd Event Perception Based on Spatio-temporal Viscous Fluid Field , 2012, 2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance.

[18]  Chabane Djeraba,et al.  Real-time crowd motion analysis , 2008, 2008 19th International Conference on Pattern Recognition.

[19]  Ko Nishino,et al.  Tracking with local spatio-temporal motion patterns in extremely crowded scenes , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[20]  Ian D. Reid,et al.  Stable multi-target tracking in real-time surveillance video , 2011, CVPR 2011.

[21]  Volker Eiselein,et al.  Real-Time Multi-human Tracking Using a Probability Hypothesis Density Filter and Multiple Detectors , 2012, 2012 IEEE Ninth International Conference on Advanced Video and Signal-Based Surveillance.

[22]  Matthijs C. Dorst Distinctive Image Features from Scale-Invariant Keypoints , 2011 .

[23]  R. Mahler Multitarget Bayes filtering via first-order multitarget moments , 2003 .

[24]  Ivan Laptev,et al.  Data-driven crowd analysis in videos , 2011, ICCV.

[25]  Antonio Albiol,et al.  VIDEO ANALYSIS USING CORNER MOTION STATISTICS , 2009 .

[26]  Jin Hyeong Park,et al.  Performance evaluation of object detection algorithms , 2002, Object recognition supported by user interaction for service robots.

[27]  Carlo Tomasi,et al.  Good features to track , 1994, 1994 Proceedings of IEEE Conference on Computer Vision and Pattern Recognition.

[28]  Ko Nishino,et al.  Tracking Pedestrians Using Local Spatio-Temporal Motion Patterns in Extremely Crowded Scenes , 2012, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[29]  Ronald P. S. Mahler,et al.  Advances in Statistical Multisource-Multitarget Information Fusion , 2014 .

[30]  Grantham Pang,et al.  People Counting and Human Detection in a Challenging Situation , 2011, IEEE Transactions on Systems, Man, and Cybernetics - Part A: Systems and Humans.

[31]  Rainer Stiefelhagen,et al.  The CLEAR 2006 Evaluation , 2006, CLEAR.

[32]  Tobias Senst,et al.  Robust Local Optical Flow for Feature Tracking , 2012, IEEE Transactions on Circuits and Systems for Video Technology.

[33]  J. Ferryman,et al.  PETS2009: Dataset and challenge , 2009, 2009 Twelfth IEEE International Workshop on Performance Evaluation of Tracking and Surveillance.

[34]  Ba-Ngu Vo,et al.  Convergence Analysis of the Gaussian Mixture PHD Filter , 2007, IEEE Transactions on Signal Processing.

[35]  Thomas Sikora,et al.  Real-time person counting by propagating networks flows , 2011, 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[36]  Jean-Luc Dugelay,et al.  Low level crowd analysis using frame-wise normalized feature for people counting , 2012, 2012 IEEE International Workshop on Information Forensics and Security (WIFS).

[37]  Ramin Mehran,et al.  Abnormal crowd behavior detection using social force model , 2009, CVPR.

[38]  Carlo S. Regazzoni,et al.  Bio-inspired relevant interaction modelling in cognitive crowd management , 2015, J. Ambient Intell. Humaniz. Comput..

[39]  Heiko Neumann,et al.  A Bio-Inspired, Motion-Based Analysis of Crowd Behavior Attributes Relevance to Motion Transparency, Velocity Gradients, and Motion Patterns , 2012, PloS one.

[40]  Luc Van Gool,et al.  Robust tracking-by-detection using a detector confidence particle filter , 2009, 2009 IEEE 12th International Conference on Computer Vision.

[41]  Bill Triggs,et al.  Histograms of oriented gradients for human detection , 2005, 2005 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'05).

[42]  P ? ? ? ? ? ? ? % ? ? ? ? , 1991 .

[43]  Qingming Huang,et al.  Beyond particle flow: Bag of Trajectory Graphs for dense crowd event recognition , 2013, 2013 IEEE International Conference on Image Processing.

[44]  A. Sugar,et al.  Faster and better. , 2013, Cornea.

[45]  David A. McAllester,et al.  Object Detection with Discriminatively Trained Part Based Models , 2010, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[46]  Jean-Luc Dugelay,et al.  Crowd density map estimation based on feature tracks , 2013, 2013 IEEE 15th International Workshop on Multimedia Signal Processing (MMSP).

[47]  Rubén Heras Evangelio,et al.  Complementary background models for the detection of static and moving objects in crowded environments , 2011, 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).