Goal-oriented visual tracking of pedestrians with motion priors in semi-crowded scenes

We propose a methodology for learning and using a multiple-goal probabilistic motion model within a particle filter-based target tracking on video streams. In a set of training video sequences, we first extract the locations (coined as “goals”) where the pedestrians either leave the scene or often change directions. Then, we learn one motion prior model per detected goal. Each of these models is learned statistically based on the local motion observed by the camera during the training phase. Given that the initial, empirical distribution may be incomplete or noisy, we regularize it in a second phase. These priors are then used in an Interactive Multiple Model (IMM) scheme for target tracking and goal estimation. We demonstrate the relevance of this methodology with tracking experiments and comparisons done on standard datasets.

[1]  A. Ellis,et al.  PETS2010 and PETS2009 Evaluation of Results Using Individual Ground Truthed Single Views , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[2]  Francisco Madrigal,et al.  Evaluation of multiple motion models for multiple pedestrian visual tracking , 2013, 2013 10th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[3]  Pascal Fua,et al.  Multi-camera Tracking and Atypical Motion Detection with Behavioral Maps , 2008, ECCV.

[4]  James Ferryman,et al.  Performance evaluation of crowd image analysis using the PETS2009 dataset , 2014, Pattern Recognit. Lett..

[5]  Ian D. Reid,et al.  Stable multi-target tracking in real-time surveillance video , 2011, CVPR 2011.

[6]  Alexandre Heili,et al.  Combined estimation of location and body pose in surveillance video , 2011, 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[7]  Takayuki Kanda,et al.  Modeling and Prediction of Pedestrian Behavior based on the Sub-goal Concept , 2012, Robotics: Science and Systems.

[8]  Wolfram Burgard,et al.  Learning Motion Patterns of People for Compliant Robot Motion , 2005, Int. J. Robotics Res..

[9]  Jing Zhang,et al.  Framework for Performance Evaluation of Face, Text, and Vehicle Detection and Tracking in Video: Data, Metrics, and Protocol , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[10]  Francisco Madrigal,et al.  Learning and Regularizing Motion Models for Enhancing Particle Filter-Based Target Tracking , 2011, PSIVT.

[11]  Konrad Schindler,et al.  Challenges of Ground Truth Evaluation of Multi-target Tracking , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[12]  Patrick Pérez,et al.  Data fusion for visual tracking with particles , 2004, Proceedings of the IEEE.