Motion priors based on goals hierarchies in pedestrian tracking applications

In this paper, the problem of automated scene understanding by tracking and predicting paths for multiple humans is tackled, with a new methodology using data from a single, fixed camera monitoring the environment. Our main idea is to build goal-oriented prior motion models that could drive both the tracking and path prediction algorithms, based on a coarse-to-fine modeling of the target goal. To implement this idea, we use a dataset of training video sequences with associated ground-truth trajectories and from which we extract hierarchically a set of key locations. These key locations may correspond to exit/entrance zones in the observed scene, or to crossroads where trajectories have often abrupt changes of direction. A simple heuristic allows us to make piecewise associations of the ground-truth trajectories to the key locations, and we use these data to learn one statistical motion model per key location, based on the variations of the trajectories in the training data and on a regularizing prior over the models spatial variations. We illustrate how to use these motion priors within an interacting multiple model scheme for target tracking and path prediction, and we finally evaluate this methodology with experiments on common datasets for tracking algorithms comparison.

[1]  Yihong Gong,et al.  Online Multi-Target Tracking With Unified Handling of Complex Scenarios , 2015, IEEE Transactions on Image Processing.

[2]  S. Savarese,et al.  Learning an Image-Based Motion Context for Multiple People Tracking , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[3]  Patrick Pérez,et al.  Data fusion for visual tracking with particles , 2004, Proceedings of the IEEE.

[4]  J. Laumond,et al.  The formation of trajectories during goal‐oriented locomotion in humans. I. A stereotyped behaviour , 2007, The European journal of neuroscience.

[5]  Takayuki Kanda,et al.  Modeling and Prediction of Pedestrian Behavior based on the Sub-goal Concept , 2012, Robotics: Science and Systems.

[6]  Alexandre Heili,et al.  Combined estimation of location and body pose in surveillance video , 2011, 2011 8th IEEE International Conference on Advanced Video and Signal Based Surveillance (AVSS).

[7]  Christian Laugier,et al.  Intentional motion on-line learning and prediction , 2008, Machine Vision and Applications.

[8]  A. Ellis,et al.  PETS2010 and PETS2009 Evaluation of Results Using Individual Ground Truthed Single Views , 2010, 2010 7th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[9]  Pascal Fua,et al.  Multi-camera Tracking and Atypical Motion Detection with Behavioral Maps , 2008, ECCV.

[10]  Geoffrey J. Gordon,et al.  Better Motion Prediction for People-tracking , 2004 .

[11]  Kai Oliver Arras,et al.  People tracking with human motion predictions from social forces , 2010, 2010 IEEE International Conference on Robotics and Automation.

[12]  Wolfram Burgard,et al.  Learning Motion Patterns of People for Compliant Robot Motion , 2005, Int. J. Robotics Res..

[13]  Paolo Rocco,et al.  Towards safe human-robot interaction in robotic cells: An approach based on visual tracking and intention estimation , 2011, 2011 IEEE/RSJ International Conference on Intelligent Robots and Systems.

[14]  Francisco Madrigal,et al.  Learning and Regularizing Motion Models for Enhancing Particle Filter-Based Target Tracking , 2011, PSIVT.

[15]  Catherine A. Sugar,et al.  Finding the Number of Clusters in a Dataset , 2003 .

[16]  Francisco Madrigal,et al.  Goal-oriented visual tracking of pedestrians with motion priors in semi-crowded scenes , 2015, 2015 IEEE International Conference on Robotics and Automation (ICRA).

[17]  J.-Y. Bouguet,et al.  Pyramidal implementation of the lucas kanade feature tracker , 1999 .

[18]  Donghoon Lee,et al.  OPTIMUS:online persistent tracking and identification of many users for smart spaces , 2014, Machine Vision and Applications.

[19]  Afshin Dehghan,et al.  Understanding Crowd Collectivity: A Meta-Tracking Approach , 2015 .

[20]  Fei-Fei Li,et al.  Socially-Aware Large-Scale Crowd Forecasting , 2014, 2014 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  Konrad Schindler,et al.  Challenges of Ground Truth Evaluation of Multi-target Tracking , 2013, 2013 IEEE Conference on Computer Vision and Pattern Recognition Workshops.

[22]  Afshin Dehghan,et al.  Target Identity-aware Network Flow for online multiple target tracking , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[23]  Konrad Schindler,et al.  Discrete-continuous optimization for multi-target tracking , 2012, 2012 IEEE Conference on Computer Vision and Pattern Recognition.

[24]  Francisco Madrigal,et al.  Evaluation of multiple motion models for multiple pedestrian visual tracking , 2013, 2013 10th IEEE International Conference on Advanced Video and Signal Based Surveillance.

[25]  James Ferryman,et al.  Performance evaluation of crowd image analysis using the PETS2009 dataset , 2014, Pattern Recognit. Lett..

[26]  Afshin Dehghan,et al.  GMMCP tracker: Globally optimal Generalized Maximum Multi Clique problem for multiple object tracking , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[27]  Anil K. Jain,et al.  Algorithms for Clustering Data , 1988 .

[28]  Gustavo Arechavaleta Servin An optimality principle governing human walking , 2007 .

[29]  Jing Zhang,et al.  Framework for Performance Evaluation of Face, Text, and Vehicle Detection and Tracking in Video: Data, Metrics, and Protocol , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[30]  Ian D. Reid,et al.  Stable multi-target tracking in real-time surveillance video , 2011, CVPR 2011.

[31]  Kai Oliver Arras,et al.  Place-Dependent People Tracking , 2009, ISRR.

[32]  Bodo Rosenhahn,et al.  Everybody needs somebody: Modeling social and grouping behavior on a linear programming multiple people tracker , 2011, 2011 IEEE International Conference on Computer Vision Workshops (ICCV Workshops).

[33]  Mubarak Shah,et al.  Floor Fields for Tracking in High Density Crowd Scenes , 2008, ECCV.