Incremental Learning for Motion Prediction of Pedestrians and Vehicles

Abstract The main subject of this thesis is motion prediction. The problem is approached from the hypothesis that the dynamic and kinematic properties of objects such as pedestrian and vehicles do not suffice to predict their motion in the long term. Instead, the work presented here, in scribes itself in a new family of approaches which assume that, in a given environment, objects do not move at random, but engage in “typical motion patterns”, which may be learned and then used to predict motion on the basis of sensor data. In this context, this thesis focuses in three fundamental questions: modeling, learning and prediction. Modeling. This thesis is based on Hidden Markov Models, a probabilistic framework, which is used as a discrete approximation to represent the continuous state-space in which motion takes place. The main originality of the approach lies in modeling explicitly the intentions which are at the origin of “typical motion patterns”. This is achieved through the using of an extended space, which adds the state that the object intends to reach to the other “classic” state variables, such as position or velocity. Learning. The main problem of existing approaches lies in the separation of model learning and utilization in two distinct stages: in a first phase, the model is learned from data; then, it is used to predict. This principle is difficult to apply to real situations, because it requires at least one example of every possible typical pattern to be available during the learning phase. To address this problem, this thesis proposes a novel extension to Hidden Markov Models which allows simultaneous learning and utilization of the model. This extension incrementally builds a topological map – representing the model's structure – and reestimates the model's parameters. The approach is intended to be general, and it could be used in other application domains such as gesture recognition or automatic landmark extraction. Prediction. In this context, prediction is carried on by using exact Bayesian inference algorithms, which are able to work in real time thanks to the properties of the structure which has been learned. In particular, the time complexity of inference is reduced from O(N 2 ) to O(N) with respect to the number of discrete states in the system. All of the results obtained on this thesis have been implemented and validated with experiments, using both real and simulated data. Real data has been obtained on two different visual tracking systems: one installed over a parking lot, and the other installed at INRIA's entry hall. For synthetic data, a simulator has been developed in order to facilitate the conduction of controlled tests and the study of larger environments than for real data.

[1]  Stuart J. Russell,et al.  Dynamic bayesian networks: representation, inference and learning , 2002 .

[2]  Kai-Tai Song,et al.  Environment prediction for a mobile robot in a dynamic environment , 1997, IEEE Trans. Robotics Autom..

[3]  Lawrence D. Stone,et al.  Bayesian Multiple Target Tracking , 1999 .

[4]  Vladimir Kolmogorov,et al.  An Experimental Comparison of Min-Cut/Max-Flow Algorithms for Energy Minimization in Vision , 2004, IEEE Trans. Pattern Anal. Mach. Intell..

[5]  Michael I. Jordan,et al.  Learning with Mixtures of Trees , 2001, J. Mach. Learn. Res..

[6]  Friedrich M. Wahl,et al.  Camera-based observation of obstacle motions to derive statistical data for mobile robot motion planning , 1998, Proceedings. 1998 IEEE International Conference on Robotics and Automation (Cat. No.98CH36146).

[7]  Robert M. Gray,et al.  An Algorithm for Vector Quantizer Design , 1980, IEEE Trans. Commun..

[8]  Svetha Venkatesh,et al.  Policy Recognition in the Abstract Hidden Markov Model , 2002, J. Artif. Intell. Res..

[9]  Y. Bar-Shalom Tracking and data association , 1988 .

[10]  Jr. G. Forney,et al.  Viterbi Algorithm , 1973, Encyclopedia of Machine Learning.

[11]  Luc Van Gool,et al.  Modeling and Recognition of Human Actions Using a Stochastic Approach , 2002 .

[12]  Hans-Peter Kriegel,et al.  The R*-tree: an efficient and robust access method for points and rectangles , 1990, SIGMOD '90.

[13]  Christopher Richard Wren,et al.  Finding temporal patterns by data decomposition , 2004, Sixth IEEE International Conference on Automatic Face and Gesture Recognition, 2004. Proceedings..

[14]  Hinrich Schütze,et al.  Book Reviews: Foundations of Statistical Natural Language Processing , 1999, CL.

[15]  Andrew J. Viterbi,et al.  Error bounds for convolutional codes and an asymptotically optimum decoding algorithm , 1967, IEEE Trans. Inf. Theory.

[16]  S. P. Lloyd,et al.  Least squares quantization in PCM , 1982, IEEE Trans. Inf. Theory.

[17]  W. Eric L. Grimson,et al.  Learning Patterns of Activity Using Real-Time Tracking , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[18]  S. Sclaroff,et al.  Extraction and clustering of motion trajectories in video , 2004, ICPR 2004.

[19]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[20]  Fernando Bação,et al.  Self-organizing Maps as Substitutes for K-Means Clustering , 2005, International Conference on Computational Science.

[21]  Tieniu Tan,et al.  Learning activity patterns using fuzzy self-organizing neural network , 2004, IEEE Transactions on Systems, Man, and Cybernetics, Part B (Cybernetics).

[22]  Peter Xiaoping Liu,et al.  On-line data-driven fuzzy clustering with applications to real-time robotic tracking , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[23]  Toshi Takamori,et al.  Motion planner of mobile robots which avoid moving human obstacles on the basis of stochastic prediction , 1995, 1995 IEEE International Conference on Systems, Man and Cybernetics. Intelligent Systems for the 21st Century.

[24]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[25]  Sebastian Thrun,et al.  Probabilistic robotics , 2002, CACM.

[26]  Anil K. Jain,et al.  Data clustering: a review , 1999, CSUR.

[27]  Judea Pearl,et al.  Probabilistic reasoning in intelligent systems - networks of plausible inference , 1991, Morgan Kaufmann series in representation and reasoning.

[28]  Benjamin Kuipers,et al.  A Hierarchy of Qualitative Representations for Space , 1998, Spatial Cognition.

[29]  Franz Aurenhammer,et al.  Voronoi diagrams—a survey of a fundamental geometric data structure , 1991, CSUR.

[30]  Antonin Guttman,et al.  R-trees: a dynamic index structure for spatial searching , 1984, SIGMOD '84.

[31]  J. R. Boston,et al.  Hidden Markov model topology estimation to characterize the dynamic structure of repetitive lifting data , 1997, Proceedings of the 19th Annual International Conference of the IEEE Engineering in Medicine and Biology Society. 'Magnificent Milestones and Emerging Opportunities in Medical Engineering' (Cat. No.97CH36136).

[32]  Lawrence R. Rabiner,et al.  A tutorial on hidden Markov models and selected applications in speech recognition , 1989, Proc. IEEE.

[33]  Marco Bernardo,et al.  Noninformative Priors Do Not Exist: A Discussion with Jos , 1997 .

[34]  Nir Friedman,et al.  Learning Belief Networks in the Presence of Missing Values and Hidden Variables , 1997, ICML.

[35]  Benjamin King Step-Wise Clustering Procedures , 1967 .

[36]  David C. Hogg,et al.  Detecting inexplicable behaviour , 2004, BMVC.

[37]  Matthew Brand,et al.  Discovery and Segmentation of Activities in Video , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[38]  Sridhar Mahadevan,et al.  Learning hierarchical models of activity , 2004, 2004 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS) (IEEE Cat. No.04CH37566).

[39]  Teuvo Kohonen,et al.  The 'neural' phonetic typewriter , 1988, Computer.

[40]  Neil J. Gordon,et al.  A tutorial on particle filters for online nonlinear/non-Gaussian Bayesian tracking , 2002, IEEE Trans. Signal Process..

[41]  Panos E. Trahanias,et al.  Predictive autonomous robot navigation , 2002, IEEE/RSJ International Conference on Intelligent Robots and Systems.

[42]  Imran N. Junejo,et al.  Multi feature path modeling for video surveillance , 2004, ICPR 2004.

[43]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .

[44]  Ashraf Elnagar,et al.  Motion prediction of moving objects based on autoregressive model , 1998, IEEE Trans. Syst. Man Cybern. Part A.

[45]  Gregory F. Cooper,et al.  The Computational Complexity of Probabilistic Inference Using Bayesian Belief Networks , 1990, Artif. Intell..

[46]  Matthew Brand,et al.  Structure Learning in Conditional Probability Models via an Entropic Prior and Parameter Extinction , 1999, Neural Computation.

[47]  Q. Zhu A stochastic algorithm for obstacle motion prediction in visual guidance of robot motion , 1990, 1990 IEEE International Conference on Systems Engineering.

[48]  David Filliat Cartographie et estimation globale de la position pour un robot mobile autonome , 2001 .

[49]  Shaogang Gong,et al.  Learning Prior and Observation Augmented Density Models for Behaviour Recognition , 1999, BMVC.

[50]  Ronald A. Howard,et al.  Dynamic Programming and Markov Processes , 1960 .

[51]  D. Dennett The Intentional Stance. , 1987 .

[52]  Rex A. Dwyer Higher-dimensional voronoi diagrams in linear expected time , 1991, Discret. Comput. Geom..

[53]  Jeffrey K. Uhlmann,et al.  New extension of the Kalman filter to nonlinear systems , 1997, Defense, Security, and Sensing.

[54]  L. R. Rabiner,et al.  An introduction to the application of the theory of probabilistic functions of a Markov process to automatic speech recognition , 1983, The Bell System Technical Journal.

[55]  Andrew J. Bulpitt,et al.  Learning spatio-temporal patterns for predicting object behaviour , 2000, Image Vis. Comput..

[56]  Shaogang Gong,et al.  Beyond Tracking: Modelling Activity and Understanding Behaviour , 2006, International Journal of Computer Vision.

[57]  Shaogang Gong,et al.  Recognition of group activities using dynamic probabilistic networks , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[58]  Hannah M. Dee,et al.  Explaining visible behaviour , 2005 .

[59]  Friedrich M. Wahl,et al.  Estimation of collision probabilities in dynamic environments for path planning with minimum collision probability , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[60]  Thierry Fraichard,et al.  Motion prediction for moving objects: a statistical approach , 2004, IEEE International Conference on Robotics and Automation, 2004. Proceedings. ICRA '04. 2004.

[61]  Tieniu Tan,et al.  A hierarchical self-organizing approach for learning the patterns of motion trajectories , 2004, IEEE Trans. Neural Networks.

[62]  Wolfram Burgard,et al.  Learning motion patterns of persons for mobile service robots , 2002, Proceedings 2002 IEEE International Conference on Robotics and Automation (Cat. No.02CH37292).

[63]  Marc Blanchet,et al.  An algorithm for the dynamic inference of hidden Markov models (DIHMM) , 1993, 1993 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[64]  Wolfram Burgard,et al.  Learning Motion Patterns of People for Compliant Robot Motion , 2005, Int. J. Robotics Res..

[65]  Alex Pentland,et al.  A Bayesian Computer Vision System for Modeling Human Interactions , 1999, IEEE Trans. Pattern Anal. Mach. Intell..

[66]  Richard Bellman,et al.  Adaptive Control Processes: A Guided Tour , 1961, The Mathematical Gazette.

[67]  Bernd Fritzke,et al.  A Growing Neural Gas Network Learns Topologies , 1994, NIPS.

[68]  Micha Sharir,et al.  Motion Planning in the Presence of Moving Obstacles , 1985, FOCS.

[69]  Biing-Hwang Juang,et al.  Maximum likelihood estimation for multivariate mixture observations of markov chains , 1986, IEEE Trans. Inf. Theory.

[70]  Zhengyou Zhang,et al.  A Flexible New Technique for Camera Calibration , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[71]  R. E. Kalman,et al.  A New Approach to Linear Filtering and Prediction Problems , 2002 .

[72]  Friedrich M. Wahl,et al.  Acquisition of statistical motion patterns in dynamic environments and their application to mobile robot motion planning , 1997, Proceedings of the 1997 IEEE/RSJ International Conference on Intelligent Robot and Systems. Innovative Robotics for Real-World Applications. IROS '97.

[73]  Thierry Artières,et al.  Learning Model Structure from Data: an Application to On-Line Handwriting , 2005 .

[74]  Henry A. Kautz,et al.  Voronoi tracking: location estimation using sparse and noisy sensor data , 2003, Proceedings 2003 IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS 2003) (Cat. No.03CH37453).

[75]  Alex Pentland,et al.  Modeling and Prediction of Human Behavior , 1999, Neural Computation.

[76]  Tim J. Ellis,et al.  Finding Paths in Video Sequences , 2001, BMVC.

[77]  David Heckerman,et al.  A Tutorial on Learning with Bayesian Networks , 1999, Innovations in Bayesian Networks.

[78]  Christian Igel,et al.  A computational efficient covariance matrix update and a (1+1)-CMA for evolution strategies , 2006, GECCO.

[79]  Stan Sclaroff,et al.  Improved Tracking of Multiple Humans with Trajectory Predcition and Occlusion Modeling , 1998 .

[80]  Rajmohan Madhavan,et al.  Moving object prediction for off-road autonomous navigation , 2003, SPIE Defense + Commercial Sensing.

[81]  R. T. Cox Probability, frequency and reasonable expectation , 1990 .

[82]  Yoram Singer,et al.  Training Algorithms for Hidden Markov Models using Entropy Based Distance Functions , 1996, NIPS.

[83]  Leslie Pack Kaelbling,et al.  Acting under uncertainty: discrete Bayesian models for mobile-robot navigation , 1996, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems. IROS '96.

[84]  R. Guy,et al.  Unsolved problems come of age , 1989 .

[85]  Alex Pentland,et al.  Coupled hidden Markov models for complex action recognition , 1997, Proceedings of IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[86]  L. Baum,et al.  A Maximization Technique Occurring in the Statistical Analysis of Probabilistic Functions of Markov Chains , 1970 .

[87]  Tieniu Tan,et al.  A system for learning statistical motion patterns , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[88]  Geoffrey E. Hinton,et al.  A View of the Em Algorithm that Justifies Incremental, Sparse, and other Variants , 1998, Learning in Graphical Models.

[89]  Andreas Stolcke,et al.  Hidden Markov Model} Induction by Bayesian Model Merging , 1992, NIPS.

[90]  Andrew Blake,et al.  Mathematical modelling of animate and intentional motion. , 2003, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[91]  David C. Hogg,et al.  Learning the distribution of object trajectories for event recognition , 1996, Image Vis. Comput..

[92]  M. Tribus,et al.  Probability theory: the logic of science , 2003 .

[93]  Roni Rosenfeld,et al.  Learning Hidden Markov Model Structure for Information Extraction , 1999 .

[94]  J. MacQueen Some methods for classification and analysis of multivariate observations , 1967 .

[95]  Ben J. A. Kröse,et al.  A self-organizing representation of sensor space for mobile robot navigation , 1994, Proceedings of IEEE/RSJ International Conference on Intelligent Robots and Systems (IROS'94).

[96]  Tim J. Ellis,et al.  Spatial and Probabilistic Modelling of Pedestrian Behaviour , 2002, BMVC.

[97]  Helge J. Ritter,et al.  An instantaneous topological mapping model for correlated stimuli , 1999, IJCNN'99. International Joint Conference on Neural Networks. Proceedings (Cat. No.99CH36339).

[98]  Andrew McCallum,et al.  Information Extraction with HMM Structures Learned by Stochastic Optimization , 2000, AAAI/IAAI.

[99]  Derek R. Magee,et al.  Tracking multiple vehicles using foreground, background and motion models , 2004, Image Vis. Comput..

[100]  Jason Morphett,et al.  An integrated algorithm of incremental and robust PCA , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[101]  W. Eric L. Grimson,et al.  Learning Semantic Scene Models by Trajectory Analysis , 2006, ECCV.