Recognizing linked events: Searching the space of feasible explanations

The ambiguity inherent in a localized analysis of events from video can be resolved by exploiting constraints between events and examining only feasible global explanations. We show how jointly recognizing and linking events can be formulated as labeling of a Bayesian network. The framework can be extended to multiple linking layers, expressing explanations as compositional hierarchies. The best global explanation is the maximum a posteriori (MAP) solution over a set of feasible explanations. The search space is sampled using reversible jump Markov chain Monte Carlo (RJMCMC). We propose a set of general move types that is extensible to multiple layers of linkage, and use simulated annealing to find the MAP solution given all observations. We provide experimental results for a challenging two-layer linkage problem, demonstrating the ability to recognise and link drop and pick events of bicycles in a rack over five days.

[1]  Nando de Freitas,et al.  An Introduction to MCMC for Machine Learning , 2004, Machine Learning.

[2]  Songhwai Oh,et al.  Markov chain Monte Carlo data association for general multiple-target tracking problems , 2004, 2004 43rd IEEE Conference on Decision and Control (CDC) (IEEE Cat. No.04CH37601).

[3]  Yaacov Ritov,et al.  Tracking Many Objects with Many Sensors , 1999, IJCAI.

[4]  Stuart J. Russell,et al.  Object Identification: A Bayesian Analysis with Application to Traffic Surveillance , 1998, Artif. Intell..

[5]  Ben J. A. Kröse,et al.  A sequential Bayesian algorithm for surveillance with nonoverlapping cameras , 2005, Int. J. Pattern Recognit. Artif. Intell..

[6]  Gérard G. Medioni,et al.  Multiple Target Tracking Using Spatio-Temporal Markov Chain Monte Carlo Data Association , 2007, 2007 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Dima Damen,et al.  Associating People Dropping off and Picking up Objects , 2007, BMVC.

[8]  Shaogang Gong,et al.  Recognition of group activities using dynamic probabilistic networks , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[9]  Derek R. Magee,et al.  Tracking multiple vehicles using foreground, background and motion models , 2004, Image Vis. Comput..

[10]  Stan Sclaroff,et al.  Layered Graphical Models for Tracking Partially Occluded Objects , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[11]  Kevin Smith,et al.  Bayesian methods for visual multi-object tracking with applications to human activity recognition , 2007 .

[12]  Mun Wai Lee,et al.  A rank constrained continuous formulation of multi-frame multi-target tracking problem , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[13]  A. Mira Highly Structured Stochastic Systems. Peter J. Green, Nils Lid Hjort, and Sylvia Richardson , 2005 .

[14]  Donald Reid An algorithm for tracking multiple targets , 1978 .

[15]  Frank Dellaert,et al.  MCMC Data Association and Sparse Factorization Updating for Real Time Multitarget Tracking with Merged and Multiple Measurements , 2006, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[16]  A. G. Amitha Perera,et al.  Joint Recognition of Complex Events and Track Matching , 2006, 2006 IEEE Computer Society Conference on Computer Vision and Pattern Recognition (CVPR'06).

[17]  P. Green,et al.  Trans-dimensional Markov chain Monte Carlo , 2000 .