Detecting rare events in video using semantic primitives with HMM

We present a new approach for recognizing rare events in aerial video. We use the framework of hidden Markov models (HMMs) to represent the spatio-temporal relations between objects and uncertainty in observations, where the data observables are semantic spatial primitives encoded based on prior knowledge about the events of interest. Events are observed as a sequence of binarized distance relations among the objects participating in the event. This avoids directly modeling the temporal trajectories of continuous observables, which is difficult when training data is scarce. The approach enables better generalization to other scenes for which little or no training data may be available. We demonstrate the effectiveness of our approach using real aerial video and simulated data.

[1]  Ramakant Nevatia,et al.  Large-scale event detection using semi-hidden Markov models , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[2]  Aaron F. Bobick,et al.  A Framework for Recognizing Multi-Agent Action from Visual Evidence , 1999, AAAI/IAAI.

[3]  David C. Hogg,et al.  Learning the distribution of object trajectories for event recognition , 1996, Image Vis. Comput..

[4]  Biing-Hwang Juang,et al.  Fundamentals of speech recognition , 1993, Prentice Hall signal processing series.

[5]  Ramakant Nevatia,et al.  Representation and optimal recognition of human activities , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[6]  Robert Givan,et al.  Specific-to-General Learning for Temporal Events with Application to Learning Event Definitions from Video , 2002, J. Artif. Intell. Res..

[7]  W. Eric L. Grimson,et al.  Learning Patterns of Activity Using Real-Time Tracking , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[8]  Vera M. Kettnaker Time-dependent HMMs for visual intrusion detection , 2003, 2003 Conference on Computer Vision and Pattern Recognition Workshop.

[9]  Rama Chellappa,et al.  Activity recognition using the dynamics of the configuration of interacting objects , 2003, 2003 IEEE Computer Society Conference on Computer Vision and Pattern Recognition, 2003. Proceedings..

[10]  Shaogang Gong,et al.  Recognition of group activities using dynamic probabilistic networks , 2003, Proceedings Ninth IEEE International Conference on Computer Vision.

[11]  Leo Joskowicz,et al.  Understanding Mechanical Motion: From Images to Behaviors , 1999, Artif. Intell..

[12]  Dorin Comaniciu,et al.  Real-time tracking of non-rigid objects using mean shift , 2000, Proceedings IEEE Conference on Computer Vision and Pattern Recognition. CVPR 2000 (Cat. No.PR00662).

[13]  Aaron F. Bobick,et al.  Recognition of Visual Activities and Interactions by Stochastic Parsing , 2000, IEEE Trans. Pattern Anal. Mach. Intell..