An improved clustering for action recognition in online video

A new method for human action recognition in online video sequences using Latent Dirichlet Markov Clustering (LDMC) is proposed. Video sequences are represented by a novel “bag-of-words” representation, and each frame corresponds to a “word”. LDMC builds on Hidden Markov Models (HMMs) and Latent Dirichlet Allocation, and it overcome their low recognition rate, robustness and high computational complexity. A collapsed Gibbs sampler is designed for offline learning with unlabeled training data, and a new approximation to online Bayesian inference is formulated to enable human action recognition in new online video sequence in real-time. The strength of this model is demonstrated by unsupervised learning of human action categories and detecting salient actions in one complex and crowded public scenes.

[1]  Hong Zhou,et al.  Real-time gait classification based on fuzzy associative memory , 2010, Int. J. Model. Identif. Control..

[2]  Hanna M. Wallach,et al.  Topic modeling: beyond bag-of-words , 2006, ICML.

[3]  Barbara Caputo,et al.  Recognizing human actions: a local SVM approach , 2004, Proceedings of the 17th International Conference on Pattern Recognition, 2004. ICPR 2004..

[4]  A. McCallum,et al.  Topical N-Grams: Phrase and Topic Discovery, with an Application to Information Retrieval , 2007, Seventh IEEE International Conference on Data Mining (ICDM 2007).

[5]  Stefano Soatto,et al.  Detecting Humans via Their Pose , 2006, NIPS.

[6]  Michal Rosen-Zvi,et al.  Hidden Topic Markov Models , 2007, AISTATS.

[7]  Mark Steyvers,et al.  Finding scientific topics , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[8]  Greg Mori,et al.  IEEE TRANSACTIONS ON PATTERN ANALYSIS AND MACHINE INTELLIGENCE, VOL., NO. 1 Human Action Recognition by Semi-Latent Topic Models , 2022 .

[9]  Juan Carlos Niebles,et al.  Unsupervised Learning of Human Action Categories , 2006 .

[10]  Jianbo Shi,et al.  Detecting unusual activity in video , 2004, CVPR 2004.

[11]  Yang Wang,et al.  Human Action Recognition by Semilatent Topic Models , 2009, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[12]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..