Sequential Correspondence Hierarchical Dirichlet Processes for Video Data Analysis

Multimedia data mining based on topic models has become a popular research topic in recent years. In this paper, we propose a novel topic model, sequential correspondence hierarchical Dirichlet processes (Seq-cHDP), to learn the hidden structure within video data. Seq-cHDP extends the hierarchical Dirichlet processes (HDP) model with two important features: a time-dependency mechanism that connects neighboring video frames under a time-dependent Markovian assumption, and a data correspondence mechanism that handles multimodal data, such as the mixture of visual words and speech words extracted from video files. We evaluate Seq-cHDP comprehensively through experiments and demonstrate that it outperforms the baseline models.
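To make the two mechanisms concrete, the following is a minimal generative sketch, not the construction used in the paper, whose details the abstract does not give. It assumes a finite truncation of the HDP to `K` topics, a hypothetical mixing weight `w` that pulls each frame's topic proportions toward those of the previous frame (one possible reading of the Markovian assumption), and a correspondence step in which each speech word reuses a topic drawn uniformly from the topics already assigned to the visual words of the same frame. All function names, parameters, and default values are illustrative assumptions.

```python
import numpy as np

def stick_breaking(gamma, K, rng):
    """Truncated stick-breaking weights over K topics (sums to 1)."""
    v = rng.beta(1.0, gamma, size=K)
    v[-1] = 1.0  # close the last stick so the truncated weights sum to 1
    remaining = np.concatenate(([1.0], np.cumprod(1.0 - v[:-1])))
    return v * remaining

def generate_frames(T, n_visual, n_speech, K=10, V_vis=50, V_sp=30,
                    gamma=1.0, alpha=5.0, w=0.5, seed=0):
    """Sample T frames of visual and speech words (illustrative only)."""
    rng = np.random.default_rng(seed)
    beta = stick_breaking(gamma, K, rng)  # global topic weights
    # Per-topic word distributions for each modality (assumed priors).
    phi_vis = rng.dirichlet(np.full(V_vis, 0.5), size=K)
    phi_sp = rng.dirichlet(np.full(V_sp, 0.5), size=K)

    frames, prev_pi = [], beta
    for _ in range(T):
        # Time dependency (assumption): the base measure of frame t mixes
        # the previous frame's proportions with the global weights.
        base = w * prev_pi + (1.0 - w) * beta
        pi = rng.dirichlet(alpha * base + 1e-6)

        # Visual words: topic from pi, then word from that topic.
        z = rng.choice(K, size=n_visual, p=pi)
        visual = np.array([rng.choice(V_vis, p=phi_vis[k]) for k in z])

        # Correspondence (assumption): each speech word reuses a topic
        # picked uniformly from the visual-word topics of this frame.
        y = rng.choice(z, size=n_speech)
        speech = np.array([rng.choice(V_sp, p=phi_sp[k]) for k in y])

        frames.append({"pi": pi, "z": z, "visual": visual,
                       "y": y, "speech": speech})
        prev_pi = pi
    return frames

frames = generate_frames(T=4, n_visual=20, n_speech=5)
```

In this sketch the speech-word topics of a frame are always a subset of its visual-word topics, which is what ties the two modalities together; setting `w=0` removes the dependence between neighboring frames and reduces the sketch to a truncated HDP with a correspondence step.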
