Temporally Consistent Gaussian Random Field for Video Semantic Analysis

As a major family of semi-supervised learning methods, graph-based semi-supervised learning has recently attracted considerable interest in the machine learning community as well as in many application areas. However, when applied to video semantic annotation, these methods consider only the relations among samples in the feature space and neglect an intrinsic property of video data: temporally adjacent video segments (e.g., shots) usually share similar semantic concepts. In this paper, we incorporate this temporal consistency property of video data into graph-based semi-supervised learning and propose a novel method, named temporally consistent Gaussian random field (TCGRF), to improve annotation results. Experiments conducted on the TRECVID data set demonstrate its effectiveness.
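For concreteness, one plausible way to realize this idea (a sketch under stated assumptions, not necessarily the paper's exact objective) is to augment the standard Gaussian random field energy over the feature-space affinity graph with a temporal smoothness term; the trade-off weight $\lambda$ and the temporal pair set $\mathcal{T}$ below are illustrative assumptions:

\[
E(f) \;=\; \frac{1}{2} \sum_{i,j} w_{ij} \, (f_i - f_j)^2 \;+\; \frac{\lambda}{2} \sum_{(s,t) \in \mathcal{T}} (f_s - f_t)^2 ,
\]

where $w_{ij}$ is the feature-space affinity between shots $i$ and $j$, $\mathcal{T}$ is the set of temporally adjacent shot pairs, $\lambda > 0$ balances feature-space against temporal smoothness, and $f$ is clamped to the given labels on the labeled shots. Because the temporal term simply adds edges to the graph, the minimizer remains harmonic with respect to the combined graph Laplacian and admits the same closed-form solution as the standard Gaussian random field.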