Efficient semantic annotation method for indexing large personal video database

As there is a large gap between high-level semantics and low-level features, it is difficult to automatically obtain high-accuracy video semantic annotation through general statistical learning based methods. In this paper, we propose a novel annotation framework based on active learning and semi-supervised ensemble method, which is specially designed for personal video database. To efficiently annotate the home video database, an initial training set is first elaborately constructed based on the distribution analysis of the entire video dataset. Then, both a semi-supervised ensemble based method and an active learning based method are proposed, which aims at minimizing a margin cost function of ensemble to ensure the generalization capacity. The experiment results on about 50 hours home videos show that the proposed method performs superior to both existing semi-supervised learning algorithms and the general active learning algorithms in terms of annotation accuracy and performance stability.

[1]  Avrim Blum,et al.  The Bottleneck , 2021, Monopsony Capitalism.

[2]  Stefan M. Rüger,et al.  Mining multimedia salient concepts for incremental information extraction , 2005, SIGIR '05.

[3]  Boon-Lock Yeo,et al.  Segmentation of Video by Clustering and Graph Analysis , 1998, Comput. Vis. Image Underst..

[4]  Yan Song,et al.  Semi-automatic video semantic annotation based on active learning , 2005, Visual Communications and Image Processing.

[5]  Ayhan Demiriz,et al.  Exploiting unlabeled data in ensemble methods , 2002, KDD.

[6]  David A. Cohn,et al.  Active Learning with Statistical Models , 1996, NIPS.

[7]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[8]  Meng Wang,et al.  Semi-automatic video annotation based on active learning with multiple complementary predictors , 2005, MIR '05.

[9]  Bo Zhang,et al.  An online-optimized incremental learning framework for video semantic classification , 2004, MULTIMEDIA '04.

[10]  Peter L. Bartlett,et al.  Boosting Algorithms as Gradient Descent , 1999, NIPS.

[11]  Nello Cristianini,et al.  Query Learning with Large Margin Classi ersColin , 2000 .

[12]  Samy Bengio,et al.  SVMTorch: Support Vector Machines for Large-Scale Regression Problems , 2001, J. Mach. Learn. Res..

[13]  Ayhan Demiriz,et al.  Semi-Supervised Support Vector Machines , 1998, NIPS.

[14]  Daphne Koller,et al.  Support Vector Machine Active Learning with Applications to Text Classification , 2000, J. Mach. Learn. Res..

[15]  Edward Y. Chang,et al.  Multimodal concept-dependent active learning for image retrieval , 2004, MULTIMEDIA '04.

[16]  Robert E. Schapire,et al.  The Boosting Approach to Machine Learning An Overview , 2003 .

[17]  Darrell Whitley,et al.  A genetic algorithm tutorial , 1994, Statistics and Computing.

[18]  Christophe Ambroise,et al.  Semi-supervised MarginBoost , 2001, NIPS.