A semi-supervised incremental learning method based on adaptive probabilistic hypergraph for video semantic detection

Semantic categorization for the complex videos is an ambiguous task. The semi-supervised learning method based on hypergraph model can achieve multi-semantics labels, but a hypergraph model is sensitive to the radius parameter when it is constructed and the number of vertices belonging to a hyperedge is fixed. In this paper, a semi-supervised incremental learning method based on adaptive probabilistic hypergraph for video semantic detection is presented. In the probabilistic hypergraph modeling, a formula is presented as a measurement to adaptively decide whether a vertex is belonged to a hyperedge or not. The model has high robustness and can overcome the defect of fixed number of vertices belonging to the same hyperedge in the traditional probabilistic hypergraph model. In the semi-supervised incremental learning process, a threshold is defined, which is used to judge whether unlabeled sample can be added into the modeling, in order that the model learning result for unlabeled samples has high certainty. The experimental results show that our method can improve the model generalization ability and utilize the unlabeled samples effectively. In the aspects of recall rate and precision rate for semantic video concept detection from complex videos, our proposed method outperforms the compared methods.

[1]  Ruoming Jin,et al.  A Hypergraph-based Method for Discovering Semantically Associated Itemsets , 2011, 2011 IEEE 11th International Conference on Data Mining.

[2]  Meng Wang,et al.  Adaptive Hypergraph Learning and its Application in Image Classification , 2012, IEEE Transactions on Image Processing.

[3]  Tao Mei,et al.  Graph-based semi-supervised learning with multi-label , 2008, 2008 IEEE International Conference on Multimedia and Expo.

[4]  Qingshan Liu,et al.  Hypergraph with sampling for image retrieval , 2011, Pattern Recognit..

[5]  Sun Jia-yao Semantic detection of video events based on probabilistic hyper-graph , 2012 .

[6]  Li Li,et al.  Semantic event representation and recognition using syntactic attribute graph grammar , 2009, Pattern Recognit. Lett..

[7]  Zhi-Hua Zhou,et al.  New Semi-Supervised Classification Method Based on Modified Cluster Assumption , 2012, IEEE Transactions on Neural Networks and Learning Systems.

[8]  Mei-Ling Shyu,et al.  Model-driven collaboration and information integration for enhancing video semantic concept detection , 2012, 2012 IEEE 13th International Conference on Information Reuse & Integration (IRI).

[9]  Pengfei Guo,et al.  The enhanced genetic algorithms for the optimization design , 2010, 2010 3rd International Conference on Biomedical Engineering and Informatics.

[10]  Mubarak Shah,et al.  Monitoring human behavior from video taken in an office environment , 2001, Image Vis. Comput..

[11]  Qingshan Liu,et al.  Image retrieval via probabilistic hypergraph ranking , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[12]  Saeid Belkasim,et al.  Scale Invariants of Radial Tchebichef Moments for Shape-Based Image Retrieval , 2009, 2009 11th IEEE International Symposium on Multimedia.

[13]  Yihong Gong,et al.  Unsupervised Image Categorization by Hypergraph Partition , 2011, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[14]  Zhi-Hua Zhou,et al.  Multi-View Video Summarization , 2010, IEEE Transactions on Multimedia.

[15]  Yongzhao Zhan,et al.  Shot retrieval based on fuzzy evolutionary aiNet and hybrid features , 2011, Comput. Hum. Behav..

[16]  Gang Chen,et al.  Efficient multi-label classification with hypergraph regularization , 2009, CVPR.

[17]  Baogang Wei,et al.  Multiple hypergraph ranking for video concept detection , 2010, Journal of Zhejiang University SCIENCE C.

[18]  Hongyang Chao,et al.  Discovering Video Shot Categories by Unsupervised Stochastic Graph Partition , 2013, IEEE Transactions on Multimedia.

[19]  Jen-Tzung Chien,et al.  Adaptive Bayesian Latent Semantic Analysis , 2008, IEEE Transactions on Audio, Speech, and Language Processing.

[20]  Gang Chen,et al.  Efficient multi-label classification with hypergraph regularization , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[21]  He Zhao,et al.  Traffic incident classification at intersections based on image sequences by HMM/SVM classifiers , 2010, Multimedia Tools and Applications.

[22]  Yanan Liu,et al.  Tensor-based Video Categorization via Sparse Representation , 2011 .

[23]  Yi-Ding Wang,et al.  Hand vein recognition based on multi-scale LBP and wavelet , 2011, 2011 International Conference on Wavelet Analysis and Pattern Recognition.

[24]  Matej Kristan,et al.  A trajectory-based analysis of coordinated team activity in a basketball game , 2009, Comput. Vis. Image Underst..

[25]  Meng Chang Chen,et al.  Using Incremental PLSI for Threshold-Resilient Online Event Analysis , 2008, IEEE Transactions on Knowledge and Data Engineering.