Semantic video classification has become an active research topic to enable more effective video retrieval and knowledge discovery from large-scale video databases. However, most existing techniques for classifier training require a large number of hand-labeled samples to learn correctly. To address this problem, we have proposed a semi-supervised framework to achieve incremental classifier training by integrating a limited number of labeled samples with a large number of unlabeled samples. Specifically, this emi-supervised framework includes: (a) Modeling the semantic video concepts by using the finite mixture models to approximate the class distributions of the relevant salient objects; (b) Developing an adaptive EM algorithm to integrate the unlabeled samples to achieve parameter estimation and model selection simultaneously; The experimental results in a certain domain of medical videos are also provided.
[1]
Sebastian Thrun,et al.
Text Classification from Labeled and Unlabeled Documents using EM
,
2000,
Machine Learning.
[2]
Edward Y. Chang,et al.
Confidence-based dynamic ensemble for image annotation and semantics discovery
,
2003,
MULTIMEDIA '03.
[3]
Jianping Fan,et al.
Concept-oriented indexing of video databases: toward semantic sensitive retrieval and browsing
,
2004,
IEEE Transactions on Image Processing.
[4]
Jianping Fan,et al.
Automatic image annotation by using concept-sensitive salient objects for image content representation
,
2004,
SIGIR '04.