Ontology-based multi-classification learning for video concept detection

In this paper, an ontology-based multi-classification learning algorithm is adopted to detect concepts in the NIST TREC-2003 video retrieval benchmark which defines 133 video concepts, organized hierarchically and each video data can belong to one or more concepts. The algorithm consists of two steps. In the first step, each single concept model is constructed independently. In the second step, ontology-based concept learning improves the accuracy of the individual concept by considering the possible influence relations between concepts based on a predefined ontology hierarchy. The advantage of ontology learning is that its influence path is based on an ontology hierarchy, which has real semantic meanings. Besides semantics, it also considers the data correlation to decide the exact influence assigned to each path, which makes the influence more flexible according to data distribution. This learning algorithm can be used for multiple topic document classification such as Internet documents and video documents. We demonstrate that precision-recall can be significantly improved by taking ontology into account

[1]  Rainer Lienhart,et al.  Comparison of automatic shot boundary detection algorithms , 1998, Electronic Imaging.

[2]  Aaron Kershenbaum,et al.  The Effect of Using Hierarchical Classifiers in Text Categorization , 2000, RIAO.

[3]  Robert Tibshirani,et al.  Classification by Pairwise Coupling , 1997, NIPS.

[4]  Harriet J. Nock,et al.  Assessing face and speech consistency for monologue detection in video , 2002, MULTIMEDIA '02.

[5]  John R. Smith,et al.  Normalized classifier fusion for semantic visual concept detection , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[6]  Charles A. Bouman,et al.  Storage and Retrieval for Image and Video Databases VII , 1998 .

[7]  Edward Y. Chang,et al.  Confidence-based dynamic ensemble for image annotation and semantics discovery , 2003, MULTIMEDIA '03.

[8]  John R. Smith,et al.  A framework for moderate vocabulary semantic visual concept detection , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[9]  R. Tibshirani,et al.  Classi cation by Pairwise Coupling , 1998 .

[10]  U. M. Feyyad Data mining and knowledge discovery: making sense out of data , 1996 .

[11]  John R. Smith,et al.  VideoAL: a novel end-to-end MPEG-7 video automatic labeling system , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[12]  John R. Smith,et al.  VideoAnnEx: IBM MPEG-7 Annotation Tool for Multimedia Indexing and Concept Learning , 2003 .

[13]  Tom M. Mitchell,et al.  Improving Text Classification by Shrinkage in a Hierarchy of Classes , 1998, ICML.

[14]  Maurice Bruynooghe,et al.  Hierarchical multi-classification , 2002, KDD 2002.

[15]  Harriet J. Nock,et al.  Discriminative model fusion for semantic concept detection and annotation in video , 2003, ACM Multimedia.