Model vector-based retrieval is a novel approach to video indexing in which each video item is represented by a semantic model vector: a signature that records the detection confidence for each concept in a fixed lexicon. The basis for the model vector is a set of independent binary classifiers, one per semantic concept. A model vector is built by applying these binary detectors to the video content and recording the confidence of each detection. Once the model vectors have been extracted, simple search techniques can be used to find similar items in a video database. However, confidence scores alone do not capture how reliable the underlying detectors are, so additional techniques are needed to maintain good performance when detector quality varies. In this paper, we examine the model vector-based retrieval framework for video and propose methods that use detector validity to improve matching performance. In particular, we develop a model vector distance metric that weights each dimension by its detector's validity score. We empirically evaluate retrieval effectiveness on a large video test collection using different methods of measuring and incorporating detector validity indicators.
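The pipeline described above (apply detectors, collect confidences, match with a validity-weighted distance) can be illustrated with a minimal sketch. The abstract does not give the exact form of the metric, so the weighted Euclidean distance below is an assumption, as are the function names, the representation of detectors as callables returning a confidence in [0, 1], and the use of one non-negative validity score per detector.

```python
import math


def build_model_vector(detectors, item):
    """Apply each concept detector to an item and collect confidence scores.

    `detectors` is a hypothetical list of callables, one per lexicon concept,
    each returning a detection confidence in [0, 1].
    """
    return [detector(item) for detector in detectors]


def weighted_distance(query, target, validity):
    """Validity-weighted Euclidean distance between two model vectors.

    Assumed form: each squared difference is scaled by the normalized
    validity score of the detector for that dimension.
    """
    if not (len(query) == len(target) == len(validity)):
        raise ValueError("vectors and validity scores must have equal length")
    total = sum(validity)
    if total <= 0:
        raise ValueError("at least one validity score must be positive")
    # Normalize so the weights sum to 1; reliable detectors contribute more.
    weights = [v / total for v in validity]
    return math.sqrt(
        sum(w * (q - t) ** 2 for w, q, t in zip(weights, query, target))
    )


def rank_by_similarity(query, database, validity):
    """Return database keys ordered from nearest to farthest from the query."""
    return sorted(
        database,
        key=lambda key: weighted_distance(query, database[key], validity),
    )


# Illustrative data only: three concepts, two stored items.
query = [0.9, 0.1, 0.5]
database = {"a": [0.9, 0.1, 0.0], "b": [0.0, 0.1, 0.5]}

# Trusting the first detector favors item "a"; trusting the third favors "b".
ranking_first_trusted = rank_by_similarity(query, database, [0.8, 0.5, 0.1])
ranking_third_trusted = rank_by_similarity(query, database, [0.1, 0.5, 0.9])
```

In this sketch the same query yields a different ranking depending on which detectors are considered reliable, which is the effect a validity-weighted metric is meant to produce.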