Hierarchical genetic fusion of possibilities

Classification and fusion are major tasks in many applications and in particular for automatic semantic-based video content indexing and retrieval. In this paper, we focus on the challenging task of classifier output fusion. It is a necessary step to efficiently estimate the semantic content of video shots from multiple cues. We propose to fuse the numeric information provided by multiple classifiers in the framework of possibility logic. In this framework, many operators with different properties were suggested to achieve the fusion. We present a binary tree structure to model the fusion mechanism of available cues and the genetic algorithms that are used to determine the most appropriate operators and fusion tree structure. Experiments are conducted in the framework of TRECVID feature extraction task that consists in ordering shots with respect to their relevance to a given class. Finally, we will show the efficiency of our approach.

[1]  Fabrice Souvannavong,et al.  Latent semantic analysis for an effective region-based video shot retrieval system , 2004, MIR '04.

[2]  Edward Y. Chang,et al.  Optimal multimodal fusion for multimedia data analysis , 2004, MULTIMEDIA '04.

[3]  Fabrice Souvannavong,et al.  Multi-modal classifier fusion for video shot content retrieval , 2005 .

[4]  Vladimir Vapnik,et al.  The Nature of Statistical Learning , 1995 .

[5]  Lakhmi C. Jain,et al.  Designing classifier fusion systems by genetic algorithms , 2000, IEEE Trans. Evol. Comput..

[6]  Fabrice Souvannavong,et al.  Video content modeling with latent semantic analysis , 2003 .

[7]  Nicu Sebe,et al.  Salient Points for Content-Based Retrieval , 2001, BMVC.

[8]  Ludmila I. Kuncheva,et al.  A Theoretical Study on Six Classifier Fusion Strategies , 2002, IEEE Trans. Pattern Anal. Mach. Intell..

[9]  Richard A. Harshman,et al.  Indexing by Latent Semantic Analysis , 1990, J. Am. Soc. Inf. Sci..

[10]  Roberto Manduchi,et al.  A Study on Bayes Feature Fusion for Image Classification , 2003, 2003 Conference on Computer Vision and Pattern Recognition Workshop.

[11]  Gérard Chollet,et al.  Multi-modal identity verification using expert fusion , 2000, Inf. Fusion.

[12]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[13]  Harriet J. Nock,et al.  Semantic indexing of multimedia using audio, text and visual cues , 2002, Proceedings. IEEE International Conference on Multimedia and Expo.

[14]  Nello Cristianini,et al.  An introduction to Support Vector Machines , 2000 .

[15]  P. Beek,et al.  Text of 15938-5 FCD Information Technology-Multimedia Content Description Interface-Pard 5 Multimedia Description Schemes , 2001 .

[16]  Didier Dubois,et al.  Possibility Theory and its Applications: a Retrospective and Prospective view , 2006, Decision Theory and Multi-Agent Planning.

[17]  Bogdan Gabrys,et al.  Classifier selection for majority voting , 2005, Inf. Fusion.

[18]  Josef Kittler,et al.  A Framework for Classifier Fusion: Is It Still Needed? , 2000, SSPR/SPR.

[19]  Yoav Freund,et al.  Experiments with a New Boosting Algorithm , 1996, ICML.

[20]  John R. Smith,et al.  Normalized classifier fusion for semantic visual concept detection , 2003, Proceedings 2003 International Conference on Image Processing (Cat. No.03CH37429).

[21]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[23]  Pedro F. Felzenszwalb,et al.  Efficiently computing a good segmentation , 1998 .

[24]  Brendan J. Frey,et al.  Probabilistic multimedia objects (multijects): a novel approach to video indexing and retrieval in multimedia systems , 1998, Proceedings 1998 International Conference on Image Processing. ICIP98 (Cat. No.98CB36269).