Probabilistic model supported rank aggregation for the semantic concept detection in video

Rank aggregation (RA) is an important classifier combination technology for semantic concept detection (SCD) in video because modelling of semantic concepts based on multimodal representations requires effective and robust methods of classifier combination. Although many RA methods have been developed and proven workable in practice, there are few theoretical hints for devising better ones because the reasons why RA can improve the classification precision have not been thoroughly elucidated. In this work, we use the order statistics to reveal the meaning of rank and RA for classification problems and propose the Probabilistic Model Supported Rank Aggregation (PMSRA) framework, which not only provides a probabilistic interpretation of why RA may be good for classification but also serves as a possible guide to new RA methods. Moreover, we apply the principle of Bayesian decision to the PMSRA framework to develop a new RA method, i.e. the Bayesian PMSRA. The effectiveness and robustness of our method have been further collaborated by the experimental results of incremental RA for SCD in video on the TRECVID 2005's dataset and an artificial dataset.

[1]  Paul Over,et al.  TRECVID 2005 - An Overview , 2005, TRECVID.

[2]  Moni Naor,et al.  Rank aggregation methods for the Web , 2001, WWW '01.

[3]  J. Berger Statistical Decision Theory and Bayesian Analysis , 1988 .

[4]  Alexander G. Hauptmann Lessons for the Future from a Decade of Informedia Video Analysis Research , 2005, CIVR.

[5]  R. Forthofer,et al.  Rank Correlation Methods , 1981 .

[6]  Herbert A. David,et al.  Order Statistics, Third Edition , 2003, Wiley Series in Probability and Statistics.

[7]  Javed A. Aslam,et al.  Condorcet fusion for improved retrieval , 2002, CIKM '02.

[8]  S. Shapiro,et al.  Mathematics without Numbers , 1993 .

[9]  Lambert Schomaker,et al.  Variants of the Borda count method for combining ranked classifier hypotheses , 2000 .

[10]  Dong Wang,et al.  AP-Based Borda Voting Method for Feature Extraction in TRECVID-2004 , 2005, ECIR.

[11]  H. Young Condorcet's Theory of Voting , 1988, American Political Science Review.

[12]  Ofer Melnik,et al.  Mixed group ranks: preference and confidence in classifier combination , 2004, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[13]  Anil K. Jain,et al.  Statistical Pattern Recognition: A Review , 2000, IEEE Trans. Pattern Anal. Mach. Intell..

[14]  Ofer Melnik,et al.  A Probability Model for Combining Ranks , 2005, Multiple Classifier Systems.

[15]  R. Graham,et al.  Spearman's Footrule as a Measure of Disarray , 1977 .

[16]  Edward Y. Chang,et al.  Optimal multimodal fusion for multimedia data analysis , 2004, MULTIMEDIA '04.

[17]  Alexander G. Hauptmann,et al.  Towards a Large Scale Concept Ontology for Broadcast Video , 2004, CIVR.