Weighted Subspace Filtering and Ranking Algorithms for Video Concept Retrieval

The rapid increase in the amount of digital data has posed significant challenges to retrieval systems, which are expected to effectively and efficiently filter the data and provide relevant results. Filtering is a technique that automatically selects features or data instances to represent the data, reduce storage costs, prune redundancy, decrease computation costs, enhance model learning performance, and dynamically deliver the media. On the other hand, ranking is a critical step to render as many relevant results as possible on the basis of some loss functions or similarity measures. This article presents a novel retrieval framework that consists of weighted subspace-based filtering and ranking components to address these challenges. One critical issue in multimedia retrieval is the difficulty of comparing semantics between low-level features (such as color and shape) and high-level concepts (such as sky and ocean). Because multiple correspondence analysis (MCA) is powerful in exploring relationships among high-dimensional categorical variables, it is attractive to apply MCA to explore relationships between feature categories and concept classes. However, existing correlation based algorithms are weak in capturing semantics through a numeric measure such as conditional entropy to guide the search of features. Most existing subspace-based algorithms, while considering semantics, mainly focus on the representations of features in new spaces or relationships among high-level concepts. Therefore, our filtering algorithm transfers the semantic correlation captured in the subspace to the feature weights to distinguish features and imbalanced data instances. Relevance of retrieval results is also crucial in multimedia, where effectiveness is measured by the ranking process. Our proposed algorithm is designed to overcome the limitation of the most commonly used ranking methods, which treat all data instances equally within one group during the training step. Our algorithm ranks data instances using dissimilarity values in the subspace toward both positive and nega tive one-class models from the learning phase. The idea behind this is that even those data instances in the same relevant group may have implicit different significances. For exam ple, the data instances closer to the relevant group's center could indicate more relevance than those that are further from the center. Meanwhile, a low-dimensional representation of the data instances could compactly characterize the structure of the data instance groups, making it possible to achieve an easy separation of relevant and irrelevant groups and effectively rank the retrieved results. To demonstrate the effectiveness of the proposed framework, we evaluated 30 high-level concepts and data sets from Trecvid 2008 and 2009.

[1]  Chong-Wah Ngo,et al.  Representations of Keypoint-Based Semantic Concept Detection: A Comprehensive Study , 2010, IEEE Transactions on Multimedia.

[2]  Gregory N. Hullender,et al.  Learning to rank using gradient descent , 2005, ICML.

[3]  Tat-Seng Chua,et al.  TRECVID 2005 by NUS PRIS , 2005, TRECVID.

[4]  Ian H. Witten,et al.  Data mining: practical machine learning tools and techniques, 3rd Edition , 1999 .

[5]  Michael R. Lyu,et al.  A Multimodal and Multilevel Ranking Scheme for Large-Scale Video Retrieval , 2008, IEEE Transactions on Multimedia.

[6]  Yoram Singer,et al.  An Efficient Boosting Algorithm for Combining Preferences by , 2013 .

[7]  Thorsten Joachims,et al.  Optimizing search engines using clickthrough data , 2002, KDD.

[8]  Shih-Fu Chang,et al.  Columbia University’s Baseline Detectors for 374 LSCOM Semantic Visual Concepts , 2007 .

[9]  Ian Witten,et al.  Data Mining , 2000 .

[10]  Martti Juhola,et al.  On principal component analysis, cosine and Euclidean measures in information retrieval , 2007, Inf. Sci..

[11]  Neil Salkind Encyclopedia of Measurement and Statistics , 2006 .

[12]  Paul Over,et al.  Evaluation campaigns and TRECVid , 2006, MIR '06.

[13]  Rong Yan,et al.  Efficient Margin-Based Rank Learning Algorithms for Information Retrieval , 2006, CIVR.

[14]  Mei-Ling Shyu,et al.  Effective Feature Space Reduction with Imbalanced Data for Semantic Concept Detection , 2008, 2008 IEEE International Conference on Sensor Networks, Ubiquitous, and Trustworthy Computing (sutc 2008).

[15]  Min Chen,et al.  An Effective Multi-concept Classifier for Video Streams , 2008, 2008 IEEE International Conference on Semantic Computing.

[16]  Tao Mei,et al.  Graph-Based Pairwise Learning to Rank for Video Search , 2009, MMM.