Locally regressive G-optimal design for image retrieval

Content Based Image Retrieval (CBIR) has attracted increasing attention from both academia and industry. Relevance Feedback is one of the most effective techniques to bridge the semantic gap in CBIR. One of the key research problems related to relevance feedback is how to select the most informative images for users to label. In this paper, we propose a novel active learning algorithm, called Locally Regressive G-Optimal Design (LRGOD) for relevance feedback image retrieval. Our assumption is that for each image, its label can be well estimated based on its neighbors via a locally regressive function. LRGOD algorithm is developed based on a locally regressive least squares model which makes use of the labeled and unlabeled images, as well as simultaneously exploits the local structure of each image. The images that can minimize the maximum prediction variance are selected as the most informative ones. We evaluated the proposed LRGOD approach on two real-world image corpus: Corel and NUS-WIDE-OBJECT [5] datasets, and compare it to three state-of-the-art active learning methods. The experimental results demonstrate the effectiveness of the proposed approach.

[1]  Robert H. Halstead,et al.  Matrix Computations , 2011, Encyclopedia of Parallel Computing.

[2]  Thomas S. Huang,et al.  Relevance feedback: a power tool for interactive content-based image retrieval , 1998, IEEE Trans. Circuits Syst. Video Technol..

[3]  Golub Gene H. Et.Al Matrix Computations, 3rd Edition , 2007 .

[4]  Tao Mei,et al.  Joint multi-label multi-instance learning for image classification , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[5]  Mikhail Belkin,et al.  Manifold Regularization: A Geometric Framework for Learning from Labeled and Unlabeled Examples , 2006, J. Mach. Learn. Res..

[6]  Xiaofei He,et al.  A unified active and semi-supervised learning framework for image compression , 2009, 2009 IEEE Conference on Computer Vision and Pattern Recognition.

[7]  Meng Wang,et al.  MSRA atT TRECVID 2008: High-Level Feature Extraction and Automatic Search , 2008, TRECVID.

[8]  Kun Zhou,et al.  Laplacian optimal design for image retrieval , 2007, SIGIR.

[9]  Chun Chen,et al.  G-Optimal Design with Laplacian Regularization , 2010, AAAI.

[10]  Meng Wang,et al.  Active learning in multimedia annotation and retrieval: A survey , 2011, TIST.

[11]  Meng Wang,et al.  Beyond Distance Measurement: Constructing Neighborhood Similarity for Video Annotation , 2009, IEEE Transactions on Multimedia.

[12]  Edward Y. Chang,et al.  Support vector machine active learning for image retrieval , 2001, MULTIMEDIA '01.

[13]  Edward Y. Chang,et al.  Active Learning for Interactive Multimedia Retrieval , 2008, Proceedings of the IEEE.

[14]  Tao Mei,et al.  Graph-based semi-supervised learning with multiple labels , 2009, J. Vis. Commun. Image Represent..

[15]  Xian-Sheng Hua,et al.  Content-aware Ranking for visual search , 2010, 2010 IEEE Computer Society Conference on Computer Vision and Pattern Recognition.

[16]  Xian-Sheng Hua,et al.  Towards a Relevant and Diverse Search of Social Images , 2010, IEEE Transactions on Multimedia.

[17]  Bernhard Schölkopf,et al.  Transductive Classification via Local Learning Regularization , 2007, AISTATS.

[18]  Tat-Seng Chua,et al.  NUS-WIDE: a real-world web image database from National University of Singapore , 2009, CIVR '09.

[19]  Eric R. Ziegel,et al.  The Elements of Statistical Learning , 2003, Technometrics.

[20]  Meng Wang,et al.  Visual query suggestion , 2009, ACM Multimedia.

[21]  Meng Wang,et al.  Unified Video Annotation via Multigraph Learning , 2009, IEEE Transactions on Circuits and Systems for Video Technology.

[22]  Xiaofei He,et al.  Laplacian Regularized D-Optimal Design for Active Learning and Its Application to Image Retrieval , 2010, IEEE Transactions on Image Processing.

[23]  Yi Yang,et al.  Ranking with local regression and global alignment for cross media retrieval , 2009, ACM Multimedia.

[24]  Qi Tian,et al.  Visual Synset: Towards a higher-level visual representation , 2008, 2008 IEEE Conference on Computer Vision and Pattern Recognition.

[25]  Nicu Sebe,et al.  Content-based multimedia information retrieval: State of the art and challenges , 2006, TOMCCAP.

[26]  Chun Chen,et al.  Convex experimental design using manifold structure for image retrieval , 2009, MM '09.

[27]  Jinbo Bi,et al.  Active learning via transductive experimental design , 2006, ICML.

[28]  Xiaojin Zhu,et al.  --1 CONTENTS , 2006 .

[29]  James Ze Wang,et al.  Content-based image retrieval: approaches and trends of the new age , 2005, MIR '05.

[30]  James Ze Wang,et al.  Image retrieval: Ideas, influences, and trends of the new age , 2008, CSUR.