Region-based supervised annotation for semantic image retrieval

Abstract: Automatic image annotation is key to semantic image retrieval. We formulate image annotation as a multi-class classification problem within the multi-instance learning framework, which handles the weak annotation problem, that is, training data whose ground truth is given only at the image level. The relationship between low-level visual features and semantic concepts is learned by supervised Bayesian learning from positive bags. For each region in a test image, a posterior probability is computed for each concept from class densities estimated on the training set; this probability is then adjusted according to its relevance to the other regions in the image. The regional posterior probabilities are combined into image-level posterior probabilities, and keywords are selected by rank. The proposed algorithm is tested on standard datasets and achieves good annotation performance.
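The pipeline outlined in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not specify the density model, the relevance adjustment, or the combination rule, so the choices here are assumptions made for illustration only. These are diagonal Gaussian class densities, a blend of each region's posterior with the mean posterior of the other regions (weight `alpha`), and averaging of regional posteriors into the image-level posterior. All function and parameter names are hypothetical.

```python
import math
from collections import defaultdict


def fit_class_densities(training_images):
    """Estimate one density per concept from image-level labels only.

    training_images: list of (regions, labels), where regions is a list of
    feature vectors. Every region of an image labelled c joins c's pool
    (the positive bags for c), since region-level labels are unavailable.
    """
    pooled = defaultdict(list)
    for regions, labels in training_images:
        for label in labels:
            pooled[label].extend(regions)
    total = sum(len(vectors) for vectors in pooled.values())
    models = {}
    for label, vectors in pooled.items():
        dim = len(vectors[0])
        mean = [sum(v[d] for v in vectors) / len(vectors) for d in range(dim)]
        # Assumption: diagonal Gaussian, with a small variance floor.
        var = [sum((v[d] - mean[d]) ** 2 for v in vectors) / len(vectors) + 1e-3
               for d in range(dim)]
        models[label] = (mean, var, len(vectors) / total)
    return models


def region_posterior(x, models):
    """Bayes rule: P(c | x) is proportional to p(x | c) * P(c)."""
    log_scores = {}
    for label, (mean, var, prior) in models.items():
        log_lik = sum(-0.5 * math.log(2 * math.pi * s) - (xi - m) ** 2 / (2 * s)
                      for xi, m, s in zip(x, mean, var))
        log_scores[label] = log_lik + math.log(prior)
    peak = max(log_scores.values())  # subtract the max for numerical stability
    unnorm = {c: math.exp(s - peak) for c, s in log_scores.items()}
    z = sum(unnorm.values())
    return {c: u / z for c, u in unnorm.items()}


def annotate(regions, models, alpha=0.3, top_k=2):
    """Return the top_k keywords and the image-level posterior."""
    posteriors = [region_posterior(x, models) for x in regions]
    n = len(posteriors)
    adjusted = []
    for i, p in enumerate(posteriors):
        if n == 1:
            adjusted.append(p)  # no other regions to provide context
            continue
        # Assumed relevance step: mean posterior of the other regions.
        context = {c: sum(q[c] for j, q in enumerate(posteriors) if j != i) / (n - 1)
                   for c in p}
        adjusted.append({c: (1 - alpha) * p[c] + alpha * context[c] for c in p})
    # Assumed combination rule: average the adjusted regional posteriors.
    image_post = {c: sum(p[c] for p in adjusted) / n for c in models}
    ranked = sorted(image_post, key=image_post.get, reverse=True)
    return ranked[:top_k], image_post


# Toy data: two-dimensional region features, labels at the image level only.
train = [
    ([[0.10, 0.90], [0.12, 0.88]], ["sky"]),
    ([[0.80, 0.20], [0.82, 0.18]], ["grass"]),
    ([[0.50, 0.50], [0.52, 0.48]], ["sand"]),
    ([[0.11, 0.91], [0.81, 0.21]], ["sky", "grass"]),
]
models = fit_class_densities(train)
keywords, image_post = annotate([[0.10, 0.90], [0.80, 0.20]], models)
print(keywords, image_post)
```

Pooling every region of a labelled image into that label's training set is the simplest way to learn from positive bags; it tolerates irrelevant regions only when the relevant ones dominate the pool, which is why the last toy image (labelled with both concepts) widens both densities without changing the ranking.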
