Mapping image segments to words for image retrieval

This paper proposes a new method for mapping image segments to words in three layers for image retrieval. Our main goal here is to incorporate higher-level semantics into the retrieval process and thus narrow the gap between the user's interpretation and the automatically extracted low-level visual features of the same image content. The method is based on nonlinear segmentation, as well as clustering and statistical learning applied to both visual and textual features to find semantic relations between visual segment clusters and words of various abstraction levels. Experiments conducted on a wide, natural image domain shows that step-by-step semantic inferencing in image-word mapping helps to improve retrieval performance. The method supports various textual and/or visual browsing and searching schemes and is proved to be very useful for effective browsing and retrieval in large image data sets.

[1]  Andrea Kutics,et al.  Image retrieval via the inhomogeneous diffusion of luminance and texture features , 2000, J. Electronic Imaging.

[2]  Andrea Kutics,et al.  An object-based image retrieval system using an inhomogeneous diffusion model , 1999, Proceedings 1999 International Conference on Image Processing (Cat. 99CH36348).

[3]  A. Ravishankar Rao,et al.  The Texture Lexicon: Understanding the Categorization of Visual Texture Terms and Their Relationship to Texture Images , 1997, Cogn. Sci..

[4]  John R. Smith,et al.  Multimedia semantic indexing using model vectors , 2003, 2003 International Conference on Multimedia and Expo. ICME '03. Proceedings (Cat. No.03TH8698).

[5]  James Ze Wang,et al.  Learning-based linguistic indexing of pictures with 2--d MHMMs , 2002, MULTIMEDIA '02.

[6]  William I. Grosky,et al.  Negotiating the semantic gap: from feature maps to semantic landscapes , 2001, Pattern Recognit..

[7]  David A. Forsyth,et al.  Matching Words and Pictures , 2003, J. Mach. Learn. Res..

[8]  Joo-Hwee Lim,et al.  Home Photo Content Modeling for Personalized Event-Based Retrieval , 2003, IEEE Multim..