Query Expansion by Text and Image Features in Image Retrieval

We present a two-pass image retrieval system in which retrieval techniques for text and image documents are combined in a novel approach. In the first pass, the text-based initial query is matched against the text captions of the images in the database to obtain the initial retrieved set. In the second pass, text and image features obtained from this initial retrieved set are used to expand the initial query. Additional images from the database are then retrieved based on the expanded query. The image features that we have used are color histograms, DC coefficients from the discrete cosine transform, and two texture features: multiresolution simultaneous autoregressive model and local binary pattern. These are low-level statistical image features that can be easily computed. Extensive experiments have been performed on 1019 color pictures of mixed variety with captions, relevance judgments and queries supplied by a national archives agency. Objective precision-recall results have been obtained with various combinations of text and image features. The results show that the image features do not perform well when used on their own. However, when image features are used in query expansion, they increase the average precision more significantly than text annotations. Moreover, these findings are valid at all precision levels and are not sensitive to the image feature acquisition parameters.

[1]  Sharon Flank,et al.  PhotoFile: a digital library for image retrieval , 1995, Proceedings of the International Conference on Multimedia Computing and Systems.

[2]  Rosalind W. Picard Toward a Visual Thesaurus , 1995, MIRO.

[3]  Alan F. Smeaton,et al.  Experiments on using semantic distances between words in image caption retrieval , 1996, SIGIR '96.

[4]  George A. Miller,et al.  Introduction to WordNet: An On-line Lexical Database , 1990 .

[5]  Stephen E. Robertson,et al.  Okapi at TREC-3 , 1994, TREC.

[6]  Peter Willett,et al.  The Limitations of Term Co-Occurrence Data for Query Expansion in Document Retrieval Systems , 1991 .

[7]  T. Poggio,et al.  Ill-Posed Problems and Regularization Analysis in Early Vision , 1984 .

[8]  Alex Pentland,et al.  Photobook: tools for content-based manipulation of image databases , 1994, Electronic Imaging.

[9]  Donna K. Harman The First Text REtrieval Conference (TREC-1), Rockville, MD, USA, 4-6 November 1992 , 1993, Inf. Process. Manag..

[10]  Kui-Lam Kwok,et al.  TREC-4 Ad-Hoc, Routing Retrieval and Filtering Experiments using PIRCS , 1995, TREC.

[11]  Harpreet S. Sawhney,et al.  Compact Representations of Videos Through Dominant and Multiple Motion Estimation , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[12]  Ellen M. Voorhees,et al.  Query expansion using lexical-semantic relations , 1994, SIGIR '94.

[13]  Alan F. Smeaton,et al.  The Retrieval Effects of Query Expansion on a Feedback Document Retrieval System , 1983, Comput. J..

[14]  Christos Faloutsos,et al.  QBIC project: querying images by content, using color, texture, and shape , 1993, Electronic Imaging.

[15]  Hinrich Schütze,et al.  Xerox Site Report: Four TREC-4 Tracks , 1995, TREC.

[16]  Sung-Hyon Myaeng,et al.  Image organization and retrieval with automatically constructed feature vectors , 1996, SIGIR '96.

[17]  Rakesh Mohan,et al.  Text-based search of TV news stories , 1996, Other Conferences.

[18]  Hanan Samet,et al.  MARCO: MAp Retrieval by COntent , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[19]  James Allan,et al.  Recent Experiments with INQUERY , 1995, TREC.

[20]  B. S. Manjunath,et al.  Texture Features for Browsing and Retrieval of Image Data , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[21]  Anil K. Jain,et al.  Texture classification and segmentation using multiresolution simultaneous autoregressive models , 1992, Pattern Recognit..

[22]  C.-C. Jay Kuo,et al.  Image retrieval based on JPEG compressed data , 1996, Other Conferences.

[23]  Li WangDong-Chen He,et al.  Texture classification using texture spectrum , 1990, Pattern Recognit..

[24]  Gerald Salton,et al.  Automatic text processing , 1988 .

[25]  Tomek Strzalkowski,et al.  Natural Language Information Retrieval: TREC-8 Report , 1994, TREC.

[26]  Ellen M. Voorhees,et al.  Siemens TREC-4 Report: Further Experiments with Database Merging , 1995, TREC.

[27]  Matti Pietikäinen,et al.  Performance evaluation of texture measures with classification based on Kullback discrimination of distributions , 1994, Proceedings of 12th International Conference on Pattern Recognition.

[28]  Syin Chan,et al.  Multilingual information retrieval system , 1996, Other Conferences.

[29]  M. E. Maron,et al.  Full-text information retrieval: Further analysis and clarification , 1990, Inf. Process. Manag..

[30]  Juyang Weng,et al.  Using Discriminant Eigenfeatures for Image Retrieval , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[31]  Anil K. Jain,et al.  A Real-Time Matching System for Large Fingerprint Databases , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[32]  Donna Harman,et al.  The First Text REtrieval Conference (TREC-1) , 1993 .

[33]  Rohini K. Srihari,et al.  Automatic Indexing and Content-Based Retrieval of Captioned Images , 1995, Computer.

[34]  Michael Shneier,et al.  Exploiting the JPEG Compression Scheme for Image Retrieval , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[35]  Hsinchun Chen,et al.  A Parallel Computing Approach to Creating Engineering Concept Spaces for Semantic Retrieval: The Illinois Digital Library Initiative Project , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[36]  Eugene J. Guglielmo,et al.  Exploiting Captions in Retrieval of Multimedia Data , 1993, Inf. Process. Manag..

[37]  Arding Hsu,et al.  Image processing on compressed data for large video databases , 1993, MULTIMEDIA '93.

[38]  David Doermann,et al.  Archiving, indexing, and retrieval of video in the compressed domain , 1996, Other Conferences.

[39]  G. Healey,et al.  Retrieving Multispectral Satellite Images Using Physics-Based Invariant Representations , 1996, IEEE Trans. Pattern Anal. Mach. Intell..

[40]  Chris Buckley,et al.  New Retrieval Approaches Using SMART: TREC 4 , 1995, TREC.