Natural-language retrieval of images based on descriptive captions

We describe a prototype intelligent information retrieval system that uses natural-language understanding to efficiently locate captioned data. Multimedia data generally require captions to explain their features and significance. Such descriptive captions often rely on long nominal compounds (strings of consecutive nouns) which create problems of disambiguating word sence. In our system, captions and user queries are parsed and interpreted to produce a logical form using a detailed theory of the meaning of nominal compounds. A fine-grain match can then compare the logical form of the query to the logical forms for each caption. To improve system efficiency, we first perform a coarse-grain match with index files, using nouns and verbs extracted from the query. Our experiments with randomly selected queries and captions from an existing image library show an increase of 30% in precision and 50% in recall over the keyphrase approach currently used. Our processing times have a median of seven seconds as compared to eight minutes for the existing system, and our system is much easier to use.

[1]  Michael McGill,et al.  Introduction to Modern Information Retrieval , 1983 .

[2]  Rohini K. Srihari,et al.  Combining Linguistic and Pictorial Information: Using Captions to Interpret Newspaper Photographs , 1989, SNePS.

[3]  Janet L. Kolodner Indexing and retrieval strategies for natural language fact retrieval , 1983, TODS.

[4]  Neil C. Rowe Inferring Depictions in Natural-Language Captions for Efficient Access to Picture Data , 1994, Inf. Process. Manag..

[5]  Helen M. Brooks,et al.  Plexus-the expert system for referral , 1987, Inf. Process. Manag..

[6]  M. Yokota,et al.  Language-Picture Question-Answering Through Common Semantic Representation And its Application to the World of Weather Report , 1984 .

[7]  Gary Orwig,et al.  Attacking the information access problem with expert systems , 1991 .

[8]  Boris Katz,et al.  Using English for Indexing and Retrieving , 1991 .

[9]  John H Dickert Defense Technical Information Center thesaurus , 1996 .

[10]  C. J. van Rijsbergen,et al.  SILOL: A simple logical-linguistic document retrieval system , 1990, Inf. Process. Manag..

[11]  Alan F. Smeaton,et al.  Progress in the Application of Natural Language Processing to Information Retrieval Tasks , 1992, Comput. J..

[12]  Tim Finin Constraining the interpretation of nominal compounds in a limited context , 1986 .

[13]  W. Bruce Croft,et al.  Lexical ambiguity and information retrieval , 1992, TOIS.

[14]  James F. Allen Natural language understanding , 1987, Bejnamin/Cummings series in computer science.

[15]  Marvin Minsky,et al.  A framework for representing knowledge , 1974 .

[16]  Mark Sanderson,et al.  Conceptual Information Retrieval – A Case Study in Adaptive Partial Parsing , 1992 .

[17]  W. Bruce Croft,et al.  Interpreting nominal compounds for information retrieval , 1990, Inf. Process. Manag..

[18]  Alexander Dekhtyar,et al.  Information Retrieval , 2018, Lecture Notes in Computer Science.

[19]  Eugene J. Guglielmo Intelligent Information Retrieval for a Multimedia Database Using Captions , 1992 .

[20]  Drew McDermott,et al.  Introduction to artificial intelligence , 1986, Addison-Wesley series in computer science.

[21]  Lisa F. Rau,et al.  Knowledge organization and access in a conceptual information system , 1987, Inf. Process. Manag..

[22]  Stephanie W. Haas A feasibility study of the case hierarchy model for the construction and porting of natural language interfaces , 1990, Inf. Process. Manag..

[23]  Christine A. Montgomery,et al.  The DBG message understanding system , 1989, [1989] Proceedings. The Annual AI Systems in Government Conference.

[24]  Jean Tague-Sutcliffe,et al.  The Pragmatics of Information Retrieval Experimentation Revisited , 1997, Inf. Process. Manag..