论文信息 - Combining text and image information in content-based retrieval

Combining text and image information in content-based retrieval

This research explores the interaction of textual and photographic information in an integrated text/image database environment. By understanding the caption accompanying a picture, we are able to extract information useful in (i) retrieving the picture and (ii) directing an image interpretation system to identify relevant objects (in this case, faces) in the picture. The latter constitutes a powerful technique for automatically indexing images. In cases where images are not accompanied by text, it is far easier to manually add a line of descriptive text than to manually truth the images. A multi-stage system, PICTION, which uses captions to identify human faces in an accompanying photograph has been developed. We discuss the use of PICTION's output in content-based retrieval of images to satisfy focus of attention in queries.

Rohini K. Srihari

[1] Rama Chellappa,et al. Human and machine recognition of faces: a survey , 1995, Proc. IEEE.

[2] Venu Govindaraju,et al. Zero crossings of a non-orthogonal wavelet transform for object location , 1995, Proceedings., International Conference on Image Processing.

[3] Anil S. Chakravarthy,et al. Representing Information Need with Semantic Relations , 1994, COLING.

[4] Venu Govindaraju,et al. A Computational Model for Face Location Based on Cognitive Principles , 1992, AAAI.

[5] Rohini K. Srihari. Use of collateral text in understanding photos in documents , 1994, Other Conferences.

[6] Debra T. Burhans,et al. Visual Semantics: Extracting Visual information from Text Accompanying Pictures , 1994, AAAI.