Modelling the Retrieval of Structured Documents Containing Texts and Images

We present a model for complex documents possibly consisting of a hierarchically structured set of images or texts. Documents are represented both at the form level (as sets of physical features of the representing objects), at the content level (as sets of properties of the represented entities), and at the structure level. A uniform and powerful query language allows queries to be issued that transparently combine features pertaining to form, content and structure alike. Queries are expressions of a (fuzzy) logical language. While that part of the query that pertains to (medium-independent) content is “directly” processed by an inferential engine, that part that pertains to (medium-dependent) form is entrusted to specialised document processing procedures linked to the logical language by a procedural attachment mechanism. The model thus combines the power of state-of-the-art document processing techniques with the advantages of a clean, logically defined framework for understanding multimedia document retrieval.

[1]  Anthony G. Cohn,et al.  Calculi for Qualitative Spatial Reasoning , 1996, AISMC.

[2]  Umberto Straccia,et al.  A relevance terminological logic for information retrieval , 1996, SIGIR '96.

[3]  E. Dubois,et al.  Digital picture processing , 1985, Proceedings of the IEEE.

[4]  Sukhamay Kundu,et al.  A Sound and Complete Fuzzy Logic System Using Zadeh's Implication Operator , 1996, ISMIS.

[5]  Gert Smolka,et al.  Attributive Concept Descriptions with Complements , 1991, Artif. Intell..

[6]  Amarnath Gupta,et al.  Virage image search engine: an open framework for image management , 1996, Electronic Imaging.

[7]  Alan F. Smeaton,et al.  Experiments on using semantic distances between words in image caption retrieval , 1996, SIGIR '96.

[8]  Serge Abiteboul,et al.  Foundations of Databases , 1994 .

[9]  Carlo Meghini An image retrieval model based on classical logic , 1995, SIGIR '95.

[10]  Alexander Borgida,et al.  Description Logics in Data Management , 1995, IEEE Trans. Knowl. Data Eng..

[11]  Ricardo A. Baeza-Yates,et al.  A language for queries on structure and contents of textual databases , 1995, SIGIR '95.

[12]  K. Wakimoto,et al.  Efficient and Effective Querying by Image Content , 1994 .

[13]  Franz Baader,et al.  A Scheme for Integrating Concrete Domains into Concept Languages , 1991, IJCAI.

[14]  Vijay V. Raghavan,et al.  Content-Based Image Retrieval Systems - Guest Editors' Introduction , 1995, Computer.

[15]  Umberto Straccia,et al.  A model of information retrieval based on a terminological logic , 1993, SIGIR.

[16]  Vijay V. Raghavan,et al.  Design and evaluation of algorithms for image retrieval by spatial similarity , 1995, TOIS.

[17]  Neil C. Rowe,et al.  Natural-language retrieval of images based on descriptive captions , 1996, TOIS.