Uniform Representation of Content and Structure for structured document retrieval

Documents often display a hierarchical structure. For example, a SGML document contains a title, several sections, which themselves contain paragraphs. In this paper, we develop a formal model to represent in a uniform manner structured documents by their content and structure. As a result, querying structured documents can be done with respect to their content, their structure, or both. The model is based on a possible worlds approach, modal operators and uncertainty distributions.

[1]  Ross Wilkinson,et al.  Effective retrieval of structured documents , 1994, SIGIR '94.

[2]  Guido Moerkotte,et al.  Querying documents in object databases , 1997, International Journal on Digital Libraries.

[3]  James Allan,et al.  Approaches to passage retrieval in full text information systems , 1993, SIGIR.

[4]  M. Lalmas,et al.  A dempster-shafer indeing for structured document retrieval: implementation and experiments on a web museum collection , 1999 .

[5]  E. Frisse Mark,et al.  Searching for information in a hypertext medical handbook , 1988 .

[6]  Sung-Hyon Myaeng,et al.  A flexible model for retrieval of SGML documents , 1998, SIGIR '98.

[7]  Norbert Fuhr,et al.  Retrieval of complex objects using a four-valued logic , 1996, SIGIR '96.

[8]  Mounia Lalmas,et al.  Representing and retrieving structured documents using the Dempster-Shafer theory of evidence: modelling and evaluation , 1998, J. Documentation.

[9]  Glenn Shafer,et al.  A Mathematical Theory of Evidence , 2020, A Mathematical Theory of Evidence.

[10]  Mounia Lalmas,et al.  A Dempster-Shafer indexing for the focused retrieval of a hierarchically structured document space: Implementation and experiments on a web museum collection , 2000, RIAO.

[11]  Mounia Lalmas,et al.  Dempster-Shafer's theory of evidence applied to structured documents: modelling uncertainty , 1997, SIGIR '97.

[12]  Yves Chiaramella,et al.  An Integrated Model for Hypermedia and Information Retrieval , 1996 .