Annotation-based Document Retrieval with Four-Valued Probabilistic Datalog

The COLLATE system (collaboratory for annotation, indexing and retrieval of digitized historical archive material) provides film researchers with a collaborative environment in which historic documents about European films can be analysed, interpreted and discussed, using nested annotations and discourse structure relations among them. Annotations are metadata, and annotation threads form a hypertext containing positive and negative links, constituting a certain kind of context exploitable for document retrieval. In this paper, we discuss a solution for using annotations for information retrieval. To exploit annotation threads which consist of nested annotations and typed links between them, an annotation-based retrieval approach should have to cope with negative and contradictory statements. The nested annotation retrieval approach (NARA) is an approach addressing these issues. Based on this, we present NARAlog, an implementation using four-valued probabilistic datalog (FVPD), able to perform an in-depth analysis of annotation threads and to deal with contradictory statements.

[1]  Editors , 1986, Brain Research Bulletin.

[2]  C. J. van Rijsbergen,et al.  A Non-Classical Logic for Information Retrieval , 1997, Comput. J..

[3]  H. P. Frei,et al.  The use of semantic links in hypertext information retrieval , 1995 .

[4]  Hans-Peter Frei,et al.  The Use of Semantic Links in Hypertext Information Retrieval , 1995, Inf. Process. Manag..

[5]  Maristella Agosti,et al.  An Overview of Hypertext , 1996 .

[6]  Maristella Agosti,et al.  Information Retrieval and Hypertext , 1996, Information Retrieval and Hypertext.

[7]  Norbert Fuhr,et al.  Retrieval of complex objects using a four-valued logic , 1996, SIGIR '96.

[8]  Frank M. Shipman,et al.  Hypertext paths and the World-Wide Web: experiences with Walden's Paths , 1997, HYPERTEXT '97.

[9]  Catherine C. Marshall,et al.  Annotation: from paper books to the digital library , 1997, DL '97.

[10]  Robert Wilensky,et al.  Multivalent Annotations , 1997, ECDL.

[11]  Catherine C. Marshall,et al.  Toward an ecology of hypertext annotation , 1998, HYPERTEXT '98.

[12]  Giuseppe Attardi,et al.  Automatic Web Page Categorization by Link and Context Analysis , 1999 .

[13]  Bill N. Schilit,et al.  From reading to retrieval: freeform ink annotations as queries , 1999, SIGIR '99.

[14]  Michael A. Arbib,et al.  Annotation technology , 1999, Int. J. Hum. Comput. Stud..

[15]  Laurent Denoue,et al.  An annotation tool for Web browsers and its applications to information retrieval , 2000, RIAO.

[16]  David M. Nichols,et al.  DEBORA: Developing an Interface to Support Collaboration in a Digital Library , 2000, ECDL.

[17]  Panos Constantopoulos,et al.  Research and Advanced Technology for Digital Libraries , 2001, Lecture Notes in Computer Science.

[18]  Catherine C. Marshall,et al.  From personal to shared annotations , 2002, CHI Extended Abstracts.

[19]  Ulrich Thiel,et al.  How to Incorporate Collaborative Discourse in Cultural Digital Libraries , 2002, SAAKM@ECAI.

[20]  Eric Prud'hommeaux,et al.  Annotea: an open RDF infrastructure for shared Web annotations , 2002, Comput. Networks.

[21]  Daniel Marcu,et al.  An Unsupervised Approach to Recognizing Discourse Relations , 2002, ACL.

[22]  Nicola Ferro,et al.  Annotations: Enriching a Digital Library , 2003, ECDL.

[23]  Michelangelo Ceci,et al.  Document-Centered Collaboration for Scholars in the Humanities - The COLLATE System , 2003, ECDL.

[24]  Nicola Ferro,et al.  Annotations in Digital Libraries and Collaboratories - Facets, Models and Usage , 2004, ECDL.