Reference Resolution over a Restricted Domain: References to Documents

This article studies the resolution of references made by speakers to documents discussed during a meeting. The focus is on transcribed recordings of press review meetings, in French. After an overview of the required framework for reference resolution (specification of the task, data annotation, and evaluation procedure), we propose, analyze, and evaluate an algorithm for the resolution of references to documents (ref2doc) based on anaphora tracking and context matching. Finally, we discuss applications to speech-to-document alignment and, more generally, to meeting processing and retrieval.
