An Empirical Approach to Temporal Reference Resolution

Scheduling dialogs, in which people negotiate the times of appointments, are common in everyday life. This paper reports the results of an in-depth empirical investigation of resolving explicit temporal references in scheduling dialogs. The work has four phases: data annotation and evaluation, model development, system implementation and evaluation, and model evaluation and analysis. The system and model were developed primarily on one data set and later applied to a much more complex data set to assess how well the model generalizes for the task. Several types of empirical methods are applied to pinpoint the strengths and weaknesses of the approach. Detailed annotation instructions were developed and an intercoder reliability study was performed, showing that naive annotators can reliably perform the targeted annotations. A fully automatic system has been developed and evaluated on unseen test data, with good results on both data sets. We adopt a pure realization of a recency-based focus model to identify precisely when it is and is not adequate for the task being addressed. In addition to system results, an in-depth evaluation of the model itself is presented, based on detailed manual annotations. The results show that few errors are attributable specifically to the focus model, and that the set of anaphoric relations defined in the model is low in ambiguity for both data sets.
