Techniques for utterance disambiguation in a human-computer dialogue system

Disambiguating an utterance occurring in a dialogue context is a complex task, which requires input from many different sources of information— some syntactic, some semantic, and some pragmatic. The central question addressed by this thesis is how to integrate data sources for utterance disambiguation within a bilingual human-computer dialogue system. First, a simple scheme is proposed for classifying disambiguation data sources; then this scheme is used to develop a method for combining data sources in a principled manner. Next, several actual sources of disambiguation data are explored; each is fitted into the previously described implementation framework. In particular, a probabilistic grammar is developed and augmented using novel techniques to increase its performance with respect to the local dialogue context. In a dialogue system, ambiguities which cannot be resolved automatically can be clarified by asking the user what was meant. This thesis also presents a model of clarification subdialogues which is integrated within the utterance disambiguation framework. This is followed by a brief treatment of how user errors may be accommodated, and how this process can also be fitted—conceptually and in implementation—into the previously described disambiguation framework. Finally, I describe the details of implementing these techniques within an existing dialogue system, and give examples demonstrating their effectiveness.

[1]  Alistair Knott,et al.  Syntactic disambiguation using presupposition resolution , 2003, ALTA.

[2]  H. E. Pople,et al.  Internist-I, an Experimental Computer-Based Diagnostic Consultant for General Internal Medicine , 1982 .

[3]  A. Knott,et al.  A DRT-based framework for presuppositions in dialogue management , 2002 .

[4]  Taylor L. Booth,et al.  Probabilistic Representation of Formal Languages , 1969, SWAT.

[5]  Robert A. MacLachlan CMU Common Lisp User''s Manual , 1992 .

[6]  Michael Collins,et al.  Three Generative, Lexicalised Models for Statistical Parsing , 1997, ACL.

[7]  Wolfgang Wahlster,et al.  Verbmobil: Foundations of Speech-to-Speech Translation , 2000, Artificial Intelligence.

[8]  Dov M. Gabbay,et al.  Handbook of Philosophical Logic , 2002 .

[9]  Stephan Oepen,et al.  Parse Disambiguation for a Rich HPSG Grammar , 2002 .

[10]  D. Swinney Lexical access during sentence comprehension: (Re)consideration of context effects , 1979 .

[11]  Henry Fuchs,et al.  Predetermining visibility priority in 3-D scenes (Preliminary Report) , 1979, SIGGRAPH '79.

[12]  John D. Lafferty,et al.  Inducing Features of Random Fields , 1995, IEEE Trans. Pattern Anal. Mach. Intell..

[13]  David Traum,et al.  Computational Models of Grounding in Collaborative Systems , 1999 .

[14]  Alistair Knott,et al.  Multi-agent Human-Machine Dialogue: Issues in Dialogue Management and Referring Expression Semantics , 2004, PRICAI.

[15]  Eugene Charniak,et al.  Statistical Parsing with a Context-Free Grammar and Word Statistics , 1997, AAAI/IAAI.

[16]  Mark Johnson,et al.  Estimators for Stochastic “Unification-Based” Grammars , 1999, ACL.

[17]  Candace L. Sidner,et al.  Attention, Intentions, and the Structure of Discourse , 1986, CL.

[18]  F ChenStanley,et al.  An Empirical Study of Smoothing Techniques for Language Modeling , 1996, ACL.

[19]  Frank Keller,et al.  Using the Web to Obtain Frequencies for Unseen Bigrams , 2003, CL.

[20]  Chris Brew,et al.  Stochastic HPSG , 1995, EACL.

[21]  Mark James,et al.  An Artificial Intelligence Approach to Language Instruction , 1978, Artif. Intell..

[22]  Milan Kundera,et al.  The Book of Laughter and Forgetting , 1979 .

[23]  Rashmi Prasad,et al.  The Penn Discourse Treebank , 2004, LREC.

[24]  Siobhan Chapman Logic and Conversation , 2005 .

[25]  George Kingsley Zipf,et al.  Human behavior and the principle of least effort , 1949 .

[26]  Alistair Knott,et al.  A framework for utterance disambiguation in dialogue , 2004, ALTA.

[27]  Michael J. Fischer,et al.  The String-to-String Correction Problem , 1974, JACM.

[28]  Steven Abney,et al.  Statistical Methods and Linguistics , 2002 .

[29]  Stanley F. Chen,et al.  An Empirical Study of Smoothing Techniques for Language Modeling , 1996, ACL.

[30]  James L. Alty,et al.  Expert Systems: Concepts and Examples , 1984 .

[32]  Steven P. Abney Stochastic Attribute-Value Grammars , 1996, CL.

[33]  Ivan A. Sag,et al.  Book Reviews: Head-driven Phrase Structure Grammar and German in Head-driven Phrase-structure Grammar , 1996, CL.

[34]  J. Thurber The Thurber Carnival , 2020 .

[35]  Ronald Rosenfeld,et al.  A maximum entropy approach to adaptive statistical language modelling , 1996, Comput. Speech Lang..

[36]  Christopher D. Manning,et al.  LinGO Redwoods A Rich and Dynamic Treebank for HPSG , 2002 .

[37]  Brendan McCane,et al.  Language-driven nonverbal communication in a bilingual conversational agent , 2003, Proceedings 11th IEEE International Workshop on Program Comprehension.

[38]  James F. Allen,et al.  A Plan Recognition Model for Clarification Subdialogues , 1984, ACL.

[39]  Ted Pedersen,et al.  Complementarity of lexical and simple syntactic features: The SyntaLex approach to Senseval-3 , 2004, SENSEVAL@ACL.

[40]  Peter Jackson,et al.  Introduction to expert systems , 1986 .

[41]  A. Knott,et al.  Analysis of Errors in the Writing of First Year University Students of Maori , 2004 .

[42]  Stephan Oepen,et al.  Parse Selection on the Redwoods Corpus: 3rd Growth Results , 2003 .

[43]  Dan Flickinger,et al.  On building a more effcient grammar by exploiting types , 2000, Natural Language Engineering.

[44]  Larry Wall,et al.  Programming Perl , 1991 .

[45]  Alex Lascarides,et al.  Discourse Relations and Defeasible Knowledge , 1991, ACL.

[46]  Hans Kamp,et al.  Discourse Representation Theory: What it is and Where it Ought to Go , 1988, Natural Language at the Computer.

[47]  Norman K. Sondheimer,et al.  A Rule-Based Approach to Ill-Formed Input , 1980, COLING.

[48]  Karen Kukich,et al.  Techniques for automatically correcting words in text , 1992, CSUR.

[49]  Wolfgang Menzel,et al.  Error Diagnosis for Language Learning Systems , 1999 .

[50]  Guy L. Steele,et al.  Common Lisp the Language , 1984 .

[51]  Andreas Stolcke,et al.  Dialogue act modeling for automatic tagging and recognition of conversational speech , 2000, CL.

[52]  Dan Flickinger,et al.  Minimal Recursion Semantics: An Introduction , 2005 .

[53]  E. Schegloff,et al.  A simplest systematics for the organization of turn-taking for conversation , 1974 .

[54]  James H. Martin,et al.  Speech and language processing: an introduction to natural language processing, computational linguistics, and speech recognition, 2nd Edition , 2000, Prentice Hall series in artificial intelligence.

[55]  Achim G. Hoffmann,et al.  A New Measure for Extracting Semantically Related Words , 2004, ALTA.

[57]  Laurence Sterne,et al.  The Life and Opinions of Tristram Shandy , 2008, Medicine and Literature.

[58]  W. Kirby,et al.  Kalevala : the land of the heroes , 2018 .

[59]  Dan Flickinger,et al.  An Open Source Grammar Development Environment and Broad-coverage English Grammar Using HPSG , 2000, LREC.