The Influence of Context on Dialogue Act Recognition

This article presents a deep analysis of the influence of context information on dialogue act recognition. We performed experiments on the annotated subsets of three different corpora: the widely explored Switchboard Dialog Act Corpus, as well as the unexplored LEGO and Cambridge Restaurant corpora. In contrast with previous work, especially in what concerns the Switchboard corpus, we used an event-based classification approach, using SVMs, instead of the more common sequential approaches, such as HMMs. We opted for such an approach so that we could control the amount of provided context information and, thus, explore its range of influence. Our base features consist of n-grams, punctuation, and wh-words. Context information is obtained from previous utterances and provided in three ways -- n-grams, n-grams tagged with relative position, and dialogue act classifications. A comparative study was conducted to evaluate the performance of the three approaches. From it, we were able to assess the importance of context information on dialogue act recognition, as well as its range of influence for each of the three selected representations. In addition to the conclusions originated by the analysis, this work also produced results that advance the state-of-the-art, especially considering previous work on the Switchboard corpus. Furthermore, since, to our knowledge, the remaining datasets had not been previously explored for this task, our experiments can be used as baselines for future work on those corpora.

[1]  Elizabeth Shriberg,et al.  Switchboard SWBD-DAMSL shallow-discourse-function annotation coders manual , 1997 .

[2]  Fabio Pianesi,et al.  NESPOLE!'s Multilingual and Multimodal Corpus , 2002, LREC.

[3]  Isabelle Guyon,et al.  An Introduction to Variable and Feature Selection , 2003, J. Mach. Learn. Res..

[4]  Anne H. Anderson,et al.  The DCIEM map task corpus: spontaneous dialogue under sleep deprivation and drug treatment , 1996, ICSLP.

[5]  Helen Wright,et al.  Automatic utterance type detection using suprasegmental features , 1998, ICSLP.

[6]  Corinna Cortes,et al.  Support-Vector Networks , 1995, Machine Learning.

[7]  M K Tanenhaus,et al.  Functional clauses and sentence segmentation. , 1978, Journal of speech and hearing research.

[8]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[9]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[10]  Elizabeth Shriberg,et al.  The ICSI Meeting Recorder Dialog Act (MRDA) Corpus , 2004, SIGDIAL Workshop.

[11]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[12]  Kôiti Hasida,et al.  ISO 24617-2: A semantically-based standard for dialogue annotation , 2012, LREC.

[13]  Jean Carletta,et al.  The AMI Meeting Corpus: A Pre-announcement , 2005, MLMI.

[14]  Zoubin Ghahramani,et al.  Learning Dynamic Bayesian Networks , 1997, Summer School on Neural Networks.

[15]  Harry Bunt,et al.  The annotation of the switchboard corpus with the new ISO standard for dialogue act analysis , 2012 .

[16]  Shrikanth S. Narayanan,et al.  Combining lexical, syntactic and prosodic cues for improved online dialog act tagging , 2009, Comput. Speech Lang..

[17]  Alon Lavie,et al.  Domain Specific Speech Acts for Spoken Language Translation , 2003, SIGDIAL Workshop.

[18]  Ricardo Ribeiro,et al.  Mapping the Dialog Act Annotations of the LEGO Corpus into the Communicative Functions of ISO 24617-2 , 2016, ArXiv.

[19]  Fredrik Olsson,et al.  Active Learning for Dialogue Act Classification , 2011, INTERSPEECH.

[20]  Thomas G. Dietterich What is machine learning? , 2020, Archives of Disease in Childhood.

[21]  Matthew Stone,et al.  Communicative Intentions and Conversational Processes in Human-Human and Human-Computer Dialogue , 2002 .

[22]  Steve Renals,et al.  DBN Based Joint Dialogue Act Recognition of Multiparty Meetings , 2007, 2007 IEEE International Conference on Acoustics, Speech and Signal Processing - ICASSP '07.

[23]  Matthias Abt Verbmobil A Translation System For Face To Face Dialog , 2016 .

[24]  John C. Platt,et al.  Fast training of support vector machines using sequential minimal optimization, advances in kernel methods , 1999 .

[25]  Anton Nijholt,et al.  Dialogue Act Recognition with Bayesian Networks for Dutch Dialogues , 2002, SIGDIAL Workshop.

[26]  N. Metropolis,et al.  The Monte Carlo method. , 1949 .

[27]  Maxine Eskénazi,et al.  Doing research on a deployed spoken dialogue system: one year of let's go! experience , 2006, INTERSPEECH.

[28]  Laura Schweitzer,et al.  Advances In Kernel Methods Support Vector Learning , 2016 .

[29]  Srinivas Bangalore,et al.  Supertagging: An Approach to Almost Parsing , 1999, CL.

[30]  Chih-Jen Lin,et al.  LIBLINEAR: A Library for Large Linear Classification , 2008, J. Mach. Learn. Res..

[31]  David W. Aha,et al.  Instance-Based Learning Algorithms , 1991, Machine Learning.

[32]  Peter E. Hart,et al.  Nearest neighbor pattern classification , 1967, IEEE Trans. Inf. Theory.

[33]  Eduardo Lleida,et al.  Design and acquisition of a telephone spontaneous speech dialogue corpus in Spanish: DIHANA , 2006, LREC.

[34]  Andreas Stolcke,et al.  Dialogue act modeling for automatic tagging and recognition of conversational speech , 2000, CL.

[35]  J. Ross Quinlan,et al.  C4.5: Programs for Machine Learning , 1992 .

[36]  L. Baum,et al.  Statistical Inference for Probabilistic Functions of Finite State Markov Chains , 1966 .

[37]  Eric Brill,et al.  Transformation-Based Error-Driven Learning and Natural Language Processing: A Case Study in Part-of-Speech Tagging , 1995, CL.

[38]  Wolfgang Minker,et al.  A Parameterized and Annotated Spoken Dialog Corpus of the CMU Let’s Go Bus Information System , 2012, LREC.

[39]  Frank Rosenblatt,et al.  PRINCIPLES OF NEURODYNAMICS. PERCEPTRONS AND THE THEORY OF BRAIN MECHANISMS , 1963 .

[40]  Nir Friedman,et al.  Bayesian Network Classifiers , 1997, Machine Learning.

[41]  Michael Ferguson,et al.  Automatic Extraction of Cue Phrases for Cross-Corpus Dialogue Act Classification , 2010, COLING.

[42]  J.A. Anderson,et al.  Neurocomputing: Foundations of Research@@@Neurocomputing 2: Directions for Research , 1992 .

[43]  John J. Godfrey,et al.  SWITCHBOARD: telephone speech corpus for research and development , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[44]  Matthew Henderson,et al.  The Second Dialog State Tracking Challenge , 2014, SIGDIAL Conference.

[45]  A. Koller,et al.  Speech Acts: An Essay in the Philosophy of Language , 1969 .

[46]  Ken Samuel,et al.  Dialogue Act Tagging with Transformation-Based Learning , 1998, ACL.

[47]  Harry Bunt,et al.  The DialogBank: dialogues with interoperable annotations , 2016, Language Resources and Evaluation.

[48]  M. Rotaru Dialog Systems ” class , Spring 2002-TERM PROJECT-Dialog Act Tagging using Memory-Based Learning , 2007 .

[49]  F. Wilcoxon Individual Comparisons by Ranking Methods , 1945 .