Mapping the Dialog Act Annotations of the LEGO Corpus into the Communicative Functions of ISO 24617-2

In this paper, we present strategies for mapping the dialog act annotations of the LEGO corpus into the communicative functions of the ISO 24617-2 standard. Using these strategies, we obtained an additional 347 dialogs annotated according to the standard. This is particularly important because, given how recent the standard is, little data annotated according to it exists. Furthermore, these dialogs come from a corpus that is widely explored in dialog-related tasks. However, the corpus's original dialog act annotations have been neglected because they are highly domain-dependent, which makes them of little use outside the context of the corpus. Thus, through our mapping process, we both obtain more data annotated according to a recent standard and provide useful dialog act annotations for a corpus that is widely explored in dialog research.
