Japanese Dialogue Corpus of Information Navigation and Attentive Listening Annotated with Extended ISO-24617-2 Dialogue Act Tags

Large-scale dialogue data annotated with dialogue states is necessary to model a natural conversation with machines. However, largescale conventional dialogue corpora are mainly built for specified tasks (e.g., task-oriented systems for restaurant or bus information navigation) with specially designed dialogue states. Text-chat based dialogue corpora have also been built due to the growth of social communication through the internet; however, most of them do not reflect dialogue behaviors in face-to-face conversation, including backchannelings or interruptions. In this paper, we try to build a corpus that covers a wider range of dialogue tasks than existing task-oriented systems or text-chat systems, by transcribing face-to-face dialogues held in natural conversational situations in tasks of information navigation and attentive listening. The corpus is recorded in Japanese and annotated with an extended ISO-24617-2 dialogue act tag-set, which is defined to see behaviors in natural conversation. The developed data can be used to build a dialogue model based on the ISO-24617-2 dialogue act tags.

[1]  Satoshi Nakamura,et al.  Multimodal interaction data between clinical psychologists and students for attentive listening modeling , 2016, 2016 Conference of The Oriental Chapter of International Committee for Coordination and Standardization of Speech Databases and Assessment Techniques (O-COCOSDA).

[2]  Antoine Raux,et al.  The Dialog State Tracking Challenge , 2013, SIGDIAL Conference.

[3]  David Vandyke,et al.  Policy committee for adaptation in multi-domain spoken dialogue systems , 2015, 2015 IEEE Workshop on Automatic Speech Recognition and Understanding (ASRU).

[4]  Alexander I. Rudnicky,et al.  Expanding the Scope of the ATIS Task: The ATIS-3 Corpus , 1994, HLT.

[5]  Graham Neubig,et al.  Pointwise Prediction for Robust, Adaptable Japanese Morphological Analysis , 2011, ACL.

[6]  Satoshi Nakamura,et al.  Information Navigation System with Discovering User Interests , 2017, SIGDIAL Conference.

[7]  Mark G. Core,et al.  Coding Dialogs with the DAMSL Annotation Scheme , 1997 .

[8]  Kôiti Hasida,et al.  ISO 24617-2: A semantically-based standard for dialogue annotation , 2012, LREC.

[9]  J. Fleiss,et al.  Statistical methods for rates and proportions , 1973 .

[10]  Yannis Stylianou,et al.  Single-Model Multi-domain Dialogue Management with Deep Learning , 2017, IWSDS.

[11]  Rafael E. Banchs,et al.  The fifth dialog state tracking challenge , 2016, 2016 IEEE Spoken Language Technology Workshop (SLT).

[12]  B. Everitt,et al.  Statistical methods for rates and proportions , 1973 .

[13]  Tatsuya Kawahara,et al.  News Navigation System Based on Proactive Dialogue Strategy , 2015, IWSDS.

[14]  John J. Godfrey,et al.  SWITCHBOARD: telephone speech corpus for research and development , 1992, [Proceedings] ICASSP-92: 1992 IEEE International Conference on Acoustics, Speech, and Signal Processing.

[15]  Verena Rieser,et al.  Towards Learning Transferable Conversational Skills using Multi-dimensional Dialogue Modelling , 2018, ArXiv.

[16]  Satoshi Nakamura,et al.  Analyzing the Effect of Entrainment on Dialogue Acts , 2016, SIGDIAL Conference.

[17]  Kôiti Hasida,et al.  Towards an ISO Standard for Dialogue Act Annotation , 2010, LREC.

[18]  Rafael E. Banchs,et al.  The Fourth Dialog State Tracking Challenge , 2016, IWSDS.

[19]  Nigel G. Ward,et al.  Analysis and Prediction of Morphological Patterns of Backchannels for Attentive Listening Agents , 2015 .

[20]  Tatsuya Kawahara,et al.  Attentive listening system with backchanneling, response generation and flexible turn-taking , 2017, SIGDIAL Conference.