论文信息 - Hierarchical Multi-Label Dialog Act Recognition on Spanish Data

Hierarchical Multi-Label Dialog Act Recognition on Spanish Data

Dialog acts reveal the intention behind the uttered words. Thus, their automatic recognition is important for a dialog system trying to understand its conversational partner. The study presented in this article approaches that task on the DIHANA corpus, whose three-level dialog act annotation scheme poses problems which have not been explored in recent studies. In addition to the hierarchical problem, the two lower levels pose multi-label classification problems. Furthermore, each level in the hierarchy refers to a different aspect concerning the intention of the speaker both in terms of the structure of the dialog and the task. Also, since its dialogs are in Spanish, it allows us to assess whether the state-of-the-art approaches on English data generalize to a different language. More specifically, we compare the performance of different segment representation approaches focusing on both sequences and patterns of words and assess the importance of the dialog history and the relations between the multiple levels of the hierarchy. Concerning the single-label classification problem posed by the top level, we show that the conclusions drawn on English data also hold on Spanish data. Furthermore, we show that the approaches can be adapted to multi-label scenarios. Finally, by hierarchically combining the best classifiers for each level, we achieve the best results reported for this corpus.

Ricardo Ribeiro | David Martins de Matos | Eug'enio Ribeiro | Ricardo Ribeiro | Eugénio Ribeiro

[1] Anne H. Anderson,et al. The Hcrc Map Task Corpus , 1991 .

[2] Elizabeth Shriberg,et al. Switchboard SWBD-DAMSL shallow-discourse-function annotation coders manual , 1997 .

[3] Wolfgang Minker,et al. A Parameterized and Annotated Spoken Dialog Corpus of the CMU Let’s Go Bus Information System , 2012, LREC.

[4] Fabio Pianesi,et al. NESPOLE!'s Multilingual and Multimodal Corpus , 2002, LREC.

[5] Luis A. Pineda,et al. Predicting Obligation Dialogue Acts from Prosodic and Speaker Information , 2005 .

[6] Klaus Ries,et al. HMM and neural network based speech act detection , 1999, 1999 IEEE International Conference on Acoustics, Speech, and Signal Processing. Proceedings. ICASSP99 (Cat. No.99CH36258).

[7] R. Granell,et al. Acquisition and Labelling of a Spontaneous Speech Dialogue Corpus ∗ , 2005 .

[8] Luis Alberto Pineda,et al. Predicting Dialogue Acts from Prosodic Information , 2006, CICLing.

[9] Yoon Kim,et al. Convolutional Neural Networks for Sentence Classification , 2014, EMNLP.

[10] Franck Dernoncourt,et al. Sequential Short-Text Classification with Recurrent and Convolutional Neural Networks , 2016, NAACL.

[11] Ricardo Ribeiro,et al. A Study on Dialog Act Recognition using Character-Level Tokenization , 2018, AIMSA.

[12] Ricardo Ribeiro,et al. The Influence of Context on Dialogue Act Recognition , 2015, ArXiv.

[13] John R. Searle,et al. Speech Acts: An Essay in the Philosophy of Language , 1970 .

[14] Carlos D. Martínez-Hinarejos,et al. Statistical framework for a Spanish spoken dialogue corpus , 2008, Speech Commun..

[15] Barbara Di Eugenio,et al. Dialogue Act Classification, Higher Order Dialogue Structure, and Instance-Based Learning , 2010 .

[16] Emilio Sanchis Arnal,et al. A Labelling Proposal to Annotate Dialogues , 2002, LREC.

[17] Matthias Abt. Verbmobil A Translation System For Face To Face Dialog , 2016 .

[18] Evgeny A. Stepanov,et al. ISO-Standard Domain-Independent Dialogue Act Tagging for Conversational Agents , 2018, COLING.