Paper information - Speaker-change Aware CRF for Dialogue Act Classification

Recent work in Dialogue Act (DA) classification approaches the task as a sequence labeling problem, using neural network models coupled with a Conditional Random Field (CRF) as the last layer. The CRF models the conditional probability of the target DA label sequence given the input utterance sequence. However, the task involves another important input sequence, that of speakers, which previous work ignores. To address this limitation, this paper proposes a simple modification of the CRF layer that takes speaker-change into account. Experiments on the SwDA corpus show that the modified CRF layer outperforms the original one, with very wide margins for some DA labels. Further, visualizations demonstrate that the modified CRF layer can learn meaningful, sophisticated transition patterns between DA label pairs conditioned on speaker-change in an end-to-end way. Code is publicly available.
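The idea of conditioning CRF transitions on speaker change can be illustrated with a minimal sketch. It assumes one label-transition matrix per speaker-change flag (0 = same speaker as the previous utterance, 1 = different speaker); the abstract does not state the exact parameterization, so this layout, the function names, and the toy scores are all illustrative assumptions, not the authors' released code. Emission scores would normally come from a neural encoder; here they are hard-coded.

```python
import math
from itertools import product

def logsumexp(values):
    """Numerically stable log(sum(exp(v))) over a list of floats."""
    m = max(values)
    return m + math.log(sum(math.exp(v - m) for v in values))

def sequence_score(emissions, labels, changes, trans):
    """Unnormalized score of one label sequence.

    emissions[t][k]: score of label k for utterance t
    changes[t]:      1 if the speaker of utterance t differs from t-1, else 0
                     (changes[0] is unused)
    trans[c][i][j]:  score of moving from label i to label j given flag c
                     (assumed layout: one matrix per speaker-change flag)
    """
    score = emissions[0][labels[0]]
    for t in range(1, len(labels)):
        score += trans[changes[t]][labels[t - 1]][labels[t]]
        score += emissions[t][labels[t]]
    return score

def log_partition(emissions, changes, trans):
    """Forward algorithm: log of the summed exp-scores of all label sequences."""
    num_labels = len(emissions[0])
    alpha = list(emissions[0])
    for t in range(1, len(emissions)):
        matrix = trans[changes[t]]  # pick the matrix for this step's flag
        alpha = [
            logsumexp([alpha[i] + matrix[i][j] for i in range(num_labels)])
            + emissions[t][j]
            for j in range(num_labels)
        ]
    return logsumexp(alpha)

def viterbi(emissions, changes, trans):
    """Highest-scoring label sequence under the speaker-change-aware scores."""
    num_labels = len(emissions[0])
    delta = list(emissions[0])
    backpointers = []
    for t in range(1, len(emissions)):
        matrix = trans[changes[t]]
        pointers, new_delta = [], []
        for j in range(num_labels):
            best = max(range(num_labels), key=lambda i: delta[i] + matrix[i][j])
            pointers.append(best)
            new_delta.append(delta[best] + matrix[best][j] + emissions[t][j])
        backpointers.append(pointers)
        delta = new_delta
    path = [max(range(num_labels), key=lambda k: delta[k])]
    for pointers in reversed(backpointers):
        path.append(pointers[path[-1]])
    return path[::-1]

# Toy setup (hypothetical values): label 0 = question, label 1 = answer.
trans = [
    [[1.0, -1.0], [-1.0, 1.0]],  # same speaker: staying on a label is favored
    [[-1.0, 2.0], [1.0, -1.0]],  # speaker change: question -> answer is favored
]
# The first utterance looks like a question; the second is ambiguous on its own.
emissions = [[2.0, 0.0], [0.0, 0.0]]

print(viterbi(emissions, [0, 1], trans))  # speaker changes: [0, 1]
print(viterbi(emissions, [0, 0], trans))  # same speaker:    [0, 0]
```

With identical emission scores, the decoded label of the second utterance depends only on whether the speaker changed, which is the kind of transition pattern the abstract describes. The conditional probability of a sequence is `exp(sequence_score(...) - log_partition(...))`, and training would minimize its negative log over labeled dialogues.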

Jean-Pierre Lorré | Michalis Vazirgiannis | Guokan Shang | Antoine Jean-Pierre Tixier
