Dialogue Act Classification in Reference Interview Using Convolutional Neural Network with Byte Pair Encoding

Dialogue act classification is an important component of dialogue management, which captures the user’s intention and chooses the appropriate response action. In this paper, we focus on the dialogue act classification in reference interviews to model the behaviors of librarians in the information seeking dialogues. Reference interviews sometimes include rare words and phrases. Therefore, the existing approaches that use words as units of input often do not work well here. We used the byte pair encoding compression algorithm to build a new vocabulary for the inputs of the classifier. By using this new unit as a feature of the convolutional neural network-based classifier, we improved the accuracy of the dialogue act classification while suppressing the size of vocabulary.

[1]  Noriaki Horii,et al.  A multichannel convolutional neural network for cross-language dialog state tracking , 2016, 2016 IEEE Spoken Language Technology Workshop (SLT).

[2]  Ina Fourie,et al.  Conducting the Reference Interview: A How‐to‐Do‐it Manual for Librarians , 2003 .

[3]  Milica Gasic,et al.  POMDP-Based Statistical Spoken Dialog Systems: A Review , 2013, Proceedings of the IEEE.

[4]  Catherine Sheldrick Ross,et al.  Conducting the Reference Interview: A How-to-do-it Manual for Librarians , 2002, Program.

[5]  Tatsuya Kawahara,et al.  Conversational system for information navigation based on POMDP with user focus tracking , 2015, Comput. Speech Lang..

[6]  Steve J. Young,et al.  Partially observable Markov decision processes for spoken dialog systems , 2007, Comput. Speech Lang..

[7]  Keisuke Inoue,et al.  An Investigation of Digital Reference Interviews: A Dialogue Act Approach. , 2013 .

[8]  Ayumi Shinohara,et al.  Speeding Up Pattern Matching by Text Compression , 2000, CIAC.

[9]  Philip Gage,et al.  A new algorithm for data compression , 1994 .

[10]  Ilya Sutskever,et al.  SUBWORD LANGUAGE MODELING WITH NEURAL NETWORKS , 2011 .

[11]  Kôiti Hasida,et al.  ISO 24617-2: A semantically-based standard for dialogue annotation , 2012, LREC.

[12]  Yorick Wilks,et al.  Dialogue Act Classification Based on Intra-Utterance Features∗ , 2005 .

[13]  Rico Sennrich,et al.  Neural Machine Translation of Rare Words with Subword Units , 2015, ACL.

[14]  Tatsuya Kawahara,et al.  Dialogue strategy to clarify user's queries for document retrieval system with speech interface , 2005, INTERSPEECH.

[15]  Andreas Stolcke,et al.  Dialogue act modeling for automatic tagging and recognition of conversational speech , 2000, CL.

[16]  Lynn Silipigni Connaway,et al.  "Are we getting warmer?" Query clarification in live chat virtual reference , 2011 .

[17]  Alessandro Moschitti,et al.  UNITN: Training Deep Convolutional Neural Network for Twitter Sentiment Classification , 2015, *SEMEVAL.