Improving Dialogue Act Classification for Spontaneous Arabic Speech and Instant Messages at Utterance Level

The ability to model and automatically detect dialogue act is an important step toward understanding spontaneous speech and Instant Messages. However, it has been difficult to infer a dialogue act from a surface utterance because it highly depends on the context of the utterance and speaker linguistic knowledge; especially in Arabic dialects. This paper proposes a statistical dialogue analysis model to recognize utterance's dialogue acts using a multi-classes hierarchical structure. The model can automatically acquire probabilistic discourse knowledge from a dialogue corpus were collected and annotated manually from multi-genre Egyptian call-centers. Extensive experiments were conducted using Support Vector Machines classifier to evaluate the system performance. The results attained in the term of average F-measure scores of 0.912; showed that the proposed approach has moderately improved F-measure by approximately 20%.

[1]  AbdelRahim A. Elmadany,et al.  Towards Understanding Egyptian Arabic Dialogues , 2015, ArXiv.

[2]  Edward Ivanovic,et al.  Automatic instant messaging dialogue using statistical models and dialogue acts , 2008 .

[3]  Mounir Zrigui,et al.  A Combined Method Based on Stochastic and Linguistic Paradigm for the Understanding of Arabic Spontaneous Utterances , 2013, CICLing.

[4]  AbdelRahim A. Elmadany,et al.  Arabic Inquiry-Answer Dialogue Acts Annotation Schema , 2014, ArXiv.

[5]  Timothy Baldwin,et al.  Classifying Dialogue Acts in One-on-One Live Chats , 2010, EMNLP.

[6]  AbdelRahim A. Elmadany,et al.  Turn Segmentation into Utterances for Arabic Spontaneous Dialogues and Instance Messages , 2015, ArXiv.

[7]  Sherif M. Abdou,et al.  JANA: An Arabic human-human dialogues corpus , 2015, 2015 IEEE 2nd International Conference on Recent Trends in Information Systems (ReTIS).

[8]  Youngjoong Ko,et al.  Hierarchical speech-act classification for discourse analysis , 2013, Pattern Recognit. Lett..

[9]  Edward Ivanovic,et al.  Automatic Utterance Segmentation in Instant Messaging Dialogue , 2005, ALTA.

[10]  Jungyun Seo,et al.  An Integrated Dialogue Analysis Model for Determining Speech Acts and Discourse Structures , 2005, IEICE Trans. Inf. Syst..

[11]  Shrikanth S. Narayanan,et al.  Combining lexical, syntactic and prosodic cues for improved online dialog act tagging , 2009, Comput. Speech Lang..

[12]  Yorick Wilks,et al.  Dialogue Act Classification Based on Intra-Utterance Features∗ , 2005 .

[13]  A. Graesser,et al.  AuToMATED SPEECh ACT CLASSIFICATIoN IN ARAbIC , 2010 .

[14]  Robert Tibshirani,et al.  Classification by Pairwise Coupling , 1997, NIPS.

[15]  Nick Webb,et al.  Cue-based dialogue act classification , 2010 .

[16]  Lamia Hadrich Belguith,et al.  Automatic Dialogue Act Annotation within Arabic Debates , 2015, CICLing.

[17]  David R. Traum,et al.  Utterance Units in Spoken Dialogue , 1996, ECAI Workshop on Dialogue Processing in Spoken Language Systems.

[18]  John H. L. Hansen,et al.  University of Colorado Dialogue Systems for Travel and Navigation , 2001, HLT.

[19]  Tomek Strzalkowski,et al.  Data-Driven Strategies for an Automated Dialogue System , 2004, ACL.

[20]  AbdelRahim A. Elmadany,et al.  A Survey of Arabic Dialogues Understanding for Spontaneous Dialogues and Instant Message , 2015, ArXiv.

[21]  David Vilar,et al.  Dialogue act classification using a Bayesian approach ∗ , 2004 .

[22]  Ken Samuel,et al.  Dialogue Act Tagging with Transformation-Based Learning , 1998, ACL.

[23]  Victor Zue,et al.  Interactive Problem Solving and Dialogue in the ATIS Domain , 1991, HLT.

[24]  Ian H. Witten,et al.  The WEKA data mining software: an update , 2009, SKDD.

[25]  Andreas Stolcke,et al.  Dialogue act modeling for automatic tagging and recognition of conversational speech , 2000, CL.

[26]  Toine Andernach A Machine Learning Approach to the Classification of Dialogue Utterances , 1996, ArXiv.

[27]  Lamia Hadrich Belguith,et al.  Statistical Framework with Knowledge Base Integration for Robust Speech Understanding of the Tunisian Dialect , 2015, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[28]  Barbara Di Eugenio,et al.  Dialogue Act Classification, Higher Order Dialogue Structure, and Instance-Based Learning , 2010 .

[29]  Zuhair Bandar,et al.  ArabChat: An Arabic Conversational Agent , 2014, 2014 6th International Conference on Computer Science and Information Technology (CSIT).

[30]  Lamia Hadrich Belguith,et al.  Discriminative Framework for Spoken Tunisian Dialect Understanding , 2013, SLSP.

[31]  Nick Webb,et al.  Data-Driven Language Understanding for Spoken Language Dialogue , 2005 .

[32]  Zuhair Bandar,et al.  User's utterance classification using machine learning for Arabic Conversational Agents , 2013, 2013 5th International Conference on Computer Science and Information Technology.