Integration of Word and Semantic Features for Theme Identification in Telephone Conversations

The paper describes a research about the possibility of integrating different types of word and semantic features for automatically identifying themes of real-life telephone conversations in a customer care service (CCS). Features are all the words of the application vocabulary, the probabilities obtained with latent Dirichlet allocation (LDA) of selected discriminative words and semantic features obtained with a limited human supervision of words and patterns expressing entities and relations of the application ontology. A deep neural network (DNN) is proposed for integrating these features. Experimental results on manual and automatic conversation transcriptions are presented showing the effective contribution of the integration. The results show how to automatically select a large subset of the test corpus with high precision and recall, making it possible to automatically obtain theme mention proportions in different time periods.

[1]  Georges Linarès,et al.  The LIA Speech Recognition System: From 10xRT to 1xRT , 2007, TSD.

[2]  Hsin-Min Wang,et al.  Exploiting semantic associative information in topic modeling , 2010, 2010 IEEE Spoken Language Technology Workshop.

[3]  Mohamed Morchid,et al.  Improving dialogue classification using a topic space representation and a Gaussian classifier based on the decision rule , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[4]  Timothy J. Hazen MCE Training Techniques for Topic Identification of Spoken Audio Documents , 2011, IEEE Transactions on Audio, Speech, and Language Processing.

[5]  Frédéric Béchet,et al.  DECODA: a call-centre human-human spoken conversation corpus , 2012, LREC.

[6]  Geoffrey E. Hinton,et al.  Application of Deep Belief Networks for Natural Language Understanding , 2014, IEEE/ACM Transactions on Audio, Speech, and Language Processing.

[7]  Gokhan Tur,et al.  Spoken Language Understanding: Systems for Extracting Semantic Information from Speech , 2011 .

[8]  Michael I. Jordan,et al.  Latent Dirichlet Allocation , 2001, J. Mach. Learn. Res..

[9]  T. Landauer,et al.  Indexing by Latent Semantic Analysis , 1990 .

[10]  Claudio Carpineto,et al.  An information-theoretic approach to automatic query expansion , 2001, TOIS.

[11]  Mohamed Morchid,et al.  I-vector based representation of highly imperfect automatic transcriptions , 2014, INTERSPEECH.

[12]  Dilek Z. Hakkani-Tür,et al.  Human/Human Conversation Understanding , 2011 .

[13]  Mohamed Morchid,et al.  Theme identification in human-human conversations with features from specific speaker type hidden spaces , 2014, INTERSPEECH.

[14]  Helen F. Hastie,et al.  Training a statistical surface realiser from automatic slot labelling , 2014, 2014 IEEE Spoken Language Technology Workshop (SLT).

[15]  Mohamed Morchid,et al.  An I-vector Based Approach to Compact Multi-Granularity Topic Spaces Representation of Textual Documents , 2014, EMNLP.

[16]  Alexander I. Rudnicky,et al.  Leveraging frame semantics and distributional semantics for unsupervised semantic slot induction in spoken dialogue systems , 2014, 2014 IEEE Spoken Language Technology Workshop (SLT).