A Feature-Enriched Method for User Intent Classification by Leveraging Semantic Tag Expansion

User intent identification and classification has become a vital topic of query understanding in human-computer dialogue applications. The identification of users’ intent is especially crucial for assisting system to understand users’ queries so as to classify the queries accurately to improve users’ satisfaction. Since the posted queries are usually short and lack of context, conventional methods heavily relying on query n-grams or other common features are not sufficient enough. This paper proposes a compact yet effective user intention classification method named as ST-UIC based on a constructed semantic tag repository. The method proposes to use a combination of four kinds of features including characters, non-key-noun part-of-speech tags, target words, and semantic tags. The experiments are based on a widely applied dataset provided by the First Evaluation of Chinese Human-Computer Dialogue Technology. The result shows that the method achieved a F1 score of 0.945, exceeding a list of baseline methods and demonstrating its effectiveness in user intent classification.

[1]  Gökhan Tür,et al.  The AT&T spoken language understanding system , 2006, IEEE Transactions on Audio, Speech, and Language Processing.

[2]  Xiao Li,et al.  Precomputing search features for fast and accurate query classification , 2010, WSDM '10.

[3]  Timothy J. Hazen,et al.  LARGE-SCALE WORD REPRESENTATION FEATURES FOR IMPROVED SPOKEN LANGUAGE UNDERSTANDING , 2015 .

[4]  Gökhan Tür,et al.  Exploiting query click logs for utterance domain detection in spoken language understanding , 2011, 2011 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[5]  Amanda Spink,et al.  Determining the user intent of web search engine queries , 2007, WWW '07.

[6]  Gang Wang,et al.  Understanding user's query intent with wikipedia , 2009, WWW '09.

[7]  James R. Glass,et al.  Query understanding enhanced by hierarchical parsing structures , 2013, 2013 IEEE Workshop on Automatic Speech Recognition and Understanding.

[8]  Paolo Rosso,et al.  A Simple Model for Classifying Web Queries by User Intent , 2012 .

[9]  Jun Zhang,et al.  Large-scaleword representation features for improved spoken language understanding , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[10]  Renato De Mori,et al.  Spoken language understanding: a survey , 2007, ASRU.

[11]  Charles L. A. Clarke,et al.  Classifying and Characterizing Query Intent , 2009, ECIR.

[12]  Wanxiang Che,et al.  The First Evaluation of Chinese Human-Computer Dialogue Technology , 2017, ArXiv.

[13]  Bing Liu,et al.  Multi-Domain Adversarial Learning for Slot Filling in Spoken Language Understanding , 2017, ArXiv.

[14]  Liu Yiqun,et al.  Research in Search Engine User Behavior Based on Log Analysis , 2004 .

[15]  Ruhi Sarikaya,et al.  Contextual domain classification in spoken language understanding systems using recurrent neural network , 2014, 2014 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[16]  Gökhan Tür,et al.  Use of kernel deep convex networks and end-to-end learning for spoken language understanding , 2012, 2012 IEEE Spoken Language Technology Workshop (SLT).

[17]  Qingyao Wu,et al.  Leveraging question target word features through semantic relation expansion for answer type classification , 2017, Knowl. Based Syst..

[18]  Kinam Park,et al.  Automatic extraction of user’s search intention from web search logs , 2010, Multimedia Tools and Applications.

[19]  Yangyang Shi,et al.  Contextual spoken language understanding using recurrent neural networks , 2015, 2015 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).

[20]  Dilek Z. Hakkani-Tür,et al.  Leveraging Web Query Logs to Learn User Intent Via Bayesian Discrete Latent Variable Model , 2011 .

[21]  Wang Bin A Survey of Web Search Query Intention Classification , 2008 .

[22]  Tianyong Hao,et al.  A WordNet Expansion-Based Approach for Question Targets Identification and Classification , 2015, CCL.

[23]  Giuseppe Riccardi,et al.  How may I help you? , 1997, Speech Commun..

[24]  Stephanie Seneff,et al.  Spoken Dialogue Systems , 2008 .

[25]  Yoshua Bengio,et al.  Deep Learning of Representations: Looking Forward , 2013, SLSP.

[26]  Gökhan Tür,et al.  Towards deeper understanding: Deep convex networks for semantic utterance classification , 2012, 2012 IEEE International Conference on Acoustics, Speech and Signal Processing (ICASSP).