Query Intent Recognition Based on Multi-Class Features

In order to enhance the user search experience of the search engine, an intent recognition search based on natural language input is proposed. By using reality mining technology to obtain the potential consciousness information from the query expression, search engines can better predict the query results that meet users’ requirements. With the development of conventional machine learning and deep learning, it is possible to further improve the accuracy of prediction results. This paper adopts a similarity calculation method based on long short-term memory (LSTM) and a traditional machine learning method based on multi-feature extraction. It is found that entity features can significantly improve the accuracy of intention classification. Second, the accuracy of intention classification based on the feature sequence constructed by key entities is up to 94.16% in the field of manual labeling by using the BiLSTM classification model.

[1]  Andrei Broder,et al.  A taxonomy of web search , 2002, SIGF.

[2]  Helen Ashman,et al.  The effect of user intent on the stability of search engine results , 2011, J. Assoc. Inf. Sci. Technol..

[3]  Jens Grivolla,et al.  FBM: Combining lexicon-based ML and heuristics for Social Media Polarities , 2013, *SEMEVAL.

[4]  Ming Li,et al.  Learning Question Similarity with Recurrent Neural Networks , 2017, 2017 IEEE International Conference on Big Knowledge (ICBK).

[5]  Andrew McCallum,et al.  Conditional Random Fields: Probabilistic Models for Segmenting and Labeling Sequence Data , 2001, ICML.

[6]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[7]  Yirui Wu,et al.  Multi-glimpse LSTM with color-depth feature fusion for human detection , 2017, 2017 IEEE International Conference on Image Processing (ICIP).

[8]  Kuldip K. Paliwal,et al.  Bidirectional recurrent neural networks , 1997, IEEE Trans. Signal Process..

[9]  Ricardo A. Baeza-Yates,et al.  The Intention Behind Web Queries , 2006, SPIRE.

[10]  John Atkinson,et al.  Ensembling Classifiers for Detecting User Intentions behind Web Queries , 2016, IEEE Internet Computing.

[11]  Vladimir N. Vapnik,et al.  The Nature of Statistical Learning Theory , 2000, Statistics for Engineering and Information Science.

[12]  Steve Chien,et al.  Semantic similarity between search engine queries using temporal correlation , 2005, WWW '05.

[13]  Xin Yao,et al.  Diversity creation methods: a survey and categorisation , 2004, Inf. Fusion.

[14]  Yasubumi Sakakibara,et al.  Probabilistic Context-Free Grammars , 2010, Encyclopedia of Machine Learning.

[15]  Maarten Versteegh,et al.  Learning Text Similarity with Siamese Recurrent Networks , 2016, Rep4NLP@ACL.

[16]  Lih-Yuan Deng,et al.  The Cross-Entropy Method: A Unified Approach to Combinatorial Optimization, Monte-Carlo Simulation, and Machine Learning , 2006, Technometrics.

[17]  Andreas Christmann,et al.  Support vector machines , 2008, Data Mining and Knowledge Discovery Handbook.

[18]  Jeffrey Dean,et al.  Efficient Estimation of Word Representations in Vector Space , 2013, ICLR.

[19]  Min-Yen Kan,et al.  Functional Faceted Web Query Analysis , 2007 .

[20]  Xiaojie Wang,et al.  Dependency Relation Based Detection of Lexicalized User Goals , 2010, UIC.

[21]  Alex Graves,et al.  Supervised Sequence Labelling with Recurrent Neural Networks , 2012, Studies in Computational Intelligence.

[22]  Amanda Spink,et al.  Classifying the user intent of web queries using k-means clustering , 2010, Internet Res..

[23]  Jonas Mueller,et al.  Siamese Recurrent Architectures for Learning Sentence Similarity , 2016, AAAI.