论文信息 - Role-explicit query identification and intent role annotation

Role-explicit query identification and intent role annotation

Understanding the information need or intent encoded within a query has long been regarded as an essential factor of effective information retrieval. For better query representation and understanding, two intent roles (kernel-object and modifier) are introduced to structurally parse a class of role-explicit queries, which constitute a majority of common user queries. Furthermore, we focus on two research problems: RP-1: Given a role-explicit query, how to identify the kernel-object and modifier, namely intent role annotation; RP-2: How to determine whether an arbitrary query is role-explicit or not. To solve RP-1, we propose a simplified word n-gram role model (SWNR), which quantifies the generating probability of a role-explicit query and performs intent role annotation effectively. Using a set of discriminative features, we build classifiers to address RP-2 in a supervised manner. The experimental results show that: (1) SWNR can achieve a satisfactory performance, more than 73% in terms of different metrics; (2) The classifiers can achieve more than 90% precision in identifying role-explicit queries; (3) Compared with traditional techniques for query representation and understanding, e.g., name entity recognition in query and class-level query intent inference, intent role annotation provides a more flexible framework and a number of applications can benefit from annotating role-explicit queries, such as intent mining and diversified document ranking.

Fuji Ren | Haitao Yu

[1] Deepayan Chakrabarti,et al. Mining broad latent query aspects from search sessions , 2009, KDD.

[2] Chorkin Chan,et al. Chinese Word Segmentation based on Maximum Matching and Word Binding Force , 1996, COLING.

[3] Hermann Ney,et al. On structuring probabilistic dependences in stochastic language modelling , 1994, Comput. Speech Lang..

[4] Zhimin Zhang,et al. Using search session context for named entity recognition in query , 2010, SIGIR.

[5] Song Liu,et al. Qualifier Mining for NTCIR-INTENT , 2011, NTCIR.

[6] Fuji Ren,et al. From Cloud Computing to Language Engineering, Affective Computing and Advanced Intelligence ∗ , 2010 .

[7] Hermann Ney,et al. Statistical Language Modeling and Word Triggers , 1996 .

[8] Hang Li,et al. Named entity recognition in query , 2009, SIGIR.

[9] Monika Henzinger,et al. Analysis of a very large web search engine query log , 1999, SIGF.

[10] Marius Pasca,et al. Weakly-supervised discovery of named entities using web search queries , 2007, CIKM '07.

[11] Daniel Jurafsky,et al. Automatic Labeling of Semantic Roles , 2002, CL.