An Efficient Method for Tagging a Query with Category Labels Using Wikipedia towards Enhancing Search Engine Results

This paper intends to present a straightforward, extensive, and noise resistant method for efficiently tagging a web query, submitted to a search engine, with proper category labels. These labels are intended to represent the closest categories related to the query which can ultimately be used to enhance the results of any typical search engine by either restricting the results to matching categories or enriching the query itself. The presented method effectively rules out noise words within a query, forms the optimal keyword packs using a density function, and returns a set of category labels which represent the common topics of the given query using Wikipedia category hierarchy.

[1]  Qiang Yang,et al.  Query enrichment for web-query classification , 2006, TOIS.

[2]  Ying Li,et al.  Product query classification , 2009, CIKM.

[3]  Daniel S. Weld,et al.  Autonomously semantifying wikipedia , 2007, CIKM '07.

[4]  Ophir Frieder,et al.  Automatic classification of Web queries using very large unlabeled query logs , 2007, TOIS.

[5]  Qiang Yang,et al.  Q2C@UST: our winning solution to query classification in KDDCUP 2005 , 2005, SKDD.

[6]  Ophir Frieder,et al.  Improving automatic query classification via semi-supervised learning , 2005, Fifth IEEE International Conference on Data Mining (ICDM'05).

[7]  Ophir Frieder,et al.  Automatic web query classification using labeled and unlabeled training data , 2005, SIGIR '05.

[8]  Péter Schönhofen Identifying document topics using the Wikipedia category network , 2009, Web Intell. Agent Syst..

[9]  Somnath Banerjee,et al.  Clustering short texts using wikipedia , 2007, SIGIR.

[10]  Lehel Csató,et al.  Wikipedia-Based Kernels for Text Categorization , 2007, Ninth International Symposium on Symbolic and Numeric Algorithms for Scientific Computing (SYNASC 2007).

[11]  Qiang Yang,et al.  Building bridges for web query classification , 2006, SIGIR.

[12]  Ian H. Witten,et al.  Mining Meaning from Wikipedia , 2008, Int. J. Hum. Comput. Stud..

[13]  Ying Li,et al.  KDD CUP-2005 report: facing a great challenge , 2005, SKDD.

[14]  Jian Hu,et al.  Using Wikipedia knowledge to improve text classification , 2009, Knowledge and Information Systems.

[15]  Gang Wang,et al.  Understanding user's query intent with wikipedia , 2009, WWW '09.