Wiki-MetaSemantik: A Wikipedia-derived query expansion approach based on network properties

This paper discusses the use of Wikipedia for building semantic ontologies to do Query Expansion (QE) in order to improve the search results of search engines. In this technique, selecting related Wikipedia concepts becomes important. We propose the use of network properties (degree, closeness, and pageRank) to build an ontology graph of user query concepts which is derived directly from Wikipedia structures. The resulting expansion system is called Wiki-MetaSemantik. We tested this system against other online thesauruses and ontology based QE in both individual and meta-search engines setups. Despite that our system has to build a Wikipedia ontology graph in order to do its work, the technique turns out to work very fast (1:281) compared to another ontology QE baseline (Wikipedia Persian ontology QE). It has thus the potential to be utilized online. Furthermore, it shows significant improvement in accuracy. Wiki-MetaSemantik also shows better performance in a meta-search engine (MSE) set up rather than in an individual search engine set up.

[1]  Rusdi Efendi,et al.  An MDL-Based Frequent Itemset Hierarchical Clustering Technique to Improve Query Search Results of an Individual Search Engine , 2015, AIRS.

[2]  Iraklis Varlamis,et al.  An Experimental Study on Unsupervised Graph-based Word Sense Disambiguation , 2010, CICLing.

[3]  Jintao Li,et al.  Improved latent concept expansion using hierarchical markov random fields , 2010, CIKM.

[4]  I. S. W. B. Prasetya,et al.  The Analysis of Rank Fusion Techniques to Improve Query Relevance , 2015 .

[5]  Derek Greene,et al.  Unsupervised graph-based topic labelling using dbpedia , 2013, WSDM.

[6]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[7]  Khaled Abd El-Fatah Mohamed Merging Multiple Search Results Approach: for Meta Search Engines , 2010 .

[8]  Hossein Jadidoleslamy,et al.  Search Result Merging and Ranking Strategies in Meta-Search Engines: A Survey , 2012 .

[9]  K. Nakayama,et al.  Wikipedia Mining Wikipedia as a Corpus for Knowledge Extraction , 2008 .

[10]  Milad Shokouhi,et al.  Query Expansion Using External Evidence , 2009, ECIR.

[11]  Hsin-Hsi Chen,et al.  Query Expansion with ConceptNet and WordNet: An Intrinsic Comparison , 2006, AIRS.

[12]  Carson Bruce,et al.  Query Expansion Powered by Wikipedia Hyperlinks , 2012, Australasian Conference on Artificial Intelligence.

[13]  Sofia Stamou,et al.  Web query disambiguation using PageRank , 2012, J. Assoc. Inf. Sci. Technol..

[14]  Rabia Nuray-Turan,et al.  Automatic ranking of information retrieval systems using data fusion , 2006, Inf. Process. Manag..

[15]  Maryam Mahmoudi,et al.  Query Expansion Using Persian Ontology Derived from Wikipedia , 2009 .

[16]  ChengXiang Zhai,et al.  Adaptive relevance feedback in information retrieval , 2009, CIKM.

[17]  Sung-Hyon Myaeng,et al.  Query Phrase Expansion Using Wikipedia in Patent Class Search , 2011, AIRS.

[18]  Moni Naor,et al.  Rank aggregation methods for the Web , 2001, WWW '01.

[19]  Milad Shokouhi,et al.  Query Suggestion and Data Fusion in Contextual Disambiguation , 2015, WWW.