Modified rough set based aggregation for effective evaluation of web search systems

Rank Aggregation is the problem of generating a single consensus ranking for a given set of rankings. Rough set based Rank aggregation is a user feedback based technique for rank aggregation, which learns ranking rules using rough set theory. In this paper, we discuss an improved version of the Rough set based Rank aggregation technique, which is more suitable for aggregation of different Web search evaluation techniques. For learning the ranking rules, we obtain the implicit user feedback to the search results returned by a search engine in response to a set of fifteen queries and mine the ranking rules using rough set theory. In the modified rough set based rank aggregation technique, we incorporate the confidence of the rules in predicting a class for a given set of data. That means, we do not say surely that the record belongs to a particular class according to a particular rule. Instead, we associate a score variable to the predicted class of the record, where the value of the variable is equal to the confidence measure of the rule. We validate the mined ranking rules by comparing the predicted user feedback based ranking with the actual user feedback based ranking. We apply the ranking rules to another set of thirty seven queries for aggregating different rankings of search results obtained on the basis of different evaluation techniques. We show our experimental results pertaining to seven public search engines.

[1]  Peter B. Danzig,et al.  Boolean Similarity Measures for Resource Discovery , 1997, IEEE Trans. Knowl. Data Eng..

[2]  Garrison W. Cottrell,et al.  Fusion Via a Linear Combination of Scores , 1999, Information Retrieval.

[3]  Moni Naor,et al.  Rank aggregation methods for the Web , 2001, WWW '01.

[4]  L. Cranor,et al.  Declared-strategy voting: an instrument for group decision-making , 1996 .

[5]  Rabia Nuray-Turan,et al.  Automatic ranking of information retrieval systems using data fusion , 2006, Inf. Process. Manag..

[6]  Javed A. Aslam,et al.  Models for metasearch , 2001, SIGIR '01.

[7]  Ronald Fagin,et al.  Efficient similarity search and classification via rank aggregation , 2003, SIGMOD '03.

[8]  Rashid Ali,et al.  Rough Set Based Rank Aggregation for the Web , 2007, IICAI.

[9]  Javed A. Aslam,et al.  Condorcet fusion for improved retrieval , 2002, CIKM '02.

[10]  Rashid Ali,et al.  Automatic Performance Evaluation of Web Search Systems using Rough Set based Rank Aggregation , 2009, IHCI.

[11]  W. Pirie Spearman Rank Correlation Coefficient , 2006 .

[12]  Nesar Ahmad,et al.  Web search enhancement by mining user actions , 2007, Inf. Sci..

[13]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[14]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[15]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[16]  Ronald Fagin,et al.  Searching the workplace web , 2003, WWW '03.

[17]  Yoram Singer,et al.  Learning to Order Things , 1997, NIPS.

[18]  M. M. Sufyan Beg A subjective measure of web search quality , 2005, Inf. Sci..

[19]  Umberto Straccia,et al.  Web metasearch: rank vs. score based rank aggregation methods , 2003, SAC '03.