Automatic Performance Evaluation of Web Search Systems using Rough Set based Rank Aggregation

Web searching is such an activity that its importance can just not be ignored in the current scenario. Since there are a large number of publicly accessible search engines, shich differ in their indexing algorithms and hence the search results, the evaluation of search engines performance is needed to determine which one is the best. The human intelligence may be used to measure the search engine effectiveness. But, a subjective evaluation done on the basis of user-feedback is costly in terms of the time required. Therefore, it is also not scalable. So, there is a need of an automatic evaluation method. In this paper, we present the architecture of an automatic Web search evaluation system that combines the different evaluation techniques using a Rough Set based Rank aggregation technique. The rough set based rank aggregation models the user’s feedback based rank aggregation. In the rough set based aggregation technique, the ranking rules are learnt on the basis of the user feedback in the training data sets. The learned rules are then used to estimate the overall ranking for the other data sets, for which user feedback is not available. We show our experimental results pertaining to seven public search engines.

[1]  Emine Yilmaz,et al.  A statistical method for system evaluation using incomplete judgments , 2006, SIGIR.

[2]  Jerzy W. Grzymala-Busse,et al.  Rough Sets , 1995, Commun. ACM.

[3]  Abdur Chowdhury,et al.  Automatic evaluation of world wide web search services , 2002, SIGIR '02.

[4]  Peter B. Danzig,et al.  Boolean Similarity Measures for Resource Discovery , 1997, IEEE Trans. Knowl. Data Eng..

[5]  Ian Soboroff,et al.  Ranking retrieval systems without relevance judgments , 2001, SIGIR '01.

[6]  Rajeev Motwani,et al.  The PageRank Citation Ranking : Bringing Order to the Web , 1999, WWW 1999.

[7]  Shengli Wu,et al.  Methods for ranking information retrieval systems without relevance judgments , 2003, SAC '03.

[8]  Rabia Nuray-Turan,et al.  Automatic performance evaluation of Web search engines , 2004, Inf. Process. Manag..

[9]  Yiyu Yao,et al.  Mining Ordering Rules Using Rough Set Theory , 2008 .

[10]  W. Pirie Spearman Rank Correlation Coefficient , 2006 .

[11]  Rashid Ali,et al.  Rough Set Based Rank Aggregation for the Web , 2007, IICAI.

[12]  Bernard J. Jansen,et al.  Automated evaluation of search engine performance via implicit user feedback , 2005, SIGIR '05.

[13]  Víctor Pàmies,et al.  Open Directory Project , 2003 .

[14]  M. M. Sufyan Beg A subjective measure of web search quality , 2005, Inf. Sci..

[15]  Nesar Ahmad,et al.  Web search enhancement by mining user actions , 2007, Inf. Sci..

[16]  Longzhuang Li,et al.  Precision Evaluation of Search Engines , 2004, World Wide Web.

[17]  Gerard Salton,et al.  A vector space model for automatic indexing , 1975, CACM.

[18]  Abdur Chowdhury,et al.  Using titles and category names from editor-driven taxonomies for automatic evaluation , 2003, CIKM '03.

[19]  Rabia Nuray-Turan,et al.  Automatic ranking of information retrieval systems using data fusion , 2006, Inf. Process. Manag..