A Semantic Framework for Evaluating Topical Search Methods

The absence of reliable and efficient techniques to evaluate information retrieval systems has become a bottleneck in the development of novel retrieval methods. In traditional approaches users or hired evaluators provide manual assessments of relevance. However these approaches are neither efficient nor reliable since they do not scale with the complexity and heterogeneity of available digital information. Automatic approaches, on the other hand, could be efficient but disregard semantic data, which is usually important to assess the actual performance of the evaluated methods. This article proposes to use topic ontologies and semantic similarity data derived from these ontologies to implement an automatic semantic evaluation framework for information retrieval systems. The use of semantic similarity data allows to capture the notion of partial relevance, generalizing traditional evaluation metrics, and giving rise to novel performance measures such as semantic precision and semantic harmonic mean. The validity of the approach is supported by user studies and the application of the proposed framework is illustrated with the evaluation of topical retrieval systems. The evaluated systems include a baseline, a supervised version of the Bo1 query refinement method and two multi-objective evolutionary algorithms for context-based retrieval. Finally, we discuss the advantages of applying evaluation metrics that account for semantic similarity data and partial relevance over existing metrics based on the notion of total relevance.

[1]  Ana Gabriela Maguitman,et al.  Multiobjective evolutionary algorithms for context-based search , 2010, J. Assoc. Inf. Sci. Technol..

[2]  Roy Rada,et al.  Development and application of a metric on semantic nets , 1989, IEEE Trans. Syst. Man Cybern..

[3]  Dayne Freitag,et al.  A Machine Learning Architecture for Optimizing Web Search Engines , 1999 .

[4]  R. Akavipat,et al.  Emerging semantic communities in peer web search , 2006, P2PIR '06.

[5]  Dekang Lin,et al.  An Information-Theoretic Definition of Similarity , 1998, ICML.

[6]  Ana Gabriela Maguitman,et al.  Suggesting novel but related topics: towards context-based support for knowledge model extension , 2005, IUI '05.

[7]  Gobinda G. Chowdhury,et al.  TREC: Experiment and Evaluation in Information Retrieval , 2007 .

[8]  Filippo Menczer,et al.  Algorithmic Computation and Approximation of Semantic Similarity , 2006, World Wide Web.

[9]  Filippo Menczer,et al.  Correlated topologies in citation networks and the Web , 2004 .

[10]  David Carmel,et al.  Scaling IR-system evaluation using term relevance sets , 2004, SIGIR '04.

[11]  J. J. Rocchio,et al.  Relevance feedback in information retrieval , 1971 .

[12]  Ana Gabriela Maguitman,et al.  Using genetic algorithms to evolve a population of topical queries , 2008, Inf. Process. Manag..

[13]  Ana Gabriela Maguitman,et al.  Multiobjective evolutionary algorithms for context-based search , 2010 .

[14]  Philip Resnik,et al.  Using Information Content to Evaluate Semantic Similarity in a Taxonomy , 1995, IJCAI.

[15]  Longzhuang Li,et al.  A new method for automatic performance comparison of search engines , 2004, World Wide Web.

[16]  Ana Gabriela Maguitman,et al.  A semi-supervised incremental algorithm to automatically formulate topical queries , 2009, Inf. Sci..

[17]  Gerard Salton,et al.  The SMART Retrieval System—Experiments in Automatic Document Processing , 1971 .

[18]  Cyril W. Cleverdon,et al.  The significance of the Cranfield tests on index languages , 1991, SIGIR '91.

[19]  Abdur Chowdhury,et al.  Using titles and category names from editor-driven taxonomies for automatic evaluation , 2003, CIKM '03.

[20]  C. J. van Rijsbergen,et al.  Information Retrieval , 1979, Encyclopedia of GIS.

[21]  Ciro Cattuto,et al.  Evaluating similarity measures for emergent semantics of social tagging , 2009, WWW '09.

[22]  Kalyanmoy Deb,et al.  A fast and elitist multiobjective genetic algorithm: NSGA-II , 2002, IEEE Trans. Evol. Comput..

[23]  Ben He,et al.  Terrier : A High Performance and Scalable Information Retrieval Platform , 2022 .

[24]  Yiqun Liu,et al.  Automatic search engine performance evaluation with click-through data analysis , 2007, WWW '07.

[25]  Dan Klein,et al.  Evaluating strategies for similarity search on the web , 2002, WWW '02.

[26]  C. J. van Rijsbergen,et al.  Probabilistic models of information retrieval based on measuring the divergence from randomness , 2002, TOIS.

[27]  Filippo Menczer,et al.  GiveALink: mining a semantic network of bookmarks for web search and recommendation , 2005, LinkKDD '05.

[28]  Thomas M. Cover,et al.  Elements of Information Theory , 2005 .

[29]  A. Kandel Fuzzy Mathematical Techniques With Applications , 1986 .

[30]  Filippo Menczer,et al.  Algorithmic detection of semantic similarity , 2005, WWW '05.