Extended structural relevance framework: a framework for evaluating structured document retrieval

A structured document retrieval (SDR) system aims to minimize the effort users spend to locate relevant information by retrieving parts of documents. To evaluate the range of SDR tasks, from element to passage to tree retrieval, numerous task-specific measures have been proposed. As a result, SDR evaluation measures cannot easily be compared with one another or across tasks. In previous work, we defined the SDR task of tree retrieval, of which passage and element retrieval are special cases. In this paper, we examine tree retrieval in greater detail to identify the main components of SDR evaluation: relevance, navigation, and redundancy. Our goal is to evaluate SDR within a single probabilistic framework based on these components. This framework, called Extended Structural Relevance (ESR), calculates a user's expected gain in relevant information depending on whether it is seen via hits (relevant results retrieved), unseen via misses (relevant results not retrieved), or possibly seen via near-misses (relevant results accessed via navigation). We use these expectations as parameters to formulate evaluation measures for tree retrieval. We then demonstrate how existing task-specific measures, when viewed as tree retrieval, can be formulated, computed, and compared using our framework. Finally, we experimentally validate ESR across a range of SDR tasks.
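The core idea of the framework (expected gain weighted by the probability that each relevant result is actually seen) can be illustrated with a minimal sketch. This is not the paper's exact formulation: the function name, the tuple layout, and the three-way hit/near-miss/miss classification are simplifying assumptions for illustration.

```python
# Illustrative sketch only: expected gain over relevant document parts,
# weighted by the probability that the user sees each part.
def expected_gain(parts):
    """parts: list of (relevance, status, p_nav) tuples, where status is
    'hit' (retrieved), 'near-miss' (reachable via navigation with
    probability p_nav), or 'miss' (not retrieved, not reachable)."""
    total = 0.0
    for rel, status, p_nav in parts:
        if status == 'hit':
            p_seen = 1.0       # seen: relevant result retrieved
        elif status == 'near-miss':
            p_seen = p_nav     # possibly seen via navigation
        else:
            p_seen = 0.0       # unseen: relevant result missed
        total += rel * p_seen
    return total

# One hit, one near-miss reachable with probability 0.5, one miss:
# the hit contributes its full relevance, the near-miss half, the miss nothing.
gain = expected_gain([(1.0, 'hit', None),
                      (0.8, 'near-miss', 0.5),
                      (1.0, 'miss', None)])
```

Measures such as precision- and recall-style scores for tree retrieval can then be built from these expectations, e.g. by normalizing the expected gain against the gain of an ideal result list.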
