DCU and ISI@INEX 2010: Adhoc and Data-Centric Tracks

We describe the participation of Dublin City University (DCU) and Indian Statistical Institute (ISI) in INEX 2010 for the Ad-hoc and Data Centric tracks. The main contributions of this paper are: i) a simplified version of Hierarchical Language Model (HLM), which involves scoring XML elements with a combined probability of generating the given query from itself and the top level articl node, is shown to outperform the baselines of LM and VSM scoring of XML elements; ii) the Expectation Maximization (EM) feedback in LM is shown to be the most effective on the domain specific collection of IMDB; iii) automated removal of sentences indicating aspects of irrelevance from the narratives of INEX ad hoc topics is shown to improve retrieval effectiveness.

[1]  Djoerd Hiemstra,et al.  Using language models for information retrieval , 2001 .

[2]  Andrew Trotman,et al.  Advances in Focused Retrieval, 7th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2008, Dagstuhl Castle, Germany, December 15-18, 2008. Revised and Selected Papers , 2009, INEX.

[3]  Andrew Trotman,et al.  Focused Retrieval and Evaluation, 8th International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2009, Brisbane, Australia, December 7-9, 2009, Revised and Selected Papers , 2010, INEX.

[4]  Andrew Trotman,et al.  Overview of the INEX 2008 Ad Hoc Track , 2008, INEX.

[5]  Amitabh Kumar Singhal,et al.  Term Weighting Revisited , 1996 .

[6]  M. de Rijke,et al.  An Element-based Approach to XML Retrieval , 2004 .

[7]  Sukomal Pal,et al.  Using Negative Information in Search , 2011, 2011 Second International Conference on Emerging Applications of Information Technology.

[8]  Sukomal Pal,et al.  Parameter Tuning in Pivoted Normalization for XML Retrieval: ISI@INEX09 Adhoc Focused Task , 2009, INEX.

[9]  Julie Beth Lovins,et al.  Development of a stemming algorithm , 1968, Mech. Transl. Comput. Linguistics.

[10]  Mounia Lalmas,et al.  Advances in XML Information Retrieval, Third International Workshop of the Initiative for the Evaluation of XML Retrieval, INEX 2004, Dagstuhl Castle, Germany, December 6-8, 2004, Revised Selected Papers , 2005, INEX.

[11]  James P. Callan,et al.  Hierarchical Language Models for XML Component Retrieval , 2004, INEX.