Our objective in the INEX 2012 campaign was to integrate the semantic tags and the linked data in our proximity retrieval model. This model was sucessfully used in previous INEX campaigns and obtained good results, particularly in 2007 with the second place in the Ad Hoc Track Focused Task [1], and in 2010 with the rst place in the Ad Hoc Track Relevant in Context Task [2] Though we had several discom tures with the collection because i) there were several versions of the collection, the last one available at the end of June, one week before the initial run submission deadline, ii) the di erent versions were di cult to follow because they were not clearly identi ed, iii) not every documents were well formed according to the XML format, iv) the provided DTD gives little information on the actual structure and its semantics, v) the documents contains many semantic annotations but the underlying ideas used to generate them are not documented making them di cult to apprehend. We present in section 2 how we processed the documents to alleviate the problems with the DTD. Thus we only have been able to do some basic experiments presented in section 3. In section 4 we present our work in progress.
[1]
Michel Beigbeder,et al.
ENSM-SE at INEX 2009 : Scoring with Proximity and Semantic Tag Information
,
2009,
INEX.
[2]
Andrew Trotman,et al.
Overview of the INEX 2007 Ad Hoc Track
,
2008,
INEX.
[3]
Helmut Schmidt,et al.
Probabilistic part-of-speech tagging using decision trees
,
1994
.
[4]
Anders Møller,et al.
Static Validation of XSL Transformations
,
2005
.
[5]
Michel Beigbeder.
Focused retrieval with proximity scoring
,
2010,
SAC '10.
[6]
Andrew Trotman,et al.
Overview of the INEX 2010 Ad Hoc Track
,
2010,
INEX.