This paper describes a system for interactive access to Extensible Markup Language (XML) documents for the purpose of information retrieval (IR). The system has two main modules: the query editor/processor, where users specify their information needs, and the document analyser, which performs the operations required for query evaluation. Queries are constructed interactively by manipulating intermediate results while the query is being edited and evaluated. Queries are written in IXDIRQL, a query language that extends XPath with selection operations for extracting the subset of elements of interest from intermediate results, which helps the user build queries that retrieve the desired results. IXDIRQL also supports the textual similarity search of traditional IR, which yields a ranked list of elements. To support syntax-directed editing of queries and their incremental evaluation during this iterative process, IXDIRQL is specified by an attribute grammar (AG). This formalisation makes it possible to produce the working environment with an automatic generator. Documents in the system are also represented by attribute grammars. This representation defines structure, content and operations over documents uniformly, which improves interoperability between components. The system has been tested with real users to check that it behaves correctly and that queries using selection operations can be specified correctly.
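The interaction pattern summarised above (evaluate a path, select a subset of the intermediate results, then rank the selection by textual similarity) can be illustrated with a minimal Python sketch. This is not IXDIRQL syntax and not the system's implementation: the sample document, the helper names `tokens`, `cosine` and `rank`, the use of list slicing as a stand-in for a selection operation, and cosine similarity over raw term frequencies as the similarity measure are all assumptions made for illustration.

```python
import math
import re
import xml.etree.ElementTree as ET
from collections import Counter

# Hypothetical sample document, invented for this sketch.
DOC = """<library>
  <book><title>XML retrieval</title><abstract>Ranking XML elements by textual similarity.</abstract></book>
  <book><title>Compilers</title><abstract>Attribute grammars for incremental evaluation.</abstract></book>
  <book><title>Cooking</title><abstract>Recipes for soups and stews.</abstract></book>
</library>"""


def tokens(text):
    """Lower-case the text and split it into alphabetic terms."""
    return re.findall(r"[a-z]+", text.lower())


def cosine(a, b):
    """Cosine similarity between two term-frequency vectors (Counters)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = math.sqrt(sum(v * v for v in a.values())) * math.sqrt(
        sum(v * v for v in b.values())
    )
    return dot / norm if norm else 0.0


def rank(elements, query):
    """Return (score, element) pairs with non-zero score, best first."""
    q = Counter(tokens(query))
    scored = [(cosine(Counter(tokens("".join(e.itertext()))), q), e) for e in elements]
    return sorted((p for p in scored if p[0] > 0), key=lambda p: p[0], reverse=True)


root = ET.fromstring(DOC)

# Step 1: evaluate a structural path; the result is an intermediate element list.
intermediate = root.findall("./book/abstract")

# Step 2: selection; the user keeps only the first two intermediate results
# (slicing is an assumed stand-in for a selection operation).
selected = intermediate[:2]

# Step 3: textual similarity search over the selected elements.
ranked = rank(selected, "XML similarity ranking")

for score, element in ranked:
    print(f"{score:.3f}  {element.text}")
```

In this sketch each step consumes the result of the previous one, so a user could inspect `intermediate` before deciding what to keep in `selected`, which mirrors the iterative query construction described above.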