Representing and Querying Multi-dimensional Markup for Question Answering

This paper describes our approach to representing and querying multi-dimensional, possibly overlapping text annotations, as used in our question answering (QA) system. We use a system extending XQuery, the W3C-standard XML query language, with new axes that allow one to jump easily between different annotations of the same data. The new axes are formulated in terms of (partial) overlap and containment. All annotations are made using stand-off XML in a single document, which can be efficiently queried using the XQuery extension. The system is scalable to gigabytes of XML annotations. We show examples of the system in QA scenarios.

[1]  Torsten Grust,et al.  Pathfinder: XQuery - The Relational Way , 2005, VLDB.

[2]  W. Alink XIRAF: an XML Information Retrieval Approach to Digital Forensics , 2005 .

[3]  Alex Dekhtyar,et al.  A Framework for Management of Concurrent XML Markup , 2003, ER.

[4]  Wendell Piez Half-steps toward LMNL , 2004, Extreme Markup Languages®.

[5]  Forbes J. Burkowski Retrieval activities in a database consisting of heterogeneous collections of structured text , 1992, SIGIR '92.

[6]  C. M. Sperberg-McQueen,et al.  GODDAG: A Data Structure for Overlapping Hierarchies , 2000, DDEP/PODDP.

[7]  Peter Boncz,et al.  Pathfinder: relational XQuery over multi-gigabyte XML inputs in interactive time , 2005 .

[8]  Kenneth C. Litkowski,et al.  Use of Metadata for Question Answering and Novelty Tasks , 2003, TREC.

[9]  Steven J. DeRose,et al.  Markup Overlap: A Review and a Horse , 2004, Extreme Markup Languages®.

[10]  Gilad Mishne,et al.  The University of Amsterdam at QA@CLEF 2004 , 2003, CLEF.

[11]  Valentin Jijkoun,et al.  Towards an Offline XML-Based Strategy for Answering Questions , 2005, CLEF.

[12]  Peter Boncz,et al.  UvA-DARE ( Digital Academic Repository ) Monet ; a next-Generation DBMS Kernel For Query-Intensive Applications , 2007 .

[13]  Kenneth C. Litkowski,et al.  Question Answering Using XML-Tagged Documents , 2002, TREC.

[14]  Torsten. Grust,et al.  Accelerating XPath location steps , 2002, SIGMOD '02.

[15]  Alex Dekhtyar,et al.  XPath Extension for Querying Concurrent XML Markup , 2004 .