XPath Query Relaxation through Rewriting Rules

Query relaxation is the process of weakening a query into a more general one, and it is frequently used to support approximate query answering. This paper investigates rewriting systems for a wide fragment of XPath that accomplish query relaxation by applying simple rewriting rules, which transform navigational axes and node tests into relaxed ones. Specifically, a general yet simple form of rewriting rule is considered, which subsumes the forms adopted in several rewriting systems for approximate XPath query answering. The expressiveness of rewriting systems based on this form of rule is characterized in terms of their ability to transform a query into every more general formulation. It is shown that traditional rewriting systems are incomplete not only w.r.t. containment, but also w.r.t. the stricter relation known as containment by homomorphism. This limitation is overcome by defining a set R* of rewriting rules that have the same simple form as traditional ones, but are expressive enough to capture at least containment by homomorphism. Finally, an algorithm is proposed that exploits R* to provide approximate answers to queries, along with a measure of their approximation degree.
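
To make the idea of relaxing axes and node tests concrete, the following is a minimal sketch and not the paper's rule set R* or its algorithm. It assumes a query is a simple path of (axis, node test) steps and uses two illustrative rules: child may be relaxed to descendant, and a label may be relaxed to the wildcard. The names `AXIS_RULES`, `relax_once`, `relaxations`, and `to_xpath` are hypothetical, and the count of rule applications is only a crude stand-in for an approximation degree.

```python
from collections import deque

# Illustrative axis rule only (an assumption, not the paper's R*):
# a child step may be weakened to a descendant step.
AXIS_RULES = {"child": "descendant"}


def relax_once(query):
    """Yield every query obtained by applying one rule at one step."""
    for i, (axis, test) in enumerate(query):
        if axis in AXIS_RULES:
            # Relax the navigational axis of step i.
            yield query[:i] + ((AXIS_RULES[axis], test),) + query[i + 1:]
        if test != "*":
            # Relax the node test of step i to the wildcard.
            yield query[:i] + ((axis, "*"),) + query[i + 1:]


def relaxations(query):
    """Map each reachable query to the fewest rule applications needed."""
    dist = {query: 0}
    queue = deque([query])
    while queue:
        current = queue.popleft()
        for relaxed in relax_once(current):
            if relaxed not in dist:
                dist[relaxed] = dist[current] + 1
                queue.append(relaxed)
    return dist


def to_xpath(query):
    """Render a step tuple in abbreviated XPath syntax."""
    return "".join(("/" if axis == "child" else "//") + test
                   for axis, test in query)


if __name__ == "__main__":
    q = (("child", "a"), ("child", "b"))  # the query /a/b
    for relaxed, steps in sorted(relaxations(q).items(), key=lambda kv: kv[1]):
        print(steps, to_xpath(relaxed))
```

Each rule here replaces a step with one that matches a superset of nodes, so every rewritten query contains the original. The abstract's point is that rule sets of this traditional kind cannot reach every more general query, which is the gap R* is defined to close.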
