Efficient reasoning about a robust XML key fragment

We review key constraints in the context of XML as introduced by Buneman et al. We demonstrate that: (1) one of the proposed inference rules is not sound in general, and (2) the inference rules are incomplete for XML key implication, even for nonempty sets of simple key paths. This shows, in contrast to earlier statements, that the axiomatizability of XML keys is still open, and that efficient algorithms for deciding their implication still need to be developed. Solutions to these problems have a wide range of applications, including consistency validation, XML schema design, data exchange and integration, consistent query answering, XML query optimization and rewriting, and indexing. In this article, we investigate the axiomatizability and implication problem for XML keys with nonempty sets of simple key paths. In particular, we propose a set of inference rules that is sound and complete for the implication of such XML keys. We demonstrate that this fragment is robust by showing the duality of XML key implication to the reachability problem of fixed nodes in a suitable digraph. This enables us to develop a quadratic-time algorithm for deciding implication, and shows that reasoning about this XML key fragment is practically efficient. XML applications can therefore be unlocked effectively, since they benefit not only from the XML keys that the data designer specifies explicitly but also from those that are implied by them.
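To illustrate the kind of test that implication is reduced to, the following is a minimal sketch of deciding reachability of a fixed node in a digraph by breadth-first search. The abstract does not give the construction of the digraph from the XML keys, so the adjacency-dictionary representation, the function name `reachable`, and the node labels below are assumptions made purely for illustration; this is not the authors' algorithm.

```python
from collections import deque


def reachable(graph, sources, target):
    """Return True if `target` can be reached from any node in `sources`.

    `graph` is a hypothetical adjacency dictionary mapping each node to an
    iterable of its successors. Nodes without outgoing edges may be omitted.
    """
    seen = set(sources)
    queue = deque(seen)
    while queue:
        node = queue.popleft()
        if node == target:
            return True
        for succ in graph.get(node, ()):
            if succ not in seen:
                # Mark on discovery so each node is enqueued at most once.
                seen.add(succ)
                queue.append(succ)
    return False


# Hypothetical example digraph; the labels carry no XML meaning.
example = {"a": ["b"], "b": ["c"], "d": ["a"]}
print(reachable(example, ["a"], "c"))
print(reachable(example, ["c"], "a"))
```

Breadth-first search visits each node and edge at most once, so the reachability test itself is linear in the size of the digraph.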
