Boolean Constraints for XML Modeling

The study of integrity constraints has been identified as one of the major challenges in XML database research. The main difficulty is finding a balance between the expressiveness and the existence of automated reasoning tools for different classes of constraints. In this paper we define Boolean constraints for XML by exploring homomorphisms between XML data trees and XML schema graphs. These constraints are naturally exhibited by XML data due to its nested structure. We demonstrate, in contrast to many other proposals, that reasoning about Boolean constraints is well-founded. That is, we establish that the interaction between Boolean constraints corresponds precisely to the logical implication of Boolean propositional formulae. Therefore, our Boolean constraints do not only capture valuable semantic information about XML data but also permit reasoning support by off-the-shelf SAT solvers. Finally, we identify a few subclasses of Boolean constraints for which the implication problem can be solved efficiently.

[1]  Stefano Paraboschi,et al.  XML: Current Developments and Future Challenges for the Database Community , 2000, EDBT.

[2]  Marcelo Arenas,et al.  A normal form for XML documents , 2004, TODS.

[3]  Tok Wang Ling,et al.  Designing Functional Dependencies for XML , 2002, EDBT.

[4]  Junhu Wang A Comparative Study of Functional Dependencies for XML , 2005, APWeb.

[5]  Jennifer Widom Data Management for XML: Research Directions , 1999, IEEE Data Eng. Bull..

[6]  Dan Suciu,et al.  On database theory and XML , 2001, SGMD.

[7]  Chengfei Liu,et al.  Strong functional dependencies and their application to normal forms in XML , 2004, TODS.

[8]  Ronald Fagin,et al.  Functional Dependencies in a Relational Data Base and Propositional Logic , 1977, IBM J. Res. Dev..

[9]  Stephen A. Cook,et al.  The complexity of theorem-proving procedures , 1971, STOC.

[10]  Jixue Liu,et al.  Completeness and Decidability Properties for Functional Dependencies in XML , 2003, ArXiv.

[11]  Ronald Fagin,et al.  An Equivalence Between Relational Database Dependencies and a Fragment of Propositional Logic , 1981, JACM.

[12]  Zahir Tari,et al.  Object normal forms and dependency constraints for object-oriented schemata , 1997, TODS.

[13]  Junhu Wang,et al.  Removing XML Data Redundancies Using Functional and Equality-Generating Dependencies , 2005, ADC.

[14]  Herbert B. Enderton,et al.  A mathematical introduction to logic , 1972 .

[15]  Robert E. Tarjan,et al.  A Linear-Time Algorithm for Testing the Truth of Certain Quantified Boolean Formulas , 1979, Inf. Process. Lett..

[16]  I. Anderson Combinatorics of Finite Sets , 1987 .

[17]  Grant E. Weddell,et al.  Reasoning about functional dependencies generalized for semantic data models , 1992, TODS.

[18]  Sven Hartmann,et al.  Efficient Reasoning About XFDs with Pre-image Semantics , 2007, DASFAA.

[19]  Zvi Galil,et al.  An Almost Linear-Time Algorithm for Computing a Dependency Basis in a Relational Database , 1982, JACM.

[20]  Ping Yan,et al.  Functional Dependencies in XML Documents , 2006, APWeb Workshops.

[21]  Solmaz Kolahi,et al.  On redundancy vs dependency preservation in normalization: an information-theoretic study of 3NF , 2006, PODS '06.

[22]  Bernhard Thalheim,et al.  Dependencies in relational databases , 1991, Teubner-Texte zur Mathematik.

[23]  Sven Hartmann,et al.  More Functional Dependencies for XML , 2003, ADBIS.

[24]  Alon Itai,et al.  On the complexity of time table and multi-commodity flow problems , 1975, 16th Annual Symposium on Foundations of Computer Science (sfcs 1975).

[25]  Jean H. Gallier,et al.  Linear-Time Algorithms for Testing the Satisfiability of Propositional Horn Formulae , 1984, J. Log. Program..

[26]  Dieter Jungnickel,et al.  Graphs, Networks, and Algorithms , 1980 .

[27]  C. M. Sperberg-McQueen,et al.  Extensible markup language , 1997 .