Consistency of XML Specifications

Specifications of XML documents typically consist of typing information (for example, a DTD) and integrity constraints (for example, keys and foreign keys). We show that combining the two may lead to seemingly reasonable specifications that are nevertheless inconsistent: no XML document both conforms to the DTD and satisfies the constraints. We then survey results on the complexity of consistency checking and show that, depending on the classes of DTDs and constraints involved, it ranges from linear time to undecidable. Furthermore, we show that for some of the most common classes of specifications, checking consistency is intractable.
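
To make the notion of an inconsistent specification concrete, here is a minimal sketch in Python. The specification it encodes is a hypothetical one chosen for illustration, not taken from the text above: a DTD with productions teachers -> teacher+, teacher -> teach, research, and teach -> subject, subject, together with a key on the attribute subject.taught_by and a foreign key from subject.taught_by to teacher.name. The function name and the bounded search are likewise assumptions of this sketch; it only compares element counts and is not one of the decision procedures the survey covers.

```python
from typing import List


def consistent_teacher_counts(max_teachers: int) -> List[int]:
    """Return the teacher counts in 1..max_teachers that satisfy both the
    hypothetical DTD and the hypothetical key / foreign-key constraints."""
    feasible = []
    # DTD: teachers -> teacher+  (at least one teacher element)
    for teachers in range(1, max_teachers + 1):
        # DTD: teacher -> teach, research and teach -> subject, subject,
        # so every teacher contributes exactly two subject elements.
        subjects = 2 * teachers
        # Key: all subject.taught_by values are pairwise distinct, so there
        # are `subjects` distinct values.
        # Foreign key: each of those values is some teacher.name, and there
        # are at most `teachers` distinct names.
        if subjects <= teachers:
            feasible.append(teachers)
    return feasible


print(consistent_teacher_counts(1000))  # prints []
```

The empty result reflects the underlying arithmetic: the DTD forces twice as many subject elements as teacher elements, while the constraints force at most as many, and both cannot hold once at least one teacher exists. The bounded search is only a demonstration; the inequality itself rules out every positive count.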
