Validation of Expressive XML Keys with XML Schema and XQuery

The eXtensible Markup Language (XML) is the defacto industry standard for exchanging data on the Web and elsewhere. While the relational model of data enjoys a well-accepted denition of a key, several competing notions of keys exist in XML. These have complementary properties and therefore serve dierent applications domains. In a nutshell, XML keys allow us to capture important domain semantics in XML documents and thereby advance data processing in most applications. In this paper we propose how to validate XML documents against an expressive class of XML keys using XML Schema and XQuery, respectively. It is somewhat surprising how simple it is to express sophisticated notions of XML keys in these o-the-shelf tools. Experiments show that our simple validation technique works well for real-world data of reasonable size. For large-scale data, however, dedicated tools must be developed.

[1]  Sven Hartmann,et al.  Numerical constraints on XML data , 2010, Inf. Comput..

[2]  Wenfei Fan,et al.  Integrity constraints for XML , 2000, PODS '00.

[3]  Wenfei Fan,et al.  On the Complexity of Verifying Consistency of XML Specifications , 2008, SIAM J. Comput..

[4]  Sven Hartmann,et al.  Efficient reasoning about a robust XML key fragment , 2009, TODS.

[5]  Sven Hartmann,et al.  More Functional Dependencies for XML , 2003, ADBIS.

[6]  Alessandro Artale,et al.  Entity-from-Relationship Modelling , 2013, AI*IA.

[7]  Dan Suciu,et al.  On database theory and XML , 2001, SGMD.

[8]  Wenfei Fan,et al.  Keys for XML , 2001, WWW '01.

[9]  Wenfei Fan,et al.  On XML integrity constraints in the presence of DTDs , 2001, PODS '01.

[10]  Bernhard Thalheim,et al.  Dependencies in relational databases , 1991, Teubner-Texte zur Mathematik.

[11]  Wenfei Fan,et al.  Constraints for semistructured data and XML , 2001, SGMD.

[12]  Wenfei Fan,et al.  What's Hard about XML Schema Constraints? , 2002, DEXA.

[13]  Z. M. Sikora Entity Relationship Modelling , 1997 .

[14]  Susan B. Davidson,et al.  XKvalidator: a constraint validator for XML , 2002, CIKM '02.

[15]  Alin Deutsch,et al.  XML queries and constraints, containment and reformulation , 2005, Theor. Comput. Sci..

[16]  Jing Wang,et al.  On the Notion of an XML Key , 2008, SDKB.

[17]  Peter T. Wood,et al.  Containment for XPath Fragments under DTD Constraints , 2003, ICDT.

[18]  Wenfei Fan,et al.  Reasoning about Keys for XML , 2001, DBPL.

[19]  Shiwei Tang,et al.  Validating key constraints over XML document using XPath and structure checking , 2005, Future Gener. Comput. Syst..

[20]  David Beech,et al.  XML-Schema Part 1: Structures Second Edition , 2004 .

[21]  Marcelo Arenas,et al.  A normal form for XML documents , 2004, TODS.

[22]  Dan Suciu,et al.  Data on the Web: From Relations to Semistructured Data and XML , 1999 .

[23]  Scott Boag,et al.  XQuery 1.0 : An XML Query Language , 2007 .

[24]  Béatrice Bouchou-Markhoff,et al.  Tree Automata to Verify XML Key Constraints , 2003, WebDB.

[25]  C. M. Sperberg-McQueen,et al.  W3C XML Schema Definition Language (XSD) 1.1 Part 1: Structures , 2012 .

[26]  Sven Hartmann,et al.  Unlocking Keys for XML Trees , 2007, ICDT.

[27]  Mukesh K. Mohania,et al.  On the equivalence between FDs in XML and FDs in relations , 2007, Acta Informatica.

[28]  Giuseppe Di Battista,et al.  26 Computer Networks , 2004 .

[29]  Dan Suciu,et al.  Containment and equivalence for a fragment of XPath , 2004, JACM.

[30]  Béatrice Bouchou-Markhoff,et al.  Incremental Constraint Checking for XML Documents , 2004, XSym.

[31]  Serge Abiteboul,et al.  Foundations of Databases , 1994 .

[32]  Michael Schrefl,et al.  Keys in XML: Capturing Identification and Uniqueness , 2009, WISE.

[33]  Chengfei Liu,et al.  Strong functional dependencies and their application to normal forms in XML , 2004, TODS.

[34]  Cong Yu,et al.  XML schema refinement through redundancy detection and normalization , 2008, The VLDB Journal.

[35]  Steven J. DeRose,et al.  XML Path Language (XPath) Version 1.0 , 1999 .

[36]  Thomas Schwentick,et al.  On the complexity of XPath containment in the presence of disjunction, DTDs, and variables , 2006, Log. Methods Comput. Sci..

[37]  Gabriel M. Kuper,et al.  Structural properties of XPath fragments , 2003, Theor. Comput. Sci..

[38]  Shiwei Tang,et al.  Extracting Key Value and Checking Structural Constraints for Validating XML Key Constraints , 2004, WAIM.

[39]  Wenfei Fan,et al.  Path Constraints in Semistructured Databases , 2000, J. Comput. Syst. Sci..