Efficient Incremental Validation of XML Documents After Composite Updates

We describe an efficient method for the incremental validation of XML documents after composite updates. We introduce the class of Bounded-Edit (BE) DTDs and XML Schemas, and give a simple incremental revalidation algorithm that yields optimal performance for them, in the sense that its time complexity is linear in the number of operations in the update. We give extensive experimental results showing that our algorithm exhibits excellent scalability. Finally, we provide a statistical analysis of over 250 DTDs and XML Schema specifications found on the Web, showing that over 99% of them are in fact in BE.

[1]  Yannis Papakonstantinou,et al.  DTD inference for views of XML data , 2000, PODS.

[2]  Grzegorz Rozenberg,et al.  Handbook of Formal Languages , 1997, Springer Berlin Heidelberg.

[3]  Béatrice Bouchou-Markhoff,et al.  Updates and Incremental Validation of XML Documents , 2003, DBPL.

[4]  Wenfei Fan,et al.  Query Optimization for Semistructured Data Using Path Constraints in a Deterministic Data Model , 1999, DBPL.

[5]  Frank Neven,et al.  DTDs versus XML schema: a practical study , 2004, WebDB '04.

[6]  Neil Immerman,et al.  Dyn-FO: A Parallel, Dynamic Complexity Class , 1997, J. Comput. Syst. Sci..

[7]  Limsoon Wong,et al.  On the Power of Incremental Evaluation in SQL-Like Languages , 1999, DBPL.

[8]  C. M. Sperberg-McQueen,et al.  Extensible Markup Language (XML) , 1997, World Wide Web J..

[9]  J W Ballard,et al.  Data on the web? , 1995, Science.

[10]  Luc Segoufin,et al.  Typing and querying XML documents: some complexity bounds , 2003, PODS.

[11]  Derick Wood,et al.  One-Unambiguous Regular Languages , 1998, Inf. Comput..

[12]  Gabriel M. Kuper,et al.  Structural Properties of XPath Fragments , 2003, ICDT.

[13]  Elke A. Rundensteiner,et al.  Consistently updating XML documents using incremental constraint check queries , 2002, WIDM '02.

[14]  Sebastian Maneth,et al.  Efficient Memory Representation of XML Documents , 2005, DBPL.

[15]  Yannis Papakonstantinou,et al.  Incremental validation of XML documents , 2003, TODS.

[16]  Ioana Manolescu,et al.  The XML benchmark project , 2001 .

[17]  Rudolf Schmid,et al.  Organization for the advancement of structured information standards , 2002 .

[18]  Denilson Barbosa,et al.  Efficient incremental validation of XML documents , 2004, Proceedings. 20th International Conference on Data Engineering.