Updates, Schema Updates and Validation of XML Documents - Using Abstract State Machines with Automata-Defined States

The exact validation of streaming XML documents can be realised by using visibly push-down automata (VPA) that are defined by Extended Document Type Def- initions (EDTD). It is straightforward to represent such an automaton as an Abstract State Machine (ASM). In doing so we enable computations on abstract states that are defined by a certain class of automata, in this case VPAs. In this paper we elaborate on this approach by taking also updates of XML documents into account. In this way the ASM-approach combines vertical refinements, which first make states explicit and then instantiate by a specific EDTD, with horizontal refinements, which replace streaming XML documents by stored ones and then add updates. Furthermore, as the EDTD appears as part of the abstract state, updating it is another natural extension by hor- izontal refinement. In this way we obtain consistently integrated updates and schema updates for XML documents, which can even be extended to become fault-tolerant by taking at most k errors in the document into consideration. It further provides an example of ASM-based computation with automata-defined states.

[1]  Andreas Blass,et al.  Background of Computation , 2007, Bull. EATCS.

[2]  Bernhard Thalheim,et al.  Abstract State Services , 2008, ER Workshops.

[3]  Klaus-Dieter Schewe,et al.  Using Abstract State Machines for the Design of Multi-level Transaction Schedulers , 2009, Rigorous Methods for Software Construction and Analysis.

[4]  Egon Börger,et al.  Abstract State Machines , 2003 .

[5]  Gail E. Kaiser,et al.  DOM-based content extraction of HTML documents , 2003, WWW '03.

[6]  Klaus-Dieter Schewe,et al.  A Customised ASM Thesis for Database Transformations , 2010, Acta Cybern..

[7]  Yuri Gurevich,et al.  Sequential abstract-state machines capture sequential algorithms , 2000, TOCL.

[8]  Joachim Hammer,et al.  Updatex---an xquery-based language for processing updates in xml , 2004 .

[9]  Elliotte Rusty Harold Processing XML with Java: A Guide to Sax, Dom, Jdom, Jaxp, and Trax , 2002 .

[10]  Bernhard Thalheim,et al.  Validation of streaming XML documents with abstract state machines , 2008, iiWAS.

[11]  Qing Wang,et al.  XML Machines , 2009, ER Workshops.

[12]  Jan Gilbert Van Den Bussche Formal aspects of object identity in database manipulation , 1993 .

[13]  Andreas Blass,et al.  Abstract state machines capture parallel algorithms , 2003, TOCL.

[14]  Nikolai Tillmann,et al.  Partial updates , 2005, Theor. Comput. Sci..

[15]  Victor Vianu,et al.  Validating streaming XML documents , 2002, PODS.

[16]  Bernhard Thalheim,et al.  XML Database Transformations with Tree Updates , 2008, ABZ.

[17]  Egon Börger,et al.  The ASM Refinement Method , 2003, Formal Aspects of Computing.

[18]  Egon Börger,et al.  Abstract State Machines , 2003 .

[19]  Leonidas Fegaras The Joy of SAX , 2004, XIME-P.

[20]  Christopher Ré,et al.  XQuery!: An XML Query Language with Side Effects , 2006, EDBT Workshops.

[21]  Jan Van den Bussche,et al.  On the completeness of object-creating database transformation languages , 1997, JACM.

[22]  Hubert Comon,et al.  Tree automata techniques and applications , 1997 .

[23]  Klaus-Dieter Schewe,et al.  Dynamic Data Warehouse Design with Abstract State Machines , 2009, J. Univers. Comput. Sci..

[24]  Mahesh Viswanathan,et al.  Visibly pushdown automata for streaming XML , 2007, WWW '07.

[25]  Don Box,et al.  Essential XML: Beyond MarkUp , 2000 .

[26]  Gerhard Schellhorn,et al.  ASM Refinement Preserving Invariants , 2008, J. Univers. Comput. Sci..

[27]  Jörg Flum,et al.  Finite model theory , 1995, Perspectives in Mathematical Logic.

[28]  Klaus-Dieter Schewe,et al.  Towards a Logic for Abstract MetaFinite State Machines , 2008, FoIKS.

[29]  Charles Wallace,et al.  Formalizing Database Recovery , 1997, J. Univers. Comput. Sci..

[30]  Yannis Papakonstantinou,et al.  DTD inference for views of XML data , 2000, PODS.

[31]  J. Büchi Weak Second‐Order Arithmetic and Finite Automata , 1960 .

[32]  Andreas Blass,et al.  Background, Reserve, and Gandy Machines , 2000, CSL.

[33]  Yuri Gurevich,et al.  Metafinite Model Theory , 1994, Inf. Comput..

[34]  Jan Van den Bussche,et al.  Non-Deterministic Aspects of Object-Creating Database Transformations , 1992, FMLDO.

[35]  Egon Börger,et al.  Java and the Java Virtual Machine: Definition, Verification, Validation , 2001 .

[36]  Serge Abiteboul,et al.  Datalog Extensions for Database Queries and Updates , 1991, J. Comput. Syst. Sci..

[37]  Egon Börger,et al.  Modularizing Theorems for Software Product Lines: The Jbook Case Study , 2008, J. Univers. Comput. Sci..

[38]  Jan Van den Bussche,et al.  Abstract State Machines and Computationally Complete Query Languages , 2002, Inf. Comput..

[39]  Srinivasan Venkatesh,et al.  Visibly Pushdown Transducers for Approximate Validation of Streaming XML , 2008, FoIKS.

[40]  Klaus-Dieter Schewe,et al.  Using XML to Support Media Types , 2005, ISTA.

[41]  Erik Wilde Advanced XML Technologies , 2004, The Practical Handbook of Internet Computing.

[42]  Thomas Wilke,et al.  An Algebraic Characterization of Frontier Testable Tree Languages , 1996, Theor. Comput. Sci..

[43]  John Doner,et al.  Tree Acceptors and Some of Their Applications , 1970, J. Comput. Syst. Sci..

[44]  James W. Thatcher,et al.  Generalized finite automata theory with an application to a decision problem of second-order logic , 1968, Mathematical systems theory.

[45]  Paolo Manghi,et al.  Language Bindings to XML , 2003, IEEE Internet Comput..

[46]  Rajeev Alur,et al.  Visibly pushdown languages , 2004, STOC '04.

[47]  Jan Van den Bussche,et al.  Semi-determinism , 1992, PODS.

[48]  Balder ten Cate,et al.  XPath, transitive closure logic, and nested tree walking automata , 2008, PODS.

[49]  Murali Mani,et al.  Taxonomy of XML schema languages using formal language theory , 2005, TOIT.