XML Schema, Tree Logic and Sheaves Automata

XML documents, and other forms of semi-structured data, may be roughly described as edge labeled trees; it is therefore natural to use tree automata to reason on them. This idea has already been successfully applied in the context of Document Type Definition (DTD), the simplest standard for defining XML documents validity, but additional work is needed to take into account XML Schema, a more advanced standard, for which regular tree automata are not satisfactory. In this paper, we define a tree logic that directly embeds XML Schema as a plain subset as well as a new class of automata for unranked trees, used to decide this logic, which is well-suited to the processing of XML documents and schemas.

[1]  Hitoshi Ohsaki,et al.  Beyond Regularity: Equational Tree Automata for Associative and Commutative Theories , 2001, CSL.

[2]  Luca Cardelli,et al.  A Query Language Based on the Ambient Logic , 2001, SEBD.

[3]  Philip Wadler,et al.  MSL — a model for W3C XML schema , 2001, WWW '01.

[4]  Bruno Courcelle,et al.  The Monadic Second-Order Logic of Graphs. I. Recognizable Sets of Finite Graphs , 1990, Inf. Comput..

[5]  Thomas Schwentick,et al.  Numerical document queries , 2003, PODS.

[6]  Joachim Niehren,et al.  Feature Automata and Recognizable Sets of Feature Trees , 1993, TAPSOFT.

[7]  James W. Thatcher,et al.  Characterizing Derivation Trees of Context-Free Grammars through a Generalization of Finite Automata Theory , 1967, J. Comput. Syst. Sci..

[8]  Benjamin C. Pierce,et al.  Regular expression pattern matching for XML , 2003, POPL '01.

[9]  M. Fischer,et al.  SUPER-EXPONENTIAL COMPLEXITY OF PRESBURGER ARITHMETIC , 1974 .

[10]  Makoto Murata,et al.  Extended path expressions of XML , 2001, PODS.

[11]  Denis Lugiez,et al.  Multitree automata that count , 2005, Theor. Comput. Sci..

[12]  Thomas Schwentick,et al.  Counting in Trees for Free , 2004, ICALP.

[13]  Luca Cardelli,et al.  Anytime, anywhere: modal logics for mobile ambients , 2000, POPL '00.

[14]  Nicolás Marín,et al.  Review of Data on the Web: from relational to semistructured data and XML by Serge Abiteboul, Peter Buneman, and Dan Suciu. Morgan Kaufmann 1999. , 2003, SGMD.

[15]  Dan Suciu,et al.  Data on the Web: From Relations to Semistructured Data and XML , 1999 .

[16]  Felix Klaedtke,et al.  Monadic Second-Order Logics with Cardinalities , 2003, ICALP.

[17]  Frank Neven,et al.  Automata theory for XML researchers , 2002, SGMD.