On the Power of Tree-Walking Automata

Tree-walking automata (TWAs) recently received new attention in the fields of formal languages and databases. Towards a better understanding of their expressiveness, we characterize them in terms of transitive closure logic formulas in normal form. It is conjectured by Engelfriet and Hoogeboom that TWAs cannot define all regular tree languages, or equivalently, all of monadic second-order logic. We prove this conjecture for a restricted, but powerful, class of TWAs. In particular, we show that 1-bounded TWAs, that is TWAs that are only allowed to traverse every edge of the input tree at most once in every direction, cannot define all regular languages. We then extend this result to a class of TWAs that can simulate first-order logic (FO) and is capable of expressing properties not definable in FO extended with regular path expressions; the latter logic being a valid abstraction of current query languages for XML and semi-structured data.

[1]  Johann A. Makowsky,et al.  The Expressive Power of Transitive Closue and 2-way Multihead Automata , 1991, CSL.

[2]  Joost Engelfriet,et al.  Tree-Walking Pebble Automata , 1999, Jewels are Forever.

[3]  Frank Neven,et al.  A formal model for an expressive fragment of XSLT , 2000, Inf. Syst..

[4]  Dan Suciu,et al.  Typechecking for XML transformers , 2000, J. Comput. Syst. Sci..

[5]  Zoltán Fülöp,et al.  Domains of partial attributed tree transducers , 2000, Inf. Process. Lett..

[6]  Thomas Schwentick,et al.  Expressive and efficient pattern languages for tree-structured data (extended abstract) , 2000, PODS '00.

[7]  Derick Wood,et al.  Caterpillars, context, tree automata and tree pattern matching , 2000, Developments in Language Theory.

[8]  Ferenc Gécseg,et al.  Tree Languages , 1997, Handbook of Formal Languages.

[9]  Giora Slutzki,et al.  Parallel and Two-Way Automata on Directed Ordered Acyclic Graphs , 1981, Inf. Control..

[10]  Grzegorz Rozenberg,et al.  Handbook of Formal Languages , 1997, Springer Berlin Heidelberg.

[11]  Joost Engelfriet,et al.  A Comparison of Tree Transductions Defined by Monadic Second Order Logic and by Attribute Grammars , 2000, J. Comput. Syst. Sci..

[12]  H. Gaifman On Local and Non-Local Properties , 1982 .

[13]  Wolfgang Thomas,et al.  Languages, Automata, and Logic , 1997, Handbook of Formal Languages.

[14]  Frank Neven Design and analysis of query languages for structured documents. A formal and logical approach , 1999 .

[15]  Dan Suciu,et al.  Data on the Web: From Relations to Semistructured Data and XML , 1999 .

[16]  Jan-Pascal van Best,et al.  Trips on Trees , 1999, Acta Cybern..

[17]  Alfred V. Aho,et al.  Translations on a Context-Free Grammar , 1971, Inf. Control..