Using regular tree automata as XML schemas

We address the problem of defining tight XML schemas and propose regular tree automata as a model for XML data. We show that the tree automata model is more powerful than XML DTDs and is closed under the main algebraic operations. We introduce an XML query algebra based on the tree automata model and discuss query optimization and query pruning techniques. Finally, we show how a tree automata schema can be converted into an XML DTD.
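To make the expressiveness claim concrete, the following is a minimal sketch of a bottom-up tree automaton used as a schema. It is an illustration only, not the construction from the paper: the rule encoding, the state names, the `dealer`/`used`/`new`/`ad` vocabulary, and the functions `states` and `accepts` are all assumptions made for this example. Each rule assigns a state to an element label when the states of its children match a regular expression. Two rules share the label `ad` but have different content models, which a DTD cannot express because a DTD ties each element name to a single content model.

```python
import re
from itertools import product

# A tree node is (label, [children]). Text content is ignored in this sketch.
# A rule is (state, label, content), where content is a regular expression
# over child state names, each name followed by a single space.
# All names below are hypothetical and chosen only for illustration.
RULES = [
    ("Model",  "model",  r""),
    ("Year",   "year",   r""),
    ("UsedAd", "ad",     r"Model Year "),   # ads for used cars need a year
    ("NewAd",  "ad",     r"Model "),        # ads for new cars have no year
    ("Used",   "used",   r"(UsedAd )*"),
    ("New",    "new",    r"(NewAd )*"),
    ("Dealer", "dealer", r"Used New "),
]

def states(node):
    """Return the set of states the automaton can assign to this node."""
    label, children = node
    child_states = [states(child) for child in children]
    result = set()
    for state, rule_label, content in RULES:
        if rule_label != label:
            continue
        # Try every way of picking one state per child (fine for small trees).
        for combo in product(*child_states):
            word = "".join(s + " " for s in combo)
            if re.fullmatch(content, word):
                result.add(state)
                break
    return result

def accepts(tree, final="Dealer"):
    """A tree is valid when its root can be assigned the final state."""
    return final in states(tree)

good = ("dealer", [
    ("used", [("ad", [("model", []), ("year", [])])]),
    ("new",  [("ad", [("model", [])])]),
])
# Same labels, but the used-car ad lacks a year and the new-car ad has one.
bad = ("dealer", [
    ("used", [("ad", [("model", [])])]),
    ("new",  [("ad", [("model", []), ("year", [])])]),
])

print(accepts(good), accepts(bad))
```

The brute-force enumeration over child state combinations keeps the sketch short; a practical implementation would run the content-model automaton over sets of child states instead.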
