Typechecking for Semistructured Data

Semistructured data is used in data exchange applications, like B2B and EAI, and represents data in a flexible format. Every data item has a unique tag (also called label), and data items can be nested. Formally, a semistructured data instance is a tree whose nodes are labeled with tags and leaves are labeled with data values. XML [Con98] is a standard syntax for describing such trees; Fig. 1 shows a tree representing a semistructured data instance and its XML syntax. We will refer interchangeably to semistructured data instances as trees or XML trees.

[1]  Frank Neven,et al.  A formal model for an expressive fragment of XSLT , 2002, Inf. Syst..

[2]  Makoto Murata,et al.  Transformation of Documents and Schemas by Patterns and Contextual Conditions , 1996, PODP.

[3]  Wolfgang Thomas,et al.  Automata on Infinite Objects , 1991, Handbook of Theoretical Computer Science, Volume B: Formal Models and Sematics.

[4]  Jan van Leeuwen,et al.  Formal models and semantics , 1990 .

[5]  Catriel Beeri,et al.  Schemas for Integration and Translation of Structured and Semi-structured Data , 1999, ICDT.

[6]  Benjamin C. Pierce,et al.  XDuce: A Typed XML Processing Language (Preliminary Report) , 2000, WebDB.

[7]  Yannis Papakonstantinou,et al.  DTD inference for views of XML data , 2000, PODS.

[8]  Dan Suciu,et al.  SilkRoute: trading between relations and XML , 2000, Comput. Networks.

[9]  Philip Wadler,et al.  MSL: a model for W3C XML Schema , 2002, Comput. Networks.

[10]  Noga Alon,et al.  Typechecking XML views of relational databases , 2001, Proceedings 16th Annual IEEE Symposium on Logic in Computer Science.

[11]  Roy Goldman,et al.  DataGuides: Enabling Query Formulation and Optimization in Semistructured Databases , 1997, VLDB.

[12]  Benjamin C. Pierce,et al.  Regular expression pattern matching for XML , 2003, POPL '01.

[13]  Hamid Pirahesh,et al.  Efficiently publishing relational data as XML documents , 2001, The VLDB Journal.

[14]  Dan Suciu,et al.  Data on the Web: From Relations to Semistructured Data and XML , 1999 .

[15]  John C. Mitchell,et al.  Foundations for programming languages , 1996, Foundation of computing series.

[16]  Dan Suciu,et al.  Adding Structure to Unstructured Data , 1997, ICDT.

[17]  C. M. Sperberg-McQueen,et al.  Extensible Markup Language (XML) , 1997, World Wide Web J..

[18]  Helmut Seidl Deciding Equivalence of Finite Tree Automata , 1990, SIAM J. Comput..