XML to Relational Conversion Using Theory of Regular Tree Grammars

In this paper, we study the different steps of translation from XML to relational models, while maintaining semantic constraints. Our work is based on the theory of regular tree grammars, which provides a useful formal framework for understanding various aspects of XML schema languages. We first study two normal form representations for regular tree grammars. The first normal form representation, called NF1, is used in the two scenarios: (a) Several document validation algorithms use the NF1 representation as the first step in the validation process for efficiency reasons, and (b) NF1 representation can be used to check whether a given schema satisfies the structural constraints imposed by the schema language. The second normal form representation, called NF2, forms the basis for conversion of a set of type definitions in a schema language L1 that supports union types (e.g., XML-Schema), to a schema language L2 that does not support union types (e.g., SQL), and is used as the first step in our XML to relational conversion algorithm.

[1]  Hubert Comon,et al.  Tree automata techniques and applications , 1997 .

[2]  Daniela Florescu,et al.  Storing and Querying XML Data using an RDMBS , 1999, IEEE Data Eng. Bull..

[3]  Alin Deutsch,et al.  Storing semistructured data with STORED , 1999, SIGMOD '99.

[4]  Benjamin C. Pierce,et al.  XDuce: A Typed XML Processing Language (Preliminary Report) , 2000, WebDB.

[5]  Dongwon Lee,et al.  CPI: Constraints-Preserving Inlining algorithm for mapping XML DTD to relational schema , 2001, Data Knowl. Eng..

[6]  Benjamin C. Pierce,et al.  Xduce: a typed xml processing language , 1997 .

[7]  Arvind Malhotra,et al.  Xml schema part 2: datatypes , 1999 .

[8]  Dongwon Lee,et al.  Semantic Data Modeling Using XML Schemas , 2001, ER.

[9]  David J. DeWitt,et al.  Relational Databases for Querying XML Documents: Limitations and Opportunities , 1999, VLDB.

[10]  Dongwon Lee,et al.  NeT & CoT: translating relational schemas to XML schemas using semantic constraints , 2002, CIKM '02.

[11]  Dongwon Lee,et al.  Nesting-Based Relational-to-XML Schema Translation , 2001, International Workshop on the Web and Databases.

[12]  Wenfei Fan,et al.  Keys for XML , 2002, Comput. Networks.

[13]  Juliana Freire,et al.  From XML schema to relations: a cost-based approach to XML storage , 2002, Proceedings 18th International Conference on Data Engineering.

[14]  Masako Takahashi,et al.  Generalizations of Regular Sets and Their Applicatin to a Study of Context-Free Languages , 1975, Inf. Control..