Functional dependencies on extended relations defined by regular languages

Functional dependency (FD) is one of the most analyzed integrity constraints for any data model. In the relational data model, FDs are defined in a natural way: the values of an attribute set Y depend on the values of another attribute set X, that is, “Y is a function of X”. FDs are well studied and are widely used in normalization theory. XML functional dependencies (XFD) can be defined in different ways and no universally best definition has been proposed. They are defined by very intricate concepts, and they are mostly based upon XML elements described by XML schema languages such as a DTD or an XML Schema definition. The instances of these elements are semi-structured tuples. A semi-structured tuple is an ordered list of (attribute: value) pairs. We may think of a tuple as a sentence of a formal language, where the values are the terminal symbols and the attribute names are the nonterminal symbols. In this way, the sequence of the attribute names is the left side of a production rule used to accept the next terminal symbol, that is, the next value of the tuple. So the list of values forms the sentence and the list of attributes forms the dual sentence. In this paper, we introduce the notion of the extended tuple as a sentence from a regular language generated by a grammar where the nonterminal symbols of the grammar are the attribute names of the tuple. Sets of extended tuples are the extended relations (relations are instances). We then introduce the dual language, which generates the tuple types allowed to occur in extended relations. We define functional dependencies over extended relations. The syntax of functional dependencies will be given on the graph of the finite state automaton accepting the regular language. Using this model, we can also handle extended relations generated by recursive regular expressions. The implication problem of our class of dependencies is decidable and finitely axiomatizable by a version of the Chase algorithm performed on the graph of the associated finite state automaton.

[1]  Jim Melton,et al.  XML schema , 2003, SGMD.

[2]  Carmem S. Hara,et al.  Reasoning about nested functional dependencies , 1999, PODS '99.

[3]  Wenfei Fan,et al.  Reasoning about keys for XML , 2003, Inf. Syst..

[4]  Sven Hartmann,et al.  More Functional Dependencies for XML , 2003, ADBIS.

[5]  Klaus-Dieter Schewe,et al.  Functional Dependencies over XML Documents with DTDs , 2005, Acta Cybern..

[6]  B. Watson A taxonomy of finite automata construction algorithms , 1993 .

[7]  Ping Yan,et al.  Mapping Relational Schemas to XML DTDs with Constraints , 2006, First International Multi-Symposiums on Computer and Computational Sciences (IMSCCS'06).

[8]  Sven Hartmann,et al.  Characterising nested database dependencies by fragments of propositional logic , 2008, Ann. Pure Appl. Log..

[9]  Murali Mani,et al.  Taxonomy of XML schema languages using formal language theory , 2005, TOIT.

[10]  Klaus-Dieter Schewe,et al.  Axiomatisations of functional dependencies in the presence of records, lists, sets and multisets , 2006, Theor. Comput. Sci..

[11]  J. Hopcroft,et al.  Reasoning about Xml Schema Languages Using Formal Language Theory , 2000 .

[12]  Sven Hartmann,et al.  Numerical constraints on XML data , 2010, Inf. Comput..

[13]  András A. Benczúr,et al.  Functional Dependencies on Extended Relations Defined by Regular Languages , 2012, FoIKS.

[14]  Paolo Atzeni,et al.  Functional Dependencies and Constraints on Null Values in Database Relations , 1986, Inf. Control..

[15]  Wenfei Fan,et al.  Keys for XML , 2002, Comput. Networks.

[16]  Ronald Fagin,et al.  Functional Dependencies in a Relational Data Base and Propositional Logic , 1977, IBM J. Res. Dev..

[17]  Junhu Wang,et al.  Removing XML Data Redundancies Using Functional and Equality-Generating Dependencies , 2005, ADC.

[18]  Filip Murlak,et al.  XML schema mappings , 2009, PODS.

[19]  Serge Abiteboul,et al.  Foundations of Databases , 1994 .

[20]  Djelloul Ziadi,et al.  Canonical derivatives, partial derivatives and finite automaton constructions , 2002, Theor. Comput. Sci..

[21]  Mukesh K. Mohania,et al.  On the equivalence between FDs in XML and FDs in relations , 2007, Acta Informatica.

[22]  Junhu Wang A Comparative Study of Functional Dependencies for XML , 2005, APWeb.

[23]  Chen Zhan-sheng,et al.  Strong functional dependencies and their application to normal forms in XML , 2005 .

[24]  Sven Hartmann,et al.  Solving the Implication Problem for XML Functional Dependencies with Properties , 2010, WoLLIC.

[25]  V. Glushkov THE ABSTRACT THEORY OF AUTOMATA , 1961 .

[26]  Jianxin Li,et al.  Holistic Constraint-Preserving Transformation from Relational Schema into XML Schema , 2008, DASFAA.

[27]  Cong Yu,et al.  XML schema refinement through redundancy detection and normalization , 2008, The VLDB Journal.

[28]  Gérard Berry,et al.  From Regular Expressions to Deterministic Automata , 1986, Theor. Comput. Sci..

[29]  Chengfei Liu,et al.  Constraint Preserving Transformation from Relational Schema to XML Schema , 2006, World Wide Web.

[30]  Susan B. Davidson,et al.  Constraint Preserving XML Storage in Relations , 2002 .

[31]  Marcelo Arenas,et al.  A normal form for XML documents , 2002, PODS '02.

[32]  Sven Hartmann,et al.  On the Existence of Armstrong Data Trees for XML Functional Dependencies , 2010, FoIKS.

[33]  Janusz A. Brzozowski,et al.  Derivatives of Regular Expressions , 1964, JACM.

[34]  Lucja Kot,et al.  Characterization of the Interaction of XML Functional Dependencies with DTDs , 2007, ICDT.

[35]  Patrick C. Fischer,et al.  Interactions between Dependencies and Nested Relational Structures , 1985, J. Comput. Syst. Sci..

[36]  Sven Hartmann,et al.  Efficient reasoning about a robust XML key fragment , 2009, TODS.

[37]  Cyril Nicaud,et al.  Average Analysis of Glushkov Automata under a BST-Like Model , 2010, FSTTCS.

[38]  Wenfei Fan,et al.  Propagating XML constraints to relations , 2007, J. Comput. Syst. Sci..

[39]  Klaus-Dieter Schewe,et al.  Counter-Free Keys and Functional Dependencies in Higher-Order Datamodels , 2006, Fundam. Informaticae.