Propagating functional dependencies with conditions

The dependency propagation problem is to determine, given a view defined on data sources and a set of dependencies on the sources, whether another dependency is guaranteed to hold on the view. This paper investigates dependency propagation for recently proposed conditional functional dependencies (CFDs). The need for this study is evident in data integration, exchange and cleaning since dependencies on data sources often only hold conditionally on the view. We investigate dependency propagation for views defined in various fragments of relational algebra, CFDs as view dependencies, and for source dependencies given as either CFDs or traditional functional dependencies (FDs). (a) We establish lower and upper bounds, all matching, ranging from PTIME to undecidable. These not only provide the first results for CFD propagation, but also extend the classical work of FD propagation by giving new complexity bounds in the presence of finite domains. (b) We provide the first algorithm for computing a minimal cover of all CFDs propagated via SPC views; the algorithm has the same complexity as one of the most efficient algorithms for computing a cover of FDs propagated via a projection view, despite the increased expressive power of CFDs and SPC views. (c) We experimentally verify that the algorithm is efficient.

[1]  Michael J. Maher Constrained Dependencies , 1995, Theor. Comput. Sci..

[2]  Michael J. Maher,et al.  Chasing constrained tuple-generating dependencies , 1996, PODS.

[3]  Georg Gottlob Computing covers for embedded functional dependencies , 1987, PODS '87.

[4]  Serge Abiteboul,et al.  Foundations of Databases , 1994 .

[5]  Ronald Fagin,et al.  Horn clauses and database dependencies , 1982, JACM.

[6]  Anthony C. Klug,et al.  Determining View dependencies using tableaux , 1982, TODS.

[7]  Alfred V. Aho,et al.  Equivalences Among Relational Expressions , 1979, SIAM J. Comput..

[8]  Shuai Ma,et al.  Extending Dependencies with Conditions , 2007, VLDB.

[9]  Maurizio Lenzerini,et al.  Data integration: a theoretical perspective , 2002, PODS.

[10]  Marianne Baudinet,et al.  Constraint-Generating Dependencies , 1994, PPCP.

[11]  Phokion G. Kolaitis Schema mappings, data exchange, and metadata management , 2005, PODS '05.

[12]  J. H. Jou,et al.  Succinctness in Dependency Systems , 1983, Theor. Comput. Sci..

[13]  J. D. Uiiman,et al.  Principles of Database Systems , 2004, PODS 2004.

[14]  Val Tannen,et al.  An Equational Chase for Path-Conjunctive Queries, Constraints, and Views , 1999, ICDT.

[15]  Seymour Ginsburg,et al.  Properties of functional-dependency families , 1982, JACM.

[16]  David Toman,et al.  On Keys and Functional Dependencies as First-Class Citizens in Description Logics , 2006, IJCAR.

[17]  FanWenfei,et al.  Conditional functional dependencies for capturing data inconsistencies , 2008 .

[18]  Paul De Bra,et al.  Conditional Dependencies for Horizontal Decompositions , 1983, ICALP.

[19]  David Toman,et al.  On reasoning about structural equality in XML: a description logic approach , 2003, Theor. Comput. Sci..

[20]  Mihalis Yannakakis,et al.  Equivalences Among Relational Expressions with the Union and Difference Operators , 1980, J. ACM.

[21]  Abraham Silberschatz,et al.  Database System Concepts , 1980 .

[22]  Tok Wang Ling,et al.  Extending and inferring functional dependencies in schema transformation , 2004, CIKM '04.

[23]  David S. Johnson,et al.  Computers and Intractability: A Guide to the Theory of NP-Completeness , 1978 .

[24]  Anthony C. Klug Calculating constraints on relational expression , 1980, TODS.

[25]  Wenfei Fan,et al.  Propagating XML constraints to relations , 2007, J. Comput. Syst. Sci..

[26]  Richard Hull Non-Finite Specifiability of Projections of Functional Dependency Families , 1985, Theor. Comput. Sci..