Discovering Functional Dependencies from Ontologies

Discovering functional dependencies is a fundamental step in the design of relational databases and in most system reengineering processes. Typically, this task has been performed over relational databases, at either the logical or the physical level. Approaches working at the logical level often make unrealistic assumptions, such as completeness of the data structures or similar names for semantically related attributes. Approaches working at the physical level are computationally expensive, degrade as the number of attributes or instances grows, and cannot tolerate erroneous data. To overcome these limitations, while also better capturing data dependencies, we propose to rely instead on a conceptual representation of the domain of interest, specified in ER and formalized through a domain ontology expressed in the DL-Lite Description Logic. We propose an algorithm that discovers functional dependencies from the domain ontology by exploiting the inference capabilities of DL-Lite, thus fully taking into account the semantics of the domain. We also provide an evaluation of our approach in a real-world scenario.
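
To make the limitations of physical-level discovery concrete, the following minimal Python sketch checks candidate dependencies directly against stored tuples. It is not the ontology-based algorithm proposed here; it is a naive instance-level baseline, and the names `fd_holds`, `discover_fds`, `max_lhs` and the `rentals` table are hypothetical, introduced only for illustration.

```python
from itertools import combinations


def fd_holds(rows, lhs, rhs):
    """Return True if the dependency lhs -> rhs holds in every row."""
    seen = {}
    for row in rows:
        key = tuple(row[a] for a in lhs)
        # The first value seen for a key is stored; any later
        # disagreement means the dependency is violated.
        if seen.setdefault(key, row[rhs]) != row[rhs]:
            return False
    return True


def discover_fds(rows, attributes, max_lhs=2):
    """Enumerate minimal dependencies with at most max_lhs left-hand attributes."""
    found = []
    for rhs in attributes:
        others = [a for a in attributes if a != rhs]
        for size in range(1, max_lhs + 1):
            # The number of candidates grows combinatorially with the
            # number of attributes, which is the cost noted above.
            for lhs in combinations(others, size):
                # Skip candidates that extend an already found dependency.
                if any(r == rhs and set(p) <= set(lhs) for p, r in found):
                    continue
                if fd_holds(rows, lhs, rhs):
                    found.append((lhs, rhs))
    return found


# Hypothetical car rental data, used only for this illustration.
rentals = [
    {"car": "C1", "model": "Fiat 500", "branch": "Rome"},
    {"car": "C2", "model": "Fiat 500", "branch": "Milan"},
    {"car": "C3", "model": "Golf", "branch": "Rome"},
]

fds = discover_fds(rentals, ["car", "model", "branch"])

# A single erroneous tuple is enough to hide the dependency car -> model.
dirty = rentals + [{"car": "C1", "model": "Golf", "branch": "Rome"}]
```

On the clean table the sketch finds `car -> model` and `car -> branch`, but with the one conflicting tuple in `dirty` the check `fd_holds(dirty, ("car",), "model")` fails, which illustrates why instance-level discovery is sensitive to erroneous data.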
