A comprehensive database schema integration method based on the theory of formal concepts

Integrating heterogeneous database schemata is a major task in federated database design, where preexisting, heterogeneous database systems must be integrated virtually by providing a homogenized database interface. Most proposed schema integration methods suffer from very complex result schemata and from insufficient handling of extensional relations, i.e. of the way redundant data in the input systems is dealt with. Redundancy among the input systems may thus remain undetected and, hence, uncontrolled.

Our GIM (Generic Integration Model) method is based on the elegant and mathematically founded theory of formal concept analysis (FCA). The main idea is to integrate the schemata into one formal context, which is a binary relation between a set of attributes and a set of base extensions (sets of potential objects). To that context we apply an FCA algorithm to semi-automatically derive a concept lattice, which we interpret as an inheritance hierarchy of classes for a homogenized schema. Thus, the integration task following our method can be supported by tools.
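The step from a formal context to a concept lattice can be illustrated with a minimal sketch. It is not the GIM algorithm itself: it uses a naive textbook construction (closing the rows of the context under intersection), and the context, the base extension names, the attribute names, and the functions `derive_concepts` and `is_subclass` are all hypothetical, chosen only for illustration.

```python
def derive_concepts(context):
    """Return all formal concepts (extent, intent) of a formal context.

    `context` maps each base extension to the set of attributes it has.
    Naive construction: the intents are exactly the attribute sets obtained
    by intersecting rows of the context, plus the full attribute set.
    """
    all_attributes = frozenset().union(*context.values())
    intents = {all_attributes}
    for row in context.values():
        # Close the current set of intents under intersection with this row.
        intents |= {frozenset(row) & intent for intent in intents}

    concepts = []
    for intent in intents:
        # The extent collects every base extension that has all attributes of the intent.
        extent = frozenset(e for e, attrs in context.items() if intent <= attrs)
        concepts.append((extent, intent))
    # Most general concepts (largest extent) first.
    return sorted(concepts, key=lambda c: (-len(c[0]), sorted(c[1])))


def is_subclass(sub, sup):
    """A concept is a subclass of another if its extent is a proper subset."""
    return sub[0] < sup[0]


# Hypothetical integrated context: two input schemata (employees, students)
# whose extensions overlap, split into three disjoint base extensions.
context = {
    "only_employees": {"name", "salary"},
    "employees_and_students": {"name", "salary", "matric_no"},
    "only_students": {"name", "matric_no"},
}

concepts = derive_concepts(context)
for extent, intent in concepts:
    print(sorted(intent), "<-", sorted(extent))
```

Under these assumptions the context yields four concepts. Read as classes, they form a diamond-shaped hierarchy: a general class carrying only `name`, two subclasses that add `salary` and `matric_no` respectively, and a common subclass carrying all three attributes, whose extent is the overlapping base extension.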
