Modeling Data Federations in ORM

Two major problems in constructing data federations (for example, data warehouses and database federations) concern achieving and maintaining consistency and a uniform representation of the data on the global level of the federation. The first step in creating uniform representations of data is known as data extraction, whereas data reconciliation is concerned with resolving data inconsistencies. Our approach to constructing a global conceptual schema as the result of integrating a collection of (semantically) heterogeneous component schemas is based on the concept of exact views. We show that a global schema constructed in terms of exact views integrates component schemas in such a way that the global schema is populated by exactly those instances allowed by the local schemas (and in special cases, also the other way around). In this sense, the global schema is equivalent to the set of component schemas from which the global schema is derived. This paper describes a modeling framework for data federations based on the Object-Role Modeling (ORM) approach. In particular, we show that we can represent exact views within ORM, providing the means to resolve in a combined setting data extraction and reconciliation problems on the global level of the federation.

[1]  Peter Wilson Information Modeling , 1987, IEEE Computer Graphics and Applications.

[2]  Il-Yeol Song,et al.  A UML profile for multidimensional modeling in data warehouses , 2006, Data Knowl. Eng..

[3]  Heinrich Hußmann,et al.  Using UML/OCL Constraints for Relational Database Design , 1999, UML.

[4]  David W. Embley,et al.  Automatic direct and indirect schema mapping: experiences and lessons learned , 2004, SGMD.

[5]  Gunter Saake,et al.  Global Extensional Assertions and Local Integrity Constraints in Federated Schemata , 2000, Inf. Syst..

[6]  Terry Halpin,et al.  ORM/NIAM Object-Role Modeling , 1998 .

[7]  Sjaak Brinkkemper,et al.  Conceptual Modelling in Information Systems Engineering , 2007 .

[8]  Herman Balsters,et al.  Modelling Database Views with Derived Classes in the UML/OCL-Framework , 2003, UML.

[9]  Martin Gogolla,et al.  Object Constraint Language , 2009, Encyclopedia of Database Systems.

[10]  Alon Y. Halevy,et al.  Answering queries using views: A survey , 2001, The VLDB Journal.

[11]  Serge Abiteboul,et al.  Foundations of Databases , 1994 .

[12]  Heinrich Hußmann,et al.  OCL as a Specification Language for Business Rules in Database Applications , 2001, UML.

[13]  Robert Meersman,et al.  On the Move to Meaningful Internet Systems 2004: CoopIS, DOA, and ODBASE , 2004, Lecture Notes in Computer Science.

[14]  Alberto Abelló,et al.  Research in data warehouse modeling and design: dead or alive? , 2006, DOLAP '06.

[15]  Mehdi Khosrow-Pour,et al.  Printed at: , 2011 .

[16]  Terry A. Halpin Information Modeling in UML and ORM , 2005, Encyclopedia of Information Science and Technology.

[17]  Heinrich C. Mayr,et al.  Information systems technology and its applications , 2003 .

[18]  Peter P. Chen The Entity-Relationship Model: Towards a unified view of Data , 1976 .

[19]  Terry A. Halpin,et al.  Business Rule Verbalization , 2004, ISTA.

[20]  Terry Halpin,et al.  Information modeling and relational databases: from conceptual analysis to logical design , 2001 .

[21]  Peter P. Chen The entity-relationship model: toward a unified view of data , 1975, VLDB '75.

[22]  Terry Halpin,et al.  Fact-Oriented Modeling: Past, Present and Future , 2007 .

[23]  Herman Balsters,et al.  Integration of Integrity Constraints in Federated Schemata Based on Tight Constraining , 2004, CoopIS/DOA/ODBASE.

[24]  Richard Hull,et al.  Managing semantic heterogeneity in databases: a theoretical prospective , 1997, PODS.

[25]  Erhard Rahm,et al.  A survey of approaches to automatic schema matching , 2001, The VLDB Journal.

[26]  Serge Abiteboul,et al.  Complexity of answering queries using materialized views , 1998, PODS.

[27]  Laura M. Haas,et al.  Schema Mapping as Query Discovery , 2000, VLDB.

[28]  Maurizio Lenzerini,et al.  Data integration: a theoretical perspective , 2002, PODS.

[29]  Gottfried Vossen,et al.  Schema Versioning in Data Warehouses , 2004, ER.

[30]  Michael R. Blaha,et al.  Object-Oriented Modeling and Design for Database Applications , 1997 .

[31]  Tatyana Yakhno,et al.  Advances in Information Systems , 2002, Lecture Notes in Computer Science.

[32]  Maurizio Lenzerini,et al.  Editorial: Introduction to: Data extraction, cleaning, and reconciliation a special issue of information systems, an international journal , 2001 .

[33]  Isabelle Comyn-Wattiau,et al.  A UML-based data warehouse design method , 2006, Decis. Support Syst..

[34]  Anneke Kleppe,et al.  The Object Constraint Language: Getting Your Models Ready for MDA , 2003 .

[35]  Terry Halpin,et al.  Subtyping Revisited , 2007, EMMSAD.

[36]  Terry A. Halpin,et al.  Orm 2 , 2005, OTM Workshops.

[37]  Herman Balsters,et al.  An Object-Oriented Framework for Reconciliation and Extraction in Heterogeneous Data Federations , 2004, ADVIS.

[38]  Zahir Tari,et al.  On the Move to Meaningful Internet Systems 2005: OTM 2005 Workshops, OTM Confederated International Workshops and Posters, AWeSOMe, CAMS, GADA, MIOS+INTEROP, ORM, PhDS, SeBGIS, SWWS, and WOSE 2005, Agia Napa, Cyprus, October 31 - November 4, 2005, Proceedings , 2005, OTM Workshops.

[39]  Gottfried Vossen,et al.  Schema versioning in data warehouses: Enabling cross-version querying via schema augmentation , 2006, Data Knowl. Eng..