Data Integration — Problems, Approaches, and Perspectives

Data integration is one of the older research fields in the database area and has emerged shortly after database systems were first introduced into the business world. In this paper, we briefly introduce the problem of integration and, based on an architectural perspective, give an overview of approaches to address the integration issue. We discuss the evolution from structural to semantic integration and shortly present our own research in the SIRUP (Semantic Integration Reflecting User-specific semantic Perspectives) approach. Finally, an outlook to challenging areas of future research in the realm of data integration is given.

[1]  Klaus R. Dittrich,et al.  User-Specific Semantic Integration of Heterogeneous Data: The SIRUP Approach , 2004, ICSNW.

[2]  Craig A. Knoblock,et al.  Retrieving and Integrating Data from Multiple Information Sources , 1993, Int. J. Cooperative Inf. Syst..

[3]  Klaus R. Dittrich,et al.  An Approach for Building Secure Database Federations , 1994, VLDB.

[4]  Chris Clifton,et al.  Privacy-preserving data integration and sharing , 2004, DMKD '04.

[5]  Jennifer Widom,et al.  The TSIMMIS Project: Integration of Heterogeneous Information Sources , 1994, IPSJ.

[6]  Marianne Winslett Databases in Virtual Organizations: a collective interview and call for researchers , 2005, SGMD.

[7]  Klaus R. Dittrich,et al.  Three decades of data integration - All problems solved? , 2004, IFIP Congress Topical Sessions.

[8]  Vipul Kashyap,et al.  Observer: an approach for query processing in global information systems based on interoperation across pre-existing ontologies , 1996, Proceedings First IFCIS International Conference on Cooperative Information Systems.

[9]  Klaus R. Dittrich,et al.  Unified Querying of Ontology Languages with the SIRUP Ontology Query API , 2005, BTW.

[10]  Terry A. Landers,et al.  An Overview of MULTIBASE , 1986, DDB.

[11]  Klaus R. Dittrich,et al.  Detecting Similarities in Ontologies with the SOQA-SimPack Toolkit , 2006, EDBT.

[12]  Ali R. Hurson,et al.  Multidatabase Systems: An Advanced Concept in Handling Distributed Data , 1991, Adv. Comput..

[13]  Arne Sølvberg,et al.  Conceptual Modeling in a World of Models , 1999, EMISA.

[14]  Stuart E. Madnick,et al.  Working Paper Alfred P. Sloan School of Management Database Systems in a Dynamic Environment Database Systems in a Dynamic Environment Received Context Interchange: Overcoming the Challenges of Large-scale Interoperable Database Systems in a Dynamic Environment* , 2022 .

[15]  Amit P. Sheth,et al.  Semantic interoperability in global information systems , 1999, SGMD.

[16]  Amit P. Sheth,et al.  Semantic Interoperability in Global Information Systems: A Brief Introduction to the Research Area a , 1999 .

[17]  Heiner Stuckenschmidt,et al.  Ontology-Based Integration of Information - A Survey of Existing Approaches , 2001, OIS@IJCAI.

[18]  Clement T. Yu,et al.  Report on the workshop on heterogenous database systems held at Northwestern University Evanston, Illinois, December 11-13, 1989 sponsored by NSF , 1990, SGMD.

[19]  Alon Y. Halevy Data Integration: A Status Report , 2003, BTW.

[20]  Philip A. Bernstein,et al.  A vision for management of complex models , 2000, SGMD.

[21]  Dan Suciu,et al.  Schema mediation in peer data management systems , 2003, Proceedings 19th International Conference on Data Engineering (Cat. No.03CH37405).

[22]  Vipul Kashyap,et al.  InfoSleuth: agent-based semantic integration of information in open and dynamic environments , 1997, SIGMOD '97.

[23]  Serge Abiteboul,et al.  Web services and data integration , 2002, Proceedings of the Third International Conference on Web Information Systems Engineering, 2002. WISE 2002..

[24]  Klaus R. Dittrich,et al.  All Together Now: Towards Integrating the World's Information Systems , 2000, JISBD.

[25]  David Maier,et al.  From databases to dataspaces: a new abstraction for information management , 2005, SGMD.

[26]  Brian R. Gaines,et al.  Comparing the Conceptual Systems of Experts , 1989, IJCAI.

[27]  Gio Wiederhold,et al.  Mediators in the architecture of future information systems , 1992, Computer.

[28]  Laura M. Haas,et al.  Towards heterogeneous multimedia information systems: the Garlic approach , 1995, Proceedings RIDE-DOM'95. Fifth International Workshop on Research Issues in Data Engineering-Distributed Object Management.

[29]  Gunter Saake,et al.  Schema Integration with Integrity Constraints , 1997, BNCOD.

[30]  Fèlix Saltor,et al.  Semantic heterogeneity in multidatabase systems , 1995 .

[31]  William Kent,et al.  Data and Reality , 1978 .

[32]  Michael Gertz,et al.  Report on the Dagstuhl Seminar , 2004, SGMD.

[33]  William Kent,et al.  Data and Reality: Basic Assumptions in Data Processing Reconsidered , 1978 .

[34]  Michael Stonebraker,et al.  The Asilomar report on database research , 1998, SGMD.

[35]  Ahmed K. Elmagarmid,et al.  Object-Oriented Multidatabase Systems: A Solution for Advanced Applications , 1995 .

[36]  Munindar P. Singh,et al.  Agents on the Web: Ontologies for Agents , 1997, IEEE Internet Comput..

[37]  Amit P. Sheth,et al.  On Automatic Reasoning for Schema Integration , 1993, Int. J. Cooperative Inf. Syst..

[38]  Serge Abiteboul,et al.  The Data Ring: Community Content Sharing , 2007, CIDR.

[39]  Arne Sølvberg,et al.  Data and What They Refer to , 1997, Conceptual Modeling.