Heterogeneous Database Integration in Biomedicine

The rapid expansion of biomedical knowledge, reduction in computing costs, and spread of internet access have created an ocean of electronic data. The decentralized nature of our scientific community and healthcare system, however, has resulted in a patchwork of diverse, or heterogeneous, database implementations, making access to and aggregation of data across databases very difficult. The database heterogeneity problem applies equally to clinical data describing individual patients and biological data characterizing our genome. Specifically, databases are highly heterogeneous with respect to the data models they employ, the data schemas they specify, the query languages they support, and the terminologies they recognize. Heterogeneous database systems attempt to unify disparate databases by providing uniform conceptual schemas that resolve representational heterogeneities, and by providing querying capabilities that aggregate and integrate distributed data. Research in this area has applied a variety of database and knowledge-based techniques, including semantic data modeling, ontology definition, query translation, query optimization, and terminology mapping. Existing systems have addressed heterogeneous database integration in the realms of molecular biology, hospital information systems, and application portability.
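As an illustration of the ideas above, the following minimal sketch shows how a mediator might present one uniform schema over two sources that differ in schema and terminology. It is not drawn from any system the article describes. The two in-memory SQLite sources, their table and column names, the local test codes, and the `query_global` function are all hypothetical.

```python
import sqlite3

# Hypothetical source A: table "labs", text patient ids, short test codes.
def make_source_a():
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE labs (pid TEXT, test_code TEXT, result REAL)")
    db.executemany(
        "INSERT INTO labs VALUES (?, ?, ?)",
        [("p1", "GLU", 5.4), ("p2", "NA", 140.0)],
    )
    return db

# Hypothetical source B: table "observation", integer patient ids, long names.
def make_source_b():
    db = sqlite3.connect(":memory:")
    db.execute("CREATE TABLE observation (patient INTEGER, name TEXT, val REAL)")
    db.executemany(
        "INSERT INTO observation VALUES (?, ?, ?)",
        [(7, "glucose_serum", 5.0)],
    )
    return db

# Each source entry carries two mappings (both are assumptions for this sketch):
#   "sql":   a query template in the source's own schema that projects onto
#            the global columns (patient_id, value)  -> query translation
#   "terms": global term -> local code                -> terminology mapping
SOURCES = [
    {
        "name": "A",
        "db": make_source_a(),
        "sql": "SELECT pid, result FROM labs WHERE test_code = ?",
        "terms": {"glucose": "GLU", "sodium": "NA"},
    },
    {
        "name": "B",
        "db": make_source_b(),
        "sql": "SELECT CAST(patient AS TEXT), val FROM observation WHERE name = ?",
        "terms": {"glucose": "glucose_serum"},
    },
]

def query_global(test):
    """Answer a query posed against the global schema
    lab_result(source, patient_id, test, value) by translating it
    for each source and aggregating the results."""
    rows = []
    for src in SOURCES:
        local_code = src["terms"].get(test)
        if local_code is None:
            continue  # this source has no equivalent term; skip it
        for pid, value in src["db"].execute(src["sql"], (local_code,)):
            rows.append({
                "source": src["name"],
                # Prefix ids with the source name so they stay unique.
                "patient_id": f"{src['name']}:{pid}",
                "test": test,
                "value": value,
            })
    return rows

if __name__ == "__main__":
    for row in query_global("glucose"):
        print(row)
```

The caller names only the global term (`"glucose"`) and never sees either source's table layout or local codes. A real system would also need to reconcile units, data types, and patient identity across sources, which this sketch leaves out.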
