Methods for automated concept mapping between medical databases

The retrieval and exchange of information between medical databases is often impeded by the semantic heterogeneity of the concepts they contain. Manually identifying equivalent database elements consumes time and resources, and may be the rate-limiting technological step in integrating disparate data sources. When semantic networks are used as an intermediary representation of the native databases, automated mapping algorithms can identify equivalent concepts across those databases. The algorithms exploit the conceptual "context" embodied in a semantic network to produce candidate concept mappings. The performance of automated concept mapping was evaluated by creating semantic network representations of two test laboratory databases. The mapping algorithms identified all equivalent concepts present in the two databases, leaving none unmapped. Using conceptual context to automate concept mapping facilitates the identification of equivalent database concepts and may help reduce the work and cost of retrieving and integrating information from disparate databases.
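The idea of using conceptual context to propose candidate mappings can be illustrated with a minimal sketch. It is not the algorithm evaluated in the paper, whose details the abstract does not give. It assumes each semantic network is a dictionary from a concept to its set of (relation, neighbor) pairs, that a few neighbor concepts have already been matched by hand (`anchors`), and that context similarity is measured with a plain Jaccard overlap. The concept names, relation labels, `anchors` table, and threshold are all hypothetical.

```python
from typing import Dict, List, Set, Tuple

# Assumed representation: concept -> set of (relation, neighbor) pairs.
Context = Set[Tuple[str, str]]
Network = Dict[str, Context]


def context_similarity(ctx_a: Context, ctx_b: Context,
                       anchors: Dict[str, str]) -> float:
    """Jaccard overlap of two contexts, after translating neighbors of
    network A into network B's vocabulary via the known anchors."""
    translated = {(rel, anchors.get(nbr, nbr)) for rel, nbr in ctx_a}
    union = translated | ctx_b
    if not union:
        return 0.0
    return len(translated & ctx_b) / len(union)


def candidate_mappings(net_a: Network, net_b: Network,
                       anchors: Dict[str, str],
                       threshold: float = 0.5) -> List[Tuple[str, str, float]]:
    """Return (concept_a, concept_b, score) triples whose context
    similarity meets the threshold, best score first."""
    candidates = []
    for a, ctx_a in net_a.items():
        for b, ctx_b in net_b.items():
            score = context_similarity(ctx_a, ctx_b, anchors)
            if score >= threshold:
                candidates.append((a, b, score))
    return sorted(candidates, key=lambda c: -c[2])


# Hypothetical fragments of two laboratory databases.
lab_a: Network = {
    "GLU": {("measures", "glucose"), ("specimen", "serum"), ("units", "mg/dL")},
    "NA": {("measures", "sodium"), ("specimen", "serum"), ("units", "mmol/L")},
}
lab_b: Network = {
    "Glucose_Ser": {("measures", "Glucose"), ("specimen", "SER"), ("units", "mg/dL")},
    "Sodium_Ser": {("measures", "Sodium"), ("specimen", "SER"), ("units", "mmol/L")},
}
# Neighbor concepts assumed to be matched already (hypothetical).
anchors = {"glucose": "Glucose", "sodium": "Sodium", "serum": "SER"}

for a, b, score in candidate_mappings(lab_a, lab_b, anchors):
    print(f"{a} -> {b} (score {score:.2f})")
```

In this toy data the locally named codes `GLU` and `NA` share no lexical form with their counterparts, yet their surrounding relations pair them with `Glucose_Ser` and `Sodium_Ser` respectively. The output is a list of candidates for review rather than a final mapping.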
