Integration of Genomic Data in Electronic Health Records

OBJECTIVES In this paper we give an overview about the challenge the postgenomic era poses on biomedical informaticists. The occurrence of new (genomic) data types necessitates new data models, new viewing metaphors and methods to deal with the disclosure of genomic data. We discuss integration issues when inferring phenotype and genotype data. Another challenge is to find the right phenotype to genotype data in order to get appropriate case numbers for sound clinical genotype-phenotype inference studies. METHODS Genomic data could be integrated in an Electronic Health Record (EHR) in several ways. We describe patient-centered and pointer-based integration strategies and the corresponding data types and data models. The inference mechanisms for the interpretation of row data contain different agents. We describe vertical, horizontal and temporal agents. RESULTS We have to deal with several new data types, not being standardized for EHR integration. Genomic data tends to be more structured than phenotype data. Beyond the development of new data models, vertical, horizontal and temporal agents have to be developed in order to link genotype and phenotype. As the genomic EHR will contain very sensitive data, confidentiality and privacy concerns have to be addressed. CONCLUSIONS Given the necessity to capture both environment and genomic state of a patient and their interaction, clinical information systems have to be redesigned. While genotyping seems to be automatable easily, this is not the case for clinical information. More integration work on terminologies and ontologies has to be done.

[1]  Isaac S. Kohane,et al.  Bioinformatics and Clinical Informatics: The Imperative to Collaborate , 2000, J. Am. Medical Informatics Assoc..

[2]  Joachim Dudeck,et al.  Discharge and referral data exchange using global standards - the SCIPHOX project in Germany , 2003, Int. J. Medical Informatics.

[3]  K. Becker,et al.  The Genetic Association Database , 2004, Nature Genetics.

[4]  Ele Holloway From Genotype to Phenotype: Linking Bioinformatics and Medical Informatics Ontologies , 2002, Comparative and functional genomics.

[5]  Maria Jesus Martin,et al.  The SWISS-PROT protein knowledgebase and its supplement TrEMBL in 2003 , 2003, Nucleic Acids Res..

[6]  Werner Ceusters,et al.  Ontology-Assisted Database Integration to Support Natural Language Processing and Biomedical Data-mining , 2004, J. Integr. Bioinform..

[7]  Jaime Prilusky,et al.  GeneCards: a novel functional genomics compendium with automated data mining and query reformulation support , 1998, Bioinform..

[8]  Peter Szolovits,et al.  The Personal Internetworked Notary and Guardian , 2001, Int. J. Medical Informatics.

[9]  John E. Mattison,et al.  Review: The HL7 Clinical Document Architecture , 2001, J. Am. Medical Informatics Assoc..

[10]  Richard A. Baldock,et al.  Bioinformatics integration and agent technology , 2004, J. Biomed. Informatics.

[11]  W. Giere Electronic Patient Information -- Pioneers and MuchMore. A vision, lessons learned, and challenges. , 2004, Methods of information in medicine.

[12]  Randall A. Bolanos,et al.  Whole-genome shotgun assembly and comparison of human genome assemblies , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[13]  Gregory D. Schuler,et al.  Database resources of the National Center for Biotechnology Information: update , 2004, Nucleic acids research.

[14]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[15]  Jason E. Stewart,et al.  Design and implementation of microarray gene expression markup language (MAGE-ML) , 2002, Genome Biology.

[16]  Bradley Malin,et al.  Technical Evaluation: An Evaluation of the Current State of Genomic Data Privacy Protection Technology and a Roadmap for the Future , 2004, J. Am. Medical Informatics Assoc..

[17]  Bradley Malin,et al.  How (not) to protect genomic data privacy in a distributed network: using trail re-identification to evaluate and design anonymity protection systems , 2004, J. Biomed. Informatics.

[18]  Elizabeth M. Smigielski,et al.  dbSNP: the NCBI database of genetic variation , 2001, Nucleic Acids Res..

[19]  James H Ford,et al.  Information requirements of genomics researchers from the patient clinical record. , 2002, Journal of healthcare information management : JHIM.

[20]  Zhen Lin,et al.  Genomic Research and Human Subject Privacy , 2004, Science.

[21]  Timothy B. Stockwell,et al.  The Sequence of the Human Genome , 2001, Science.

[22]  Steve Buckingham Bioinformatics: Data's future shock , 2004, Nature.

[23]  C. McDonald,et al.  LOINC, a universal standard for identifying laboratory observations: a 5-year update. , 2003, Clinical chemistry.

[24]  Jocelyn Kaiser,et al.  Population Databases Boom, From Iceland to the U.S. , 2002, Science.

[25]  Brian Bray,et al.  The PICNIC approach to regional care networks. , 2003, Studies in health technology and informatics.

[26]  R. Klar Selected Impressions on the Beginning of the Electronic Medical Record and Patient Information , 2004, Methods of Information in Medicine.

[27]  J. Skolnick,et al.  The PDB is a covering set of small protein structures. , 2003, Journal of molecular biology.

[28]  Tony Delamothe,et al.  What next for electronic communication and health care? , 2004, BMJ : British Medical Journal.