Translational integrity and continuity: Personalized biomedical data integration

Translational research data are generated in multiple research domains from the bedside to experimental laboratories. These data are typically stored in heterogeneous databases, held by segregated research domains, and described with inconsistent terminologies. Such inconsistency and fragmentation of data significantly impedes the efficiency of tracking and analyzing human-centered records. To address this problem, we have developed a data repository and management system named TraM (http://tram.uchicago.edu), based on a domain ontology integrated entity relationship model. The TraM system has the flexibility to recruit dynamically evolving domain concepts and the ability to support data integration for a broad range of translational research. The web-based application interfaces of TraM allow curators to improve data quality and provide robust and user-friendly cross-domain query functions. In its current stage, TraM relies on a semi-automated mechanism to standardize and restructure source data for data integration and thus does not support real-time data application.

[1]  Muthukumarasamy Karthikeyan,et al.  Harvesting Chemical Information from the Internet Using a Distributed Approach: ChemXtreme , 2006, J. Chem. Inf. Model..

[2]  Olivier Bodenreider,et al.  Using WordNet to Improve the Mapping of Data Elements to UMLS for Data Sources Integration , 2006, AMIA.

[3]  Montserrat Robles,et al.  Non-invasive light-weight integration engine for building EHR from autonomous distributed systems , 2006, MIE.

[4]  Paul Nightingale,et al.  Putting pharmacogenetics into practice , 2006, Nature Biotechnology.

[5]  V Maojo,et al.  Using Web Services for Linking Genomic Data to Medical Information Systems , 2007, Methods of Information in Medicine.

[6]  D B Richardson,et al.  The impact on relative risk estimates of inconsistencies between ICD-9 and ICD-10 , 2006, Occupational and Environmental Medicine.

[7]  Thomas B Knudsen,et al.  MIAME guidelines. , 2005, Reproductive toxicology.

[8]  Walter V. Sujansky,et al.  Heterogeneous Database Integration in Biomedicine , 2001, J. Biomed. Informatics.

[9]  Alan F. Scott,et al.  Online Mendelian Inheritance in Man (OMIM), a knowledgebase of human genes and genetic disorders , 2002, Nucleic Acids Res..

[10]  Tatiana Tatusova,et al.  NCBI Reference Sequence (RefSeq): a curated non-redundant sequence database of genomes, transcripts and proteins , 2004, Nucleic Acids Res..

[11]  George P Chrousos,et al.  Genomic Medicine , 2002, Annals of the New York Academy of Sciences.

[12]  Moshé M. Zloof Query-by-example: the invocation and definition of tables and forms , 1975, VLDB '75.

[13]  Ibis Sánchez-Serrano,et al.  Success in translational research: lessons from the development of bortezomib , 2006, Nature Reviews Drug Discovery.

[14]  H. Pass,et al.  Translation research: from accurate diagnosis to appropriate treatment , 2004, Journal of Translational Medicine.

[15]  中尾 光輝,et al.  KEGG(Kyoto Encyclopedia of Genes and Genomes)〔和文〕 (特集 ゲノム医学の現在と未来--基礎と臨床) -- (データベース) , 2000 .

[16]  Mark Ellisman,et al.  e-Neuroscience: challenges and triumphs in integrating distributed data from molecules to brains , 2004, Nature Neuroscience.

[17]  M. Relling,et al.  Moving towards individualized medicine with pharmacogenomics , 2004, Nature.

[18]  Alon Y. Halevy,et al.  Data integration and genomic medicine , 2007, J. Biomed. Informatics.

[19]  Kei-Hoi Cheung,et al.  Advancing translational research with the Semantic Web , 2007, BMC Bioinformatics.

[20]  Gilberto Fragoso,et al.  caCORE version 3: Implementation of a model driven, service-oriented architecture for semantic interoperability , 2008, J. Biomed. Informatics.

[21]  M. Khoury,et al.  Population screening in the age of genomic medicine. , 2003, The New England journal of medicine.

[22]  Alexander C. Yu,et al.  Methods in biomedical ontology , 2006, J. Biomed. Informatics.

[23]  José Alberto Maldonado,et al.  Non-invasive lightweight integration engine for building EHR from autonomous distributed systems , 2007, Int. J. Medical Informatics.

[24]  Andy Gaughan Bridging the divide: the need for translational informatics. , 2006, Pharmacogenomics.

[25]  Fran Turisco,et al.  Current Status of Integrating Information Technologies into the Clinical Research Enterprise within US Academic Health Centers , 2005, Journal of Investigative Medicine.

[26]  Peter B. McGarvey,et al.  UniRef: comprehensive and non-redundant UniProt reference clusters , 2007, Bioinform..

[27]  F. Marincola,et al.  Obstacles and opportunities in translational research , 2005, Nature Medicine.

[28]  Olga Brazhnik,et al.  Anatomy of data integration , 2007, J. Biomed. Informatics.

[29]  Olivier Bodenreider,et al.  Investigating subsumption in SNOMED CT: An exploration into large description logic-based biomedical terminologies , 2007, Artif. Intell. Medicine.

[30]  Sherri de Coronado,et al.  NCI Thesaurus: A semantic model integrating cancer-related clinical and molecular information , 2007, J. Biomed. Informatics.

[31]  Robert Hoehndorf,et al.  A top-level ontology of functions and its application in the Open Biomedical Ontologies , 2006, ISMB.

[32]  Francis S. Collins,et al.  Genomic medicine--a primer. , 2002, The New England journal of medicine.

[33]  Duncan Geddes Translational research--from gene to treatment: lessons from cystic fibrosis. , 2005, Clinical medicine.

[34]  G. Lonsdale,et al.  A Service-Oriented Grid Infrastructure for Biomedical Data and Compute Services , 2007, IEEE Transactions on NanoBioscience.

[35]  H Billhardt,et al.  An agent- and ontology-based system for integrating public gene, protein, and disease databases , 2007, J. Biomed. Informatics.

[36]  Ian Collins,et al.  New approaches to molecular cancer therapeutics , 2006, Nature chemical biology.

[37]  Olivier Bodenreider,et al.  Circular hierarchical relationships in the UMLS: etiology, diagnosis, treatment, complications and prevention , 2001, AMIA.

[38]  R. Cardiff,et al.  Advancing translational research. , 2011, Science.

[39]  Stephen B. Johnson,et al.  Breaking the Translational Barriers: The Value of Integrating Biomedical Informatics and Translational Research , 2005, Journal of Investigative Medicine.

[40]  K. Jain,et al.  Challenges of drug discovery for personalized medicine. , 2006, Current opinion in molecular therapeutics.

[41]  Bradley Malin,et al.  How (not) to protect genomic data privacy in a distributed network: using trail re-identification to evaluate and design anonymity protection systems , 2004, J. Biomed. Informatics.

[42]  Elizabeth M. Smigielski,et al.  dbSNP: the NCBI database of genetic variation , 2001, Nucleic Acids Res..

[43]  T. Tatusova,et al.  Entrez Gene: gene-centered information at NCBI , 2010, Nucleic Acids Res..

[44]  Peter P. Chen The entity-relationship model: toward a unified view of data , 1975, VLDB '75.

[45]  Olivier Bodenreider,et al.  The Unified Medical Language System (UMLS): integrating biomedical terminology , 2004, Nucleic Acids Res..

[46]  William T. Hole,et al.  Research Paper: Integrating SNOMED CT into the UMLS: An Exploration of Different Views of Synonymy and Quality of Editing , 2005, J. Am. Medical Informatics Assoc..

[47]  Ian Sabroe,et al.  Identifying and hurdling obstacles to translational research , 2007, Nature Reviews Immunology.