Development of a Large‐Scale De‐Identified DNA Biobank to Enable Personalized Medicine

Our objective was to develop a DNA biobank linked to phenotypic data derived from an electronic medical record (EMR) system. An “opt‐out” model was implemented after significant review and revision. The plan included (i) development and maintenance of a de‐identified mirror image of the EMR, namely, the “synthetic derivative” (SD) and (ii) DNA extracted from discarded blood samples and linked to the SD. Surveys of patients indicated general acceptance of the concept, with only a minority (∼5%) opposing it. As a result, mechanisms to facilitate opt‐out included publicity and revision of a standard “consent to treatment” form. Algorithms for sample handling and procedures for de‐identification were developed and validated in order to ensure acceptable error rates (<0.3 and <0.1%, respectively). The rate of sample accrual is 700–900 samples/week. The advantages of this approach are the rate of sample acquisition and the diversity of phenotypes based on EMRs.

[1]  R. Baggott DISEASE , 1947, Social Policy & Administration.

[2]  E. Silk,et al.  Prolonged apnoea following injection of succinyldicholine. , 1953, Lancet.

[3]  A. Motulsky Drug reactions enzymes, and biochemical genetics. , 1957, Journal of the American Medical Association.

[4]  V. McKusick,et al.  Genetic Control of Isoniazid Metabolism in Man , 1960, British medical journal.

[5]  E. Vesell,et al.  Genetic control of dicumarol levels in man. , 1968, The Journal of clinical investigation.

[6]  Electronic Mail Addresses , 1988 .

[7]  Roberto Giugliani,et al.  Inborn Errors of Metabolism , 1989 .

[8]  Randolph A. Miller,et al.  Research Paper: Evaluation of Long-term Maintenance of a Large Medical Knowledge Base , 1995, J. Am. Medical Informatics Assoc..

[9]  J. Gulcher,et al.  Population Genomics: Laying the Groundwork for Genetic Disease Modeling and Targeting , 1998, Clinical chemistry and laboratory medicine.

[10]  N. Peet,et al.  Pharmacogenomics: challenges and opportunities. , 2001, Drug discovery today.

[11]  Christopher G. Chute,et al.  The horizontal and vertical nature of patient phenotype retrieval: new directions for clinical text processing , 2002, AMIA.

[12]  E. Clayton Ethical, legal, and social implications of genomic medicine. , 2003, The New England journal of medicine.

[13]  Kevin B. Johnson,et al.  The Impact of Peer Management on Test-Ordering Behavior , 2004, Annals of Internal Medicine.

[14]  U Sax,et al.  Integration of Genomic Data in Electronic Health Records , 2005, Methods of Information in Medicine.

[15]  Gene Feder,et al.  Recruiting patients to medical research: double blind randomised trial of “opt-in” versus “opt-out” strategies , 2005, BMJ : British Medical Journal.

[16]  Bradley Malin,et al.  Technical Evaluation: An Evaluation of the Current State of Genomic Data Privacy Protection Technology and a Roadmap for the Future , 2004, J. Am. Medical Informatics Assoc..

[17]  S. Trent Rosenbloom,et al.  A Framework for Clinical Communication Supporting Healthcare Delivery , 2005, AMIA.

[18]  Randolph A. Miller,et al.  The anatomy of decision support during inpatient care provider order entry (CPOE): Empirical observations from a decade of CPOE experience at Vanderbilt , 2005, J. Biomed. Informatics.

[19]  J. Gilbert,et al.  Complement Factor H Variant Increases the Risk of Age-Related Macular Degeneration , 2005, Science.

[20]  Tim Sprosen,et al.  UK Biobank: from concept to reality. , 2005, Pharmacogenomics.

[21]  Lemuel R Waitman,et al.  Improved compliance with quality measures at hospital discharge with a computerized physician order entry system. , 2006, American heart journal.

[22]  Andy Haines,et al.  Overcoming barriers to recruitment in health research , 2006, BMJ : British Medical Journal.

[23]  Russ Altman,et al.  Pharmacogenomics: Challenges and Opportunities , 2006, Annals of Internal Medicine.

[24]  Peter J. Haug,et al.  Natural language processing to extract medical problems from electronic clinical documents: Performance evaluation , 2006, J. Biomed. Informatics.

[25]  C. Compton Getting to personalized cancer medicine , 2007, Cancer.

[26]  T. Hudson,et al.  A genome-wide association study identifies novel risk loci for type 2 diabetes , 2007, Nature.

[27]  Liam Glynn,et al.  Selection bias resulting from the requirement for prior consent in observational research: a community cohort of people with ischaemic heart disease , 2007, Heart.

[28]  Robert L Davis,et al.  Real-Time Vaccine Safety Surveillance for the Early Detection of Adverse Events , 2007, Medical care.

[29]  Francis S Collins,et al.  Ethics. Identifiability in genomic research. , 2007, Science.

[30]  Richard L Berg,et al.  Use of an Electronic Medical Record for the Identification of Research Subjects with Diabetes Mellitus , 2007, Clinical Medicine & Research.

[31]  R. Krauss,et al.  When good drugs go bad , 2007, Nature.

[32]  M. McCarthy,et al.  Replication of Genome-Wide Association Signals in UK Samples Reveals Risk Loci for Type 2 Diabetes , 2007, Science.

[33]  Angelo Nuzzo,et al.  A Dynamic Query System for Supporting Phenotype Mining in Genetic Studies , 2007, MedInfo.

[34]  B. Psaty,et al.  Personalized medicine in the era of genomics. , 2007, JAMA.

[35]  Jonathan C. Cohen,et al.  A Common Allele on Chromosome 9 Associated with Coronary Heart Disease , 2007, Science.

[36]  R B Altman,et al.  The Pharmacogenetics Research Network: From SNP Discovery to Clinical Drug Response , 2007, Clinical pharmacology and therapeutics.

[37]  Simon C. Potter,et al.  Association scan of 14,500 nonsynonymous SNPs in four diseases identifies autoimmunity variants , 2007, Nature Genetics.

[38]  Tracy A. Lieu,et al.  Application of Information Technology: Using Electronic Medical Records to Enhance Detection and Reporting of Vaccine Adverse Events , 2007, J. Am. Medical Informatics Assoc..

[39]  Randolph A. Miller,et al.  Research Paper: Computer-based Insulin Infusion Protocol Improves Glycemia Control over Manual Protocol , 2007, J. Am. Medical Informatics Assoc..

[40]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[41]  Jill M. Pulley,et al.  Attitudes and perceptions of patients towards methods of establishing a DNA biobank , 2008, Cell and Tissue Banking.

[42]  M. A. Hoffman,et al.  The genome-enabled electronic medical record , 2007, J. Biomed. Informatics.

[43]  Nicole Huang,et al.  Record linkage research and informed consent: who consents? , 2007, BMC Health Services Research.

[44]  G. Bernard,et al.  Evaluation of the effectiveness of posters to provide information to patients about a DNA database and their opportunity to opt out , 2007, Cell and Tissue Banking.

[45]  Francis S. Collins,et al.  Identifiability in Genomic Research , 2007, Science.