GenePING: secure, scalable management of personal genomic data

BackgroundPatient genomic data are rapidly becoming part of clinical decision making. Within a few years, full genome expression profiling and genotyping will be affordable enough to perform on every individual. The management of such sizeable, yet fine-grained, data in compliance with privacy laws and best practices presents significant security and scalability challenges.ResultsWe present the design and implementation of GenePING, an extension to the PING personal health record system that supports secure storage of large, genome-sized datasets, as well as efficient sharing and retrieval of individual datapoints (e.g. SNPs, rare mutations, gene expression levels). Even with full access to the raw GenePING storage, an attacker cannot discover any stored genomic datapoint on any single patient. Given a large-enough number of patient records, an attacker cannot discover which data corresponds to which patient, or even the size of a given patient's record. The computational overhead of GenePING's security features is a small constant, making the system usable, even in emergency care, on today's hardware.ConclusionGenePING is the first personal health record management system to support the efficient and secure storage and sharing of large genomic datasets. GenePING is available online at http://ping.chip.org/genepinghtml, licensed under the LGPL.

[1]  Yudong D. He,et al.  Gene expression profiling predicts clinical outcome of breast cancer , 2002, Nature.

[2]  Russ B Altman,et al.  Genetics. Genomic research and human subject privacy. , 2004, Science.

[3]  Michael M. Kaback,et al.  Population-based genetic screening for reproductive counseling: the Tay-Sachs disease model , 2000, European Journal of Pediatrics.

[4]  M. Olivier A haplotype map of the human genome. , 2003, Nature.

[5]  G. E. Thyer,et al.  Modes of operation , 1991 .

[6]  Abhi Shelat,et al.  Remembrance of Data Passed: A Study of Disk Sanitization Practices , 2003, IEEE Secur. Priv..

[7]  J. Shendure,et al.  Advanced sequencing technologies: methods and goals , 2004, Nature Reviews Genetics.

[8]  Peter Szolovits,et al.  The Personal Internetworked Notary and Guardian , 2001, Int. J. Medical Informatics.

[9]  J. Gerberding,et al.  Genetic testing for breast and ovarian cancer susceptibility: evaluating direct-to-consumer marketing--Atlanta, Denver, Raleigh-Durham, and Seattle, 2003. , 2004, MMWR. Morbidity and mortality weekly report.

[10]  Paola Sebastiani,et al.  Genetic dissection and prognostic modeling of overt stroke in sickle cell anemia , 2005, Nature Genetics.

[11]  Lidewij Henneman,et al.  Public Experiences, Knowledge and Expectations about Medical Genetics and the Use of Genetic Information , 2004, Public Health Genomics.

[12]  T. Poggio,et al.  Prediction of central nervous system embryonal tumour outcome based on gene expression , 2002, Nature.

[13]  Zhen Lin,et al.  Genomic Research and Human Subject Privacy , 2004, Science.

[14]  Hugo Krawczyk,et al.  Keying Hash Functions for Message Authentication , 1996, CRYPTO.

[15]  Russ B Altman,et al.  Health-information altruists--a potentially critical resource. , 2005, The New England journal of medicine.

[16]  Ash A. Alizadeh,et al.  Distinct types of diffuse large B-cell lymphoma identified by gene expression profiling , 2000, Nature.

[17]  Arthur L. Beaudet,et al.  Genetic testing for cystic fibrosis. , 1992, NIH consensus statement.

[18]  M. Olivier A haplotype map of the human genome , 2003, Nature.

[19]  Abhi Shelat,et al.  IEEE Security & Privacy: Data Forensics - Rememberance of Data Passed: A Study of Disk Sanitization Practices , 2003, IEEE Distributed Syst. Online.

[20]  Kenneth D. Mandl,et al.  Model Formulation: The PING Personally Controlled Electronic Medical Record System: Technical Architecture , 2004, J. Am. Medical Informatics Assoc..

[21]  T Hyslop,et al.  Use of cancer susceptibility testing among primary care physicians , 2003, Clinical genetics.

[22]  Kristine K Barlow-Stewart,et al.  Working in partnership with support services in the era of the “new genetics” , 2003, The Medical journal of Australia.

[23]  Jorge Cortes,et al.  Natural history and staging of chronic myelogenous leukemia. , 2004, Hematology/oncology clinics of North America.