Privacy-preserving techniques of genomic data - a survey

Genomic data hold salient information about the characteristics of a living organism. Throughout the past decade, pinnacle developments have given us more accurate and inexpensive methods to retrieve genome sequences of humans. However, with the advancement of genomic research, there is a growing privacy concern regarding the collection, storage and analysis of such sensitive human data. Recent results show that given some background information, it is possible for an adversary to reidentify an individual from a specific genomic data set. This can reveal the current association or future susceptibility of some diseases for that individual (and sometimes the kinship between individuals) resulting in a privacy violation. Regardless of these risks, our genomic data hold much importance in analyzing the well-being of us and the future generation. Thus, in this article, we discuss the different privacy and security-related problems revolving around human genomic data. In addition, we will explore some of the cardinal cryptographic concepts, which can bring efficacy in secure and private genomic data computation. This article will relate the gaps between these two research areas-Cryptography and Genomics.

[1]  Stephen E. Fienberg,et al.  Privacy-Preserving Data Sharing for Genome-Wide Association Studies , 2012, J. Priv. Confidentiality.

[2]  Dan Bogdanov,et al.  A new way to protect privacy in large-scale genome-wide association studies , 2013, Bioinform..

[3]  Michael Naehrig,et al.  Private Predictive Analysis on Encrypted Medical Data , 2014, IACR Cryptol. ePrint Arch..

[4]  Barry Barnes,et al.  Genomes and What to Make of Them , 2008 .

[5]  Yaniv Erlich,et al.  Routes for breaching and protecting genetic privacy , 2013, Nature Reviews Genetics.

[6]  S. Fullerton,et al.  Genomic research and wide data sharing: Views of prospective participants , 2010, Genetics in Medicine.

[7]  Jung Hee Cheon,et al.  Secure searching of biomarkers through hybrid homomorphic encryption scheme , 2017, BMC Medical Genomics.

[8]  Murat Kantarcioglu,et al.  A Cryptographic Approach to Securely Share and Query Genomic Sequences , 2008, IEEE Transactions on Information Technology in Biomedicine.

[9]  S. Gottlieb,et al.  US employer agrees to stop genetic testing , 2001, BMJ : British Medical Journal.

[10]  Massoud Hadian Dehkordi,et al.  Private and Efficient Query Processing on Outsourced Genomic Databases , 2017, IEEE Journal of Biomedical and Health Informatics.

[11]  Zhen Lin,et al.  Genomic Research and Human Subject Privacy , 2004, Science.

[12]  M. Gerstein,et al.  Quantification of private information leakage from phenotype-genotype data: linking attacks , 2016, Nature Methods.

[13]  Dima Alhadidi,et al.  Secure approximation of edit distance on genomic data , 2017, BMC Medical Genomics.

[14]  Eran Halperin,et al.  Identifying Personal Genomes by Surname Inference , 2013, Science.

[15]  S. Nelson,et al.  Resolving Individuals Contributing Trace Amounts of DNA to Highly Complex Mixtures Using High-Density SNP Genotyping Microarrays , 2008, PLoS genetics.

[16]  Marina Blanton,et al.  Efficient Server-Aided Secure Two-Party Function Evaluation with Applications to Genomic Computation , 2016, IACR Cryptol. ePrint Arch..

[17]  Bonnie Berger,et al.  Realizing privacy preserving genome-wide association studies , 2016, Bioinform..

[18]  Carl A. Gunter,et al.  Privacy in the Genomic Era , 2014, ACM Comput. Surv..

[19]  C. Bustamante,et al.  Privacy Risks from Genomic Data-Sharing Beacons , 2015, American journal of human genetics.

[20]  Xiaoqian Jiang,et al.  FORESEE: Fully Outsourced secuRe gEnome Study basEd on homomorphic Encryption , 2015, BMC Medical Informatics and Decision Making.

[21]  Mete Akgün,et al.  Privacy preserving processing of genomic data: A survey , 2015, J. Biomed. Informatics.

[22]  Yuchen Zhang,et al.  HEALER: homomorphic computation of ExAct Logistic rEgRession for secure rare disease variants analysis in GWAS , 2015, Bioinform..

[23]  J. Pennekamp,et al.  BLOOM: BLoom filter based oblivious outsourced matchings , 2017, BMC Medical Genomics.

[24]  Zhicong Huang,et al.  A privacy-preserving solution for compressed storage and selective retrieval of genomic data , 2016, Genome research.

[25]  Latanya Sweeney,et al.  Identifying Participants in the Personal Genome Project by Name , 2013, ArXiv.

[26]  Zhicong Huang,et al.  Efficient and secure outsourcing of genomic data storage , 2017, BMC Medical Genomics.

[27]  Xiaoqian Jiang,et al.  Addressing Beacon re-identification attacks: quantification and mitigation of privacy risks , 2017, J. Am. Medical Informatics Assoc..

[28]  Murat Kantarcioglu,et al.  Secure Management of Biomedical Data With Cryptographic Hardware , 2012, IEEE Transactions on Information Technology in Biomedicine.

[29]  Gizem S. Cetin,et al.  Private queries on encrypted genomic data , 2017, BMC Medical Genomics.

[30]  Stephen E. Fienberg,et al.  Scalable privacy-preserving data sharing methodology for genome-wide association studies , 2014, J. Biomed. Informatics.

[31]  Bonnie Berger,et al.  Enabling Privacy Preserving GWAS in Heterogeneous Human Populations , 2016, RECOMB.

[32]  Murat Kantarcioglu,et al.  Controlling the signal: Practical privacy protection of genomic data sharing through Beacon services , 2017, BMC Medical Genomics.

[33]  Gunnar Rätsch,et al.  Efficient privacy-preserving string search and an application in genomics , 2015, bioRxiv.

[34]  Shuang Wang,et al.  Differentially private genome data dissemination through top-down specialization , 2014, BMC Medical Informatics and Decision Making.

[35]  Jihoon Kim,et al.  PRINCESS: Privacy‐protecting Rare disease International Network Collaboration via Encryption through Software guard extensionS , 2017, Bioinform..

[36]  Md Momin Al Aziz,et al.  Aftermath of bustamante attack on genomic beacon service , 2017, BMC Medical Genomics.

[37]  Shuang Wang,et al.  Genome privacy: challenges, technical approaches to mitigate risk, and ethical considerations in the United States , 2017, Annals of the New York Academy of Sciences.