The Effect of Kinship in Re-identification Attacks Against Genomic Data Sharing Beacons

Motivation Big data era in genomics promises a breakthrough in medicine, but sharing data in a private manner limits the pace of field. Widely accepted “genomic data-sharing beacon” protocol provides a standardized and secure interface for querying the genomic datasets. The data is only shared if the desired information (e.g., a certain variant) exists in the dataset. Various studies showed that beacons are vulnerable to re-identification (or membership inference) attacks. As beacons are generally associated with sensitive phenotype information, re-identification creates a significant risk for the participants. Unfortunately, proposed countermeasures against such attacks have failed to be effective, as they do not consider the utility of beacon protocol. Results In this study, for the first time, we analyze the mitigation effect of the kinship relationships among beacon participants against re-identification attacks. We argue that having multiple family members in a beacon can garble the information for attacks since a substantial number of variants are shared among kinrelated people. Using family genomes from HapMap and synthetically-generated datasets, we show that having one of the parents of a victim in the beacon causes (i) significant decrease in the power of attacks and (ii) substantial increase in the number of queries needed to confirm an individual’s beacon membership. We also show how the protection effect attenuates when more distant relatives, such as grandparents are included alongside the victim. Furthermore, we quantify the utility loss due adding relatives and show that it is smaller compared to flipping based techniques.

[1]  Zhicong Huang,et al.  Quantifying Genomic Privacy via Inference Attack with High-Order SNV Correlations , 2015, 2015 IEEE Security and Privacy Workshops.

[2]  Xiaoqian Jiang,et al.  Addressing Beacon re-identification attacks: quantification and mitigation of privacy risks , 2017, J. Am. Medical Informatics Assoc..

[3]  Marvin R Natowicz,et al.  Reply to Hook and Lowden: the definition and implications of genetic discrimination. , 1992, American journal of human genetics.

[4]  Jean-Pierre Hubaux,et al.  Addressing the concerns of the lacks family: quantification of kin genomic privacy , 2013, CCS.

[5]  M. Olivier A haplotype map of the human genome , 2003, Nature.

[6]  Erman Ayday,et al.  Re-Identification of Individuals in Genomic Data-Sharing Beacons via Allele Inference , 2017, bioRxiv.

[7]  Xiaofeng Wang,et al.  Real-time Protection of Genomic Data Sharing in Beacon Services , 2018, AMIA Joint Summits on Translational Science proceedings. AMIA Joint Summits on Translational Science.

[8]  Erman Ayday,et al.  Genome Reconstruction Attacks Against Genomic Data-Sharing Beacons , 2020, Proc. Priv. Enhancing Technol..

[9]  Iman Deznabi,et al.  An Inference Attack on Genomic Data Using Kinship, Complex Correlations, and Phenotype Information , 2018, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[10]  P. Kim,et al.  Genetic discrimination, genetic privacy: rethinking employee protections for a brave new workplace. , 2002, Northwestern University law review.

[11]  M R Natowicz,et al.  Discrimination as a consequence of genetic testing. , 1992, American journal of human genetics.

[12]  David Haussler,et al.  Federated discovery and sharing of genomic data using Beacons , 2019, Nature Biotechnology.

[13]  C. Bustamante,et al.  Privacy Risks from Genomic Data-Sharing Beacons , 2015, American journal of human genetics.

[14]  Toshihiro Tanaka The International HapMap Project , 2003, Nature.

[15]  Klaudia Walter,et al.  The impact of rare and low-frequency genetic variants in common disease , 2017, Genome Biology.

[16]  C. Kozma,et al.  Genetic Discrimination: Perspectives of Consumers , 1996, Science.

[17]  David Mittelman,et al.  Consumer genomics will change your life, whether you get tested or not , 2018, Genome Biology.

[18]  Md Momin Al Aziz,et al.  Aftermath of bustamante attack on genomic beacon service , 2017, BMC Medical Genomics.