Privacy in the age of medical big data

Big data has become the ubiquitous watch word of medical innovation. The rapid development of machine-learning techniques and artificial intelligence in particular has promised to revolutionize medical practice from the allocation of resources to the diagnosis of complex diseases. But with big data comes big risks and challenges, among them significant questions about patient privacy. Here, we outline the legal and ethical challenges big data brings to patient privacy. We discuss, among other topics, how best to conceive of health privacy; the importance of equity, consent, and patient governance in data collection; discrimination in data uses; and how to handle data breaches. We close by sketching possible ways forward for the regulatory system.The increased amount of health care data collected brings with it ethical and legal challenges for protecting the patient while optimizing health care and research.

[1]  D. Stone The struggle for the soul of health insurance. , 1993, Journal of health politics, policy and law.

[2]  Richard A Epstein,et al.  The legal regulation of genetic discrimination: old responses to new technology. , 1994, Boston University law review. Boston University. School of Law.

[3]  Workshop conclusions , 1996, Environmental monitoring and assessment.

[4]  E. Jacobsohn,et al.  Ethical and practical considerations of with , 1999 .

[5]  A. Wall,et al.  Book ReviewTo Err is Human: building a safer health system Kohn L T Corrigan J M Donaldson M S Washington DC USA: Institute of Medicine/National Academy Press ISBN 0 309 06837 1 $34.95 , 2000 .

[6]  D. Winickoff,et al.  The charitable trust as a model for genomic biobanks. , 2003, The New England journal of medicine.

[7]  H. Morreim Research versus Innovation: Real Differences , 2005, The American journal of bioethics : AJOB.

[8]  H. Greely The uneasy ethical and legal underpinnings of large-scale genomic biobanks. , 2007, Annual review of genomics and human genetics.

[9]  Vitaly Shmatikov,et al.  Robust De-anonymization of Large Sparse Datasets , 2008, 2008 IEEE Symposium on Security and Privacy (sp 2008).

[10]  E. Emanuel,et al.  Quality-improvement research and informed consent. , 2008, The New England journal of medicine.

[11]  Helen Nissenbaum,et al.  Privacy in Context , 2009 .

[12]  Paul Ohm Broken Promises of Privacy: Responding to the Surprising Failure of Anonymization , 2009 .

[13]  Helen Nissenbaum,et al.  Privacy in Context - Technology, Policy, and the Integrity of Social Life , 2009 .

[14]  M. Calo The Boundaries of Privacy Harm , 2010 .

[15]  D. Blumenthal,et al.  Achieving a Nationwide Learning Health System , 2010, Science Translational Medicine.

[16]  R. Platt,et al.  Developing the Sentinel System--a national resource for evidence development. , 2011, The New England journal of medicine.

[17]  I. Kohane Using electronic health records to drive discovery in disease genomics , 2011, Nature Reviews Genetics.

[18]  Three Models of Health Insurance: The Conceptual Pluralism of the Patient Protection and Affordable Care Act , 2011 .

[19]  Nicolas P. Terry,et al.  Protecting Patient Privacy in the Age of Big Data , 2012 .

[20]  Steven N Goodman,et al.  The research-treatment distinction: a problematic approach for determining which activities should have ethical oversight. , 2013, The Hastings Center report.

[21]  Nandita Mitra,et al.  Public preferences about secondary uses of electronic health information. , 2013, JAMA internal medicine.

[22]  K. Crawford,et al.  Big Data and Due Process: Toward a Framework to Redress Predictive Privacy Harms , 2013 .

[23]  Leopoldo C. Cancio,et al.  Development and validation of a machine learning algorithm and hybrid system to predict the need for life-saving interventions in trauma patients , 2013, Medical & Biological Engineering & Computing.

[24]  Eran Halperin,et al.  Identifying Personal Genomes by Surname Inference , 2013, Science.

[25]  Steven N Goodman,et al.  An ethics framework for a learning health care system: a departure from traditional research ethics and clinical ethics. , 2013, The Hastings Center report.

[26]  Daniel B. Vorhaus,et al.  The next controversy in genetic testing: clinical data as trade secrets? , 2012, European Journal of Human Genetics.

[27]  Aaron Roth,et al.  The Algorithmic Foundations of Differential Privacy , 2014, Found. Trends Theor. Comput. Sci..

[28]  W. Nicholson Price,et al.  Black-Box Medicine , 2014 .

[29]  Distinguishing QI projects from human subjects research: ethical and practical considerations. , 2014, Bulletin of the American College of Surgeons.

[30]  S. Hoffman Citizen Science: The Law and Ethics of Public Access to Medical Big Data , 2014 .

[31]  Robert A Philibert,et al.  Methylation array data can simultaneously identify individuals and convey protected health information: an unrecognized ethical concern , 2014, Clinical Epigenetics.

[32]  Ruben Amarasingham,et al.  The legal and ethical concerns that arise from using complex predictive analytics in health care. , 2014, Health affairs.

[33]  Ben Goldacre,et al.  How to Get All Trials Reported: Audit, Better Data, and Individual Accountability , 2015, PLoS medicine.

[34]  Michael Morrison,et al.  Dynamic consent: a patient interface for twenty-first century research networks , 2014, European Journal of Human Genetics.

[35]  W. Nicholson Price,et al.  Big Data, Patents, and the Future of Medicine , 2015 .

[36]  Robert Cook-Deegan,et al.  Broad Consent for Research With Biological Samples: Workshop Conclusions , 2015, The American journal of bioethics : AJOB.

[37]  Andrew D. Selbst,et al.  Big Data's Disparate Impact , 2016 .

[38]  Frank Rockhold,et al.  Data Sharing at a Crossroads. , 2016, The New England journal of medicine.

[39]  Roger Allan Ford,et al.  Privacy and Accountability in Black-Box Medicine , 2016 .

[40]  S. Hoffman Big Data's New Discrimination Threats: Amending the Americans with Disabilities Act to Cover Discrimination Based on Data-Driven Predictions of Future Disease , 2016 .

[41]  After a prominent gene-testing firm declined to give patients their complete data, ACLU filed a legal complaint , 2016 .

[42]  K. Spector-Bagdady,et al.  "The Google of Healthcare": enabling the privatization of genetic bio/databanking. , 2016, Annals of epidemiology.

[43]  Steven A. Demurjian,et al.  Differential Privacy Approach for Big Data Privacy in Healthcare , 2017 .

[44]  R. Eisenberg,et al.  Promoting healthcare innovation on the demand side , 2016, Journal of law and the biosciences.

[45]  An Expressive Theory of Privacy Intrusions , 2017 .

[46]  N. Terry Existential challenges for healthcare data protection in the United States , 2017 .

[47]  Sebastian Thrun,et al.  Dermatologist-level classification of skin cancer with deep neural networks , 2017, Nature.

[48]  Jon M. Kleinberg,et al.  Inherent Trade-Offs in the Fair Determination of Risk Scores , 2016, ITCS.

[49]  W Nicholson Price,et al.  Regulating Black-Box Medicine. , 2017, Michigan law review.

[50]  A. Shuman,et al.  Reg-ent within the Learning Health System , 2018, Otolaryngology--head and neck surgery : official journal of American Academy of Otolaryngology-Head and Neck Surgery.

[51]  Urs Gasser,et al.  Big Data, Health Law, and Bioethics , 2018 .

[52]  Drug Approval in a Learning Health System , 2018 .

[53]  C. Robertson,et al.  Who's left out of Big Data?: How Big Data collection, analysis, and use neglect populations most in need of medical and public health research and interventions , 2018 .

[54]  Thomas May Sociogenetic Risks - Ancestry DNA Testing, Third-Party Identity, and Protection of Privacy. , 2018, The New England journal of medicine.

[55]  Andrew Y. Ng,et al.  Improving palliative care with deep learning , 2017, 2017 IEEE International Conference on Bioinformatics and Biomedicine (BIBM).

[56]  Big Data’s Epistemology and Its Implications for Precision Medicine and Privacy , 2018 .

[57]  E. Ingelsson,et al.  Big Data and medicine: a big deal? , 2018, Journal of internal medicine.

[58]  I. Cohen Is There a Duty to Share Healthcare Data , 2018 .

[59]  Margaret Foster Riley Big Data, HIPAA, and the Common Rule , 2018 .

[60]  Regulatory Disruption and Arbitrage in Health-Care Data Protection. , 2016, Yale journal of health policy, law, and ethics.

[61]  S. Goodman,et al.  Clinical Trial Participants’ Views of the Risks and Benefits of Data Sharing , 2018, The New England journal of medicine.

[62]  P. Appelbaum,et al.  Analysis of state laws on informed consent for clinical genetic testing in the era of genomic sequencing , 2018, American journal of medical genetics. Part C, Seminars in medical genetics.

[63]  Nicolas P. Terry,et al.  Appification, AI, and Healthcare's New Iron Triangle , 2018 .

[64]  David Sontag,et al.  Why Is My Classifier Discriminatory? , 2018, NeurIPS.

[65]  Zhiwei Steven Wu,et al.  Privacy-Preserving Generative Deep Neural Networks Support Clinical Data Sharing , 2017, bioRxiv.

[66]  M. Lungren,et al.  Preparing Medical Imaging Data for Machine Learning. , 2020, Radiology.

[67]  Mariarosaria Taddeo,et al.  How to Design AI for Social Good: Seven Essential Factors , 2020, Science and Engineering Ethics.

[69]  G. Falcone,et al.  Genetic Variation and Response to Neurocritical Illness: a Powerful Approach to Identify Novel Pathophysiological Mechanisms and Therapeutic Targets , 2020, Neurotherapeutics.

[70]  B. Knoppers,et al.  Genomic Sequencing Capacity, Data Retention, and Personal Access to Raw Data in Europe , 2020, Frontiers in Genetics.