Blockchain-Authenticated Sharing of Genomic and Clinical Outcomes Data of Patients With Cancer: A Prospective Cohort Study

Background Efficiently sharing health data produced during standard care could dramatically accelerate progress in cancer treatments, but various barriers make this difficult. Not sharing these data to ensure patient privacy is at the cost of little to no learning from real-world data produced during cancer care. Furthermore, recent research has demonstrated a willingness of patients with cancer to share their treatment experiences to fuel research, despite potential risks to privacy. Objective The objective of this study was to design, pilot, and release a decentralized, scalable, efficient, economical, and secure strategy for the dissemination of deidentified clinical and genomic data with a focus on late-stage cancer. Methods We created and piloted a blockchain-authenticated system to enable secure sharing of deidentified patient data derived from standard of care imaging, genomic testing, and electronic health records (EHRs), called the Cancer Gene Trust (CGT). We prospectively consented and collected data for a pilot cohort (N=18), which we uploaded to the CGT. EHR data were extracted from both a hospital cancer registry and a common data model (CDM) format to identify optimal data extraction and dissemination practices. Specifically, we scored and compared the level of completeness between two EHR data extraction formats against the gold standard source documentation for patients with available data (n=17). Results Although the total completeness scores were greater for the registry reports than those for the CDM, this difference was not statistically significant. We did find that some specific data fields, such as histology site, were better captured using the registry reports, which can be used to improve the continually adapting CDM. In terms of the overall pilot study, we found that CGT enables rapid integration of real-world data of patients with cancer in a more clinically useful time frame. We also developed an open-source Web application to allow users to seamlessly search, browse, explore, and download CGT data. Conclusions Our pilot demonstrates the willingness of patients with cancer to participate in data sharing and how blockchain-enabled structures can maintain relationships between individual data elements while preserving patient privacy, empowering findings by third-party researchers and clinicians. We demonstrate the feasibility of CGT as a framework to share health data trapped in silos to further cancer research. Further studies to optimize data representation, stream, and integrity are required.

[1]  Mark Shervey,et al.  Privacy-Preserving Methods for Feature Engineering Using Blockchain: Review, Evaluation, and Proof of Concept , 2019, Journal of medical Internet research.

[2]  David Brindley,et al.  Implementing Blockchains for Efficient Health Care: Systematic Review , 2019, Journal of medical Internet research.

[3]  I. Kohane,et al.  Biases in electronic health record data due to processes within the healthcare system: retrospective observational study , 2018, British Medical Journal.

[4]  Peter Szolovits,et al.  MIMIC-III, a freely accessible critical care database , 2016, Scientific Data.

[5]  David Bourque,et al.  Association of Patient Characteristics and Tumor Genomics With Clinical Outcomes Among Patients With Non–Small Cell Lung Cancer Using a Clinicogenomic Database , 2019, JAMA.

[6]  Alex M. Fichtenholtz,et al.  Development and validation of a clinical cancer genomic profiling test based on massively parallel DNA sequencing , 2013, Nature Biotechnology.

[7]  Zhiwei Steven Wu,et al.  Privacy-Preserving Generative Deep Neural Networks Support Clinical Data Sharing , 2017, bioRxiv.

[8]  C. Sander,et al.  Genome Sequencing Identifies a Basis for Everolimus Sensitivity , 2012, Science.

[9]  R. Schilsky Personalized medicine in oncology: the future is now , 2010, Nature Reviews Drug Discovery.

[10]  Kipp W. Johnson,et al.  The next generation of precision medicine: observational studies, electronic health records, biobanks and continuous monitoring. , 2018, Human molecular genetics.

[11]  Sanchita Bhattacharya,et al.  Prototype of running clinical trials in an untrustworthy environment using blockchain , 2019, Nature Communications.

[12]  M. Ferguson,et al.  Core Clinical Data Elements for Cancer Genomic Repositories: A Multi-stakeholder Consensus , 2017, Cell.

[13]  Atul J Butte,et al.  A call for deep-learning healthcare , 2019, Nature Medicine.

[14]  George Hripcsak,et al.  Facilitating phenotype transfer using a common data model , 2019, J. Biomed. Informatics.

[15]  Arie Perry,et al.  Targeted next-generation sequencing of pediatric neuro-oncology patients improves diagnosis, identifies pathogenic germline mutations, and directs targeted therapy , 2016, Neuro-oncology.

[16]  S. Goodman,et al.  Clinical Trial Participants’ Views of the Risks and Benefits of Data Sharing , 2018, The New England journal of medicine.

[17]  Atalay Mert Ileri,et al.  Realizing the potential of blockchain technologies in genomics , 2018, Genome research.

[18]  Luc Rocher,et al.  Estimating the success of re-identifications in incomplete datasets using generative models , 2019, Nature Communications.

[19]  Charles Friedman,et al.  Can learning health systems help organisations deliver personalised care? , 2017, BMC Medicine.

[20]  Yu Rang Park,et al.  Is Blockchain Technology Suitable for Managing Personal Health Records? Mixed-Methods Study to Test Feasibility , 2019, Journal of medical Internet research.

[21]  R. Califf,et al.  Real-World Evidence - What Is It and What Can It Tell Us? , 2016, The New England journal of medicine.

[22]  Tsung-Ting Kuo,et al.  Comparison of blockchain platforms: a systematic review and healthcare examples , 2019, J. Am. Medical Informatics Assoc..

[23]  Benjamin S. Glicksberg,et al.  ROMOP: a light-weight R package for interfacing with OMOP-formatted electronic health record data , 2019, JAMIA open.

[24]  Yury Yanovich,et al.  Converging blockchain and next-generation artificial intelligence technologies to decentralize and accelerate biomedical research and healthcare , 2015, Oncotarget.

[25]  J. Overhage,et al.  Advancing the Science for Active Surveillance: Rationale and Design for the Observational Medical Outcomes Partnership , 2010, Annals of Internal Medicine.

[26]  David A. Solomon,et al.  Genomic Profiling of Malignant Peritoneal Mesothelioma Reveals Recurrent Alterations in Epigenetic Regulatory Genes BAP1, SETD2, and DDX3X , 2016, Modern Pathology.

[27]  Julia Adler-Milstein,et al.  Sharing clinical data electronically: a critical challenge for fixing the health care system. , 2012, JAMA.

[28]  Fusheng Wang,et al.  Secure and Trustable Electronic Medical Records Sharing using Blockchain , 2017, AMIA.

[29]  Douglas C. Schmidt,et al.  FHIRChain: Applying Blockchain to Securely and Scalably Share Clinical Data , 2018, Computational and structural biotechnology journal.

[30]  Sean Khozin,et al.  Real-World Evidence In Support Of Precision Medicine: Clinico-Genomic Cancer Data As A Case Study. , 2018, Health affairs.

[31]  Theodore C Goldstein,et al.  PatientExploreR: an extensible application for dynamic visualization of patient clinical history from electronic health records in the OMOP common data model , 2019, Bioinform..