Genomic data offer valuable insights that can be used to help find treatments and cures for disease. Precision medicine, defined by the NIH as “an emerging approach for disease treatment and prevention that takes into account individual variability in genes, environment, and lifestyle for each person,” is gaining acceptance among physicians, who are beginning to integrate patient-centric data analysis into clinical decision-making. Although precision medicine makes use of various types of data, this piece focuses on molecular characterization data specifically, as the discoveries yielded from these data can advance thinking around clinical care for cancer patients. Our pediatrics genomics team at the University of California Santa Cruz Genomics Institute is uniquely situated to discuss the use of shared genomic data for clinical benefit because our collaborations with hospital partners in the United States and internationally rely on big-data comparative genomic analysis. Using shared data, Treehouse Childhood Cancer Initiative develops methods for comparative analysis of tumor RNA sequencing profiles from single patients for the purposes of identifying overexpressed oncogenes that could be targeted by therapies in the clinic. To enable and improve this analysis, we continuously increase the size of our data compendium by adding public pediatric tumor RNA sequencing data sets. We developed an approach for assessing the quality of shared RNA sequencing data to ensure the integrity of the data. In this approach we calculate the number of mapped exonic nonduplicate (MEND) reads, applying a 10 million MEND read minimum threshold for inclusion in our comparative analysis. In collaboration with Stanford University and Lucile Packard Children's Hospital Stanford, our team at Treehouse Childhood Cancer Initiative explores the value to researchers everywhere of shared genomic data for clinical utility and the challenges of data sharing that threaten to impede otherwise rapid advances in precision medicine. This Perspective offers recommendations for maximizing the use of genomic data to make discoveries that will benefit patients.
[1]
Mark Gerstein,et al.
Genomics and Privacy: Implications of the New Reality of Closed Data for the Field
,
2011,
PLoS Comput. Biol..
[2]
Weida Tong,et al.
Comprehensive Assessments of RNA-seq by the SEQC Consortium: FDA-Led Efforts Advance Precision Medicine
,
2016,
Pharmaceutics.
[3]
S. Bale,et al.
Clinical Versus Research Sequencing.
,
2016,
Cold Spring Harbor perspectives in medicine.
[4]
Kim M. Summers,et al.
Integration of quantitated expression estimates from polyA-selected and rRNA-depleted RNA-seq libraries
,
2017,
BMC Bioinformatics.
[5]
Marilyn M. Li,et al.
Standards and Guidelines for the Interpretation and Reporting of Sequence Variants in Cancer: A Joint Consensus Recommendation of the Association for Molecular Pathology, American Society of Clinical Oncology, and College of American Pathologists.
,
2017,
The Journal of molecular diagnostics : JMD.
[6]
Michael Krawczak,et al.
Patient views on research use of clinical data without consent: Legal, but also acceptable?
,
2019,
European Journal of Human Genetics.
[7]
David Haussler,et al.
Barriers to accessing public cancer genomic data
,
2019,
Scientific Data.
[8]
J. Biegel,et al.
The genomic landscape of pediatric cancers: Implications for diagnosis and treatment
,
2019,
Science.