Inferring expressed genes by whole-genome sequencing of plasma DNA

The analysis of cell-free DNA (cfDNA) in plasma is a rapidly advancing field in medicine. cfDNA consists predominantly of nucleosome-protected DNA shed into the bloodstream by cells undergoing apoptosis. We performed whole-genome sequencing of plasma DNA and identified two discrete regions at transcription start sites (TSSs) where nucleosome occupancy results in different read depth coverage patterns for expressed and silent genes. By employing machine learning for gene classification, we found that the plasma DNA read depth patterns from healthy donors reflected the expression signature of hematopoietic cells. In patients with metastatic cancer, we classified expressed cancer driver genes in regions with somatic copy number gains with high accuracy. For genes with several TSSs, we were also able to determine the expressed isoform, as confirmed by RNA-seq analysis of the matching primary tumor. Our analyses provide functional information about the cells releasing their DNA into the circulation.
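As a rough illustration of the read depth idea described above, the following Python sketch summarizes coverage in two windows around a single TSS and applies a simple cutoff. It is not the classifier used in this work: the window bounds, the cutoff values, and all function names are hypothetical, and a fixed threshold stands in for the machine learning step. It also assumes that lower relative coverage at the TSS indicates reduced nucleosome occupancy and therefore expression.

```python
from statistics import mean

# Hypothetical window bounds in bp relative to the TSS.
# The abstract mentions two regions but does not give their coordinates.
BROAD_WINDOW = (-1000, 1000)
CORE_WINDOW = (-150, 50)


def window_mean(coverage, flank, window):
    """Mean read depth over an inclusive window relative to the TSS.

    coverage is a per-base depth list in which coverage[flank] is the TSS base.
    """
    start, end = window
    return mean(coverage[flank + start : flank + end + 1])


def tss_features(coverage, flank, baseline):
    """Return (broad, core) mean depth, each divided by a baseline depth.

    baseline would typically be the sample-wide mean depth (an assumption).
    """
    if baseline <= 0:
        raise ValueError("baseline depth must be positive")
    broad = window_mean(coverage, flank, BROAD_WINDOW) / baseline
    core = window_mean(coverage, flank, CORE_WINDOW) / baseline
    return broad, core


def looks_expressed(features, broad_cutoff=0.9, core_cutoff=0.7):
    """Toy stand-in for a trained classifier; the cutoffs are illustrative only."""
    broad, core = features
    return broad < broad_cutoff and core < core_cutoff


if __name__ == "__main__":
    flank = 1000
    # Synthetic profile: reduced depth overall, with a deeper dip near the TSS.
    profile = [8.0] * (2 * flank + 1)
    for i in range(flank - 150, flank + 51):
        profile[i] = 4.0
    print(looks_expressed(tss_features(profile, flank, baseline=10.0)))
```

In practice the per-base depth would come from aligned plasma reads, and the two features would be passed to a model trained on genes of known expression status rather than compared against fixed cutoffs.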
