Machine learning guided signal enrichment for ultrasensitive plasma tumor burden monitoring

In solid tumor oncology, circulating tumor DNA (ctDNA) is poised to transform care through accurate assessment of minimal residual disease (MRD) and therapeutic response monitoring. To overcome the sparsity of ctDNA fragments in low tumor fraction (TF) settings and increase MRD sensitivity, we previously leveraged genome-wide mutational integration through plasma whole genome sequencing (WGS). We now introduce MRD-EDGE, a composite machine learning-guided WGS ctDNA single nucleotide variant (SNV) and copy number variant (CNV) detection platform designed to increase signal enrichment. MRD-EDGE uses deep learning and a ctDNA-specific feature space to increase SNV signal to noise enrichment in WGS by 300X compared to our previous noise suppression platform MRDetect. MRD-EDGE also reduces the degree of aneuploidy needed for ultrasensitive CNV detection through WGS from 1Gb to 200Mb, thereby expanding its applicability to a wider range of solid tumors. We harness the improved performance to track changes in tumor burden in response to neoadjuvant immunotherapy in non-small cell lung cancer and demonstrate ctDNA shedding in precancerous colorectal adenomas. Finally, the radical signal to noise enrichment in MRD-EDGE enables de novo mutation calling in melanoma without matched tumor, yielding clinically informative TF monitoring for patients on immune checkpoint inhibition.

[1]  M. Ladanyi,et al.  Improved prediction of immune checkpoint blockade efficacy across multiple cancer types , 2021, Nature Biotechnology.

[2]  C. Andersen,et al.  Circulating Tumor DNA in Stage III Colorectal Cancer, beyond Minimal Residual Disease Detection, toward Assessment of Adjuvant Therapy Efficacy and Clinical Behavior of Recurrences , 2021, Clinical Cancer Research.

[3]  H. C. Donker,et al.  Dynamic Changes of Circulating Tumor DNA Predict Clinical Outcome in Patients With Advanced Non–Small-Cell Lung Cancer Treated With Immune Checkpoint Inhibitors , 2021, JCO precision oncology.

[4]  Trevor J Pugh,et al.  Pan-cancer analysis of longitudinal metastatic tumors reveals genomic alterations and immune landscape dynamics associated with pembrolizumab sensitivity , 2021, Nature Communications.

[5]  R. Sullivan,et al.  Early Use of High-Dose Glucocorticoid for the Management of irAE Is Associated with Poorer Survival in Patients with Advanced Melanoma Treated with Anti–PD-1 Monotherapy , 2021, Clinical Cancer Research.

[6]  C. Swanton,et al.  Clinical validation of a targeted methylation-based multi-cancer early detection test using an independent validation set. , 2021, Annals of oncology : official journal of the European Society for Medical Oncology.

[7]  Ash A. Alizadeh,et al.  Enhanced detection of minimal residual disease by targeted sequencing of phased variants in circulating tumor DNA , 2021, Nature Biotechnology.

[8]  Julie L. Yang,et al.  Enhanced specificity of clinical high-sensitivity tumor mutation profiling in cell-free DNA via paired normal sequencing using MSK-ACCESS , 2021, Nature Communications.

[9]  T. Powles,et al.  ctDNA guiding adjuvant immunotherapy in urothelial carcinoma , 2021, Nature.

[10]  K. D. Sørensen,et al.  Discovering fragment length signatures of circulating tumor DNA using Non-negative Matrix Factorization , 2021, bioRxiv.

[11]  N. Sanfilippo,et al.  Neoadjuvant durvalumab with or without stereotactic body radiotherapy in patients with early-stage non-small-cell lung cancer: a single-centre, randomised phase 2 trial. , 2021, The Lancet. Oncology.

[12]  N. Friedman,et al.  ChIP-seq of plasma cell-free nucleosomes identifies gene expression programs of the cells-of-origin , 2021, Nature Biotechnology.

[13]  T. Graham,et al.  LiquidCNA: Tracking subclonal evolution from longitudinal liquid biopsies using somatic copy number alterations , 2021, bioRxiv.

[14]  Ash A. Alizadeh,et al.  Noninvasive Early Identification of Therapeutic Benefit from Immune Checkpoint Inhibition , 2020, Cell.

[15]  J. Clemente,et al.  Detecting and phasing minor single-nucleotide variants from long-read sequencing data , 2020, Nature Communications.

[16]  You Lu,et al.  Pseudoprogression and hyperprogression in lung cancer: a comprehensive review of literature , 2020, Journal of Cancer Research and Clinical Oncology.

[17]  Song Wu,et al.  Prognostic and predictive impact of circulating tumor DNA in patients with advanced cancers treated with immune checkpoint blockade. , 2020, Cancer discovery.

[18]  Trevor J Pugh,et al.  Personalized circulating tumor DNA analysis as a predictive biomarker in solid tumor patients treated with pembrolizumab , 2020, Nature Cancer.

[19]  Yang Xu,et al.  Quantitative characterization of tumor cell-free DNA shortening , 2020, BMC Genomics.

[20]  N. Rosenfeld,et al.  ctDNA monitoring using patient-specific sequencing and integration of variant reads , 2020, Science Translational Medicine.

[21]  Rafael C. Schulman,et al.  Genome-wide cell-free DNA mutational integration enables ultra-sensitive cancer monitoring , 2020, Nature Medicine.

[22]  D. Tougeron,et al.  Detection of Colorectal Cancer and Advanced Adenoma by Liquid Biopsy (Decalib Study): The ddPCR Challenge , 2020, Cancers.

[23]  J. Wolchok,et al.  A phase II study to evaluate the need for > two doses of nivolumab + ipilimumab combination (combo) immunotherapy. , 2020 .

[24]  S. Dawson,et al.  Longitudinal Monitoring of ctDNA in Patients with Melanoma and Brain Metastases Treated with Immune Checkpoint Inhibitors , 2020, Clinical Cancer Research.

[25]  D. Berry,et al.  Sensitive and specific multi-cancer detection and localization using methylation signatures in cell-free DNA , 2020, Annals of oncology : official journal of the European Society for Medical Oncology.

[26]  Ash A. Alizadeh,et al.  Integrating genomic features for non-invasive early lung cancer detection , 2020, Nature.

[27]  V. Wong,et al.  Plasma DNA end motif profiling as a fragmentomic marker in cancer, pregnancy and transplantation. , 2020, Cancer discovery.

[28]  Ash A. Alizadeh,et al.  A mathematical model of ctDNA shedding predicts tumor detection size , 2020, Science Advances.

[29]  Kristian Cibulskis,et al.  Calling Somatic SNVs and Indels with Mutect2 , 2019, bioRxiv.

[30]  Michael C. Zody,et al.  Deep whole-genome sequencing of 3 cancer cell lines on 2 sequencing platforms , 2019, Scientific Reports.

[31]  Aditya Deshpande,et al.  Robust foreground detection in somatic copy number data , 2019, bioRxiv.

[32]  Kyle Xiong,et al.  Revealing Hi-C subcompartments by imputing inter-chromosomal chromatin interactions , 2019, Nature Communications.

[33]  David R. Jones,et al.  High-intensity sequencing reveals the sources of plasma circulating cell-free DNA variants , 2019, Nature Medicine.

[34]  M. Schimek,et al.  Inference of transcription factor binding from cell-free DNA enables tumor subtype prediction and early detection , 2019, Nature Communications.

[35]  Anshul Kundaje,et al.  The ENCODE Blacklist: Identification of Problematic Regions of the Genome , 2019, Scientific Reports.

[36]  S. Guraya Pattern, Stage, and Time of Recurrent Colorectal Cancer After Curative Surgery. , 2019, Clinical colorectal cancer.

[37]  H. Nielsen,et al.  Genome-wide cell-free DNA fragmentation in patients with cancer , 2019, Nature.

[38]  W. Guo,et al.  Clonal hematopoiesis mutations in plasma cfDNA RAS/BRAF genotyping of metastatic colorectal cancer. , 2019, Journal of Clinical Oncology.

[39]  S. Shchegrova,et al.  Analysis of Plasma Cell-Free DNA by Ultradeep Sequencing in Patients With Stages I to III Colorectal Cancer , 2019, JAMA oncology.

[40]  A. Gonzalez-Perez,et al.  Local Determinants of the Mutational Landscape of the Human Genome , 2019, Cell.

[41]  Ruibang Luo,et al.  A multi-task convolutional deep neural network for variant calling in single molecule sequencing , 2019, Nature Communications.

[42]  Ryan L. Collins,et al.  The mutational constraint spectrum quantified from variation in 141,456 humans , 2020, Nature.

[43]  H. Egi,et al.  Spontaneous regression of transverse colon cancer with high-frequency microsatellite instability: a case report and literature review , 2019, World Journal of Surgical Oncology.

[44]  S. Tsunoda,et al.  Age-related remodelling of oesophageal epithelia by mutated cancer drivers , 2019, Nature.

[45]  Jacques Fellay,et al.  Blacklisting variants common in private cohorts but not in public databases optimizes human exome analysis , 2018, Proceedings of the National Academy of Sciences.

[46]  Peter J. Campbell,et al.  Somatic mutant clones colonize the human esophagus with age , 2018, Science.

[47]  Steven J. M. Jones,et al.  Sources of erroneous sequences and artifact chimeric reads in next generation sequencing of genomic DNA from formalin-fixed paraffin-embedded samples , 2018, Nucleic acids research.

[48]  Keval Patel,et al.  Enhanced detection of circulating tumor DNA by fragment size analysis , 2018, Science Translational Medicine.

[49]  Nuria Lopez-Bigas,et al.  Somatic and Germline Mutation Periodicity Follow the Orientation of the DNA Minor Groove around Nucleosomes , 2018, Cell.

[50]  J. McPherson,et al.  Sensitive tumour detection and classification using plasma cell-free DNA methylomes , 2018, Nature.

[51]  V. Wong,et al.  Preferred end coordinates and somatic variants as signatures of circulating tumor DNA associated with hepatocellular carcinoma , 2018, Proceedings of the National Academy of Sciences.

[52]  Mauro A. A. Castro,et al.  The chromatin accessibility landscape of primary human cancers , 2018, Science.

[53]  Thomas Colthurst,et al.  A universal SNP and small-indel variant caller using deep neural networks , 2018, Nature Biotechnology.

[54]  P. Tarpey,et al.  Circulating tumor DNA in patients with colorectal adenomas: assessment of detectability and genetic heterogeneity , 2018, Cell Death & Disease.

[55]  Peiyong Jiang,et al.  Size-tagged preferred ends in maternal plasma DNA shed light on the production mechanism and show utility in noninvasive prenatal testing , 2018, Proceedings of the National Academy of Sciences.

[56]  R. Wolff,et al.  Mutation analysis of adenomas and carcinomas of the colon: Early and late drivers , 2018, Genes, chromosomes & cancer.

[57]  Ashton C. Berger,et al.  Genomic and Functional Approaches to Understanding Cancer Aneuploidy. , 2018, Cancer cell.

[58]  C. Paweletz,et al.  False-Positive Plasma Genotyping Due to Clonal Hematopoiesis , 2018, Clinical Cancer Research.

[59]  Ludmila V. Danilova,et al.  Detection and localization of surgically resectable cancers with a multi-analyte blood test , 2018, Science.

[60]  O. Elemento,et al.  Challenges in Using ctDNA to Achieve Early Detection of Cancer , 2017, bioRxiv.

[61]  Nikhil Wagle,et al.  Scalable whole-exome sequencing of cell-free DNA reveals high concordance with metastatic tumors , 2017, Nature Communications.

[62]  D. Schadendorf,et al.  Overall Survival with Combined Nivolumab and Ipilimumab in Advanced Melanoma , 2017, The New England journal of medicine.

[63]  Sam Angiuoli,et al.  Direct detection of early-stage cancers using circulating tumor DNA , 2017, Science Translational Medicine.

[64]  Benjamin J. Raphael,et al.  The evolutionary history of 2,658 cancers , 2017, Nature.

[65]  J. Gu,et al.  The somatic mutation landscape of premalignant colorectal adenoma , 2017, Gut.

[66]  Andrew Menzies,et al.  ascatNgs: Identifying Somatically Acquired Copy‐Number Alterations from Whole‐Genome Sequencing Data , 2016, Current protocols in bioinformatics.

[67]  H. Nielsen,et al.  Protocol Outlines for Parts 1 and 2 of the Prospective Endoscopy III Study for the Early Detection of Colorectal Cancer: Validation of a Concept Based on Blood Biomarkers , 2016, JMIR research protocols.

[68]  R. Strausberg,et al.  Circulating tumor DNA analysis detects minimal residual disease and predicts recurrence in patients with stage II colon cancer , 2016, Science Translational Medicine.

[69]  Noah C Welker,et al.  Fragment Length of Circulating Tumor DNA , 2016, PLoS genetics.

[70]  Vladimir Vacic,et al.  Conpair: concordance and contamination estimator for matched tumor–normal pairs , 2016, Bioinform..

[71]  M. Phipps,et al.  Screening for Colorectal Cancer: US Preventive Services Task Force Recommendation Statement. , 2016, JAMA.

[72]  V. Seshan,et al.  FACETS: allele-specific copy number and clonal heterogeneity analysis tool for high-throughput DNA sequencing , 2016, Nucleic acids research.

[73]  Joo Hoon Kim,et al.  Effects of microsatellite instability on recurrence patterns and outcomes in colorectal cancers , 2016, British Journal of Cancer.

[74]  M. Stratton,et al.  Mutational signatures associated with tobacco smoking in human cancer , 2016, Science.

[75]  Ash A. Alizadeh,et al.  Integrated digital error suppression for improved detection of circulating tumor DNA , 2016, Nature Biotechnology.

[76]  P. Hanawalt,et al.  Mutational Strand Asymmetries in Cancer Genomes Reveal Mechanisms of DNA Damage and Repair , 2016, Cell.

[77]  Matthew W. Snyder,et al.  Cell-free DNA Comprises an In Vivo Nucleosome Footprint that Informs Its Tissues-Of-Origin , 2016, Cell.

[78]  R. Dummer,et al.  The Genetic Evolution of Melanoma from Precursor Lesions. , 2015, The New England journal of medicine.

[79]  Radhakrishnan Sabarinathan,et al.  Nucleotide excision repair is impaired by binding of transcription factors to DNA , 2015, Nature.

[80]  M. Burotto,et al.  Pseudoprogression and Immune-Related Response in Solid Tumors. , 2015, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[81]  Donavan T. Cheng,et al.  Memorial Sloan Kettering-Integrated Mutation Profiling of Actionable Cancer Targets (MSK-IMPACT): A Hybridization Capture-Based Next-Generation Sequencing Clinical Assay for Solid Tumor Molecular Oncology. , 2015, The Journal of molecular diagnostics : JMD.

[82]  H. Nielsen,et al.  Analysis of circulating tumour DNA to monitor disease burden following colorectal cancer surgery , 2015, Gut.

[83]  Dimitrios I. Fotiadis,et al.  Machine learning applications in cancer prognosis and prediction , 2014, Computational and structural biotechnology journal.

[84]  Nicolai J. Birkbak,et al.  Sequenza: allele-specific copy number and mutation profiles from tumor sequencing data , 2014, Annals of oncology : official journal of the European Society for Medical Oncology.

[85]  Shuifang Zhu,et al.  Skewer: a fast and accurate adapter trimmer for next-generation sequencing paired-end reads , 2014, BMC Bioinformatics.

[86]  Alison S. Devonshire,et al.  Towards standardisation of cell-free DNA measurement in plasma: controls for extraction efficiency, fragment size bias and quantification , 2014, Analytical and Bioanalytical Chemistry.

[87]  Ash A. Alizadeh,et al.  An ultrasensitive method for quantitating circulating tumor DNA with broad patient coverage , 2013, Nature Medicine.

[88]  Jeff Vierstra,et al.  Coupling transcription factor occupancy to nucleosome architecture with DNase-FLASH , 2013, Nature Methods.

[89]  David T. W. Jones,et al.  Signatures of mutational processes in human cancer , 2013, Nature.

[90]  N. Rosenfeld,et al.  Non-invasive analysis of acquired resistance to cancer therapy by sequencing of plasma DNA , 2013, Nature.

[91]  M. Thayer,et al.  DNA replication timing, genome stability and cancer: late and/or delayed DNA replication timing is associated with increased genomic instability. , 2013, Seminars in cancer biology.

[92]  Raymond K. Auerbach,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[93]  A. McKenna,et al.  Absolute quantification of somatic DNA alterations in human cancer , 2012, Nature Biotechnology.

[94]  M. Risio The Natural History of pT1 Colorectal Cancer , 2012, Front. Oncol..

[95]  Manolis Kellis,et al.  ChromHMM: automating chromatin-state discovery and characterization , 2012, Nature Methods.

[96]  Wen-Hsiung Li,et al.  DNA replication timing and selection shape the landscape of nucleotide variation in cancer genomes , 2012, Nature Communications.

[97]  X. Wang,et al.  Potential Clinical Significance of a Plasma-Based KRAS Mutation Analysis in Patients with Advanced Non–Small Cell Lung Cancer , 2010, Clinical Cancer Research.

[98]  M. Meyerson,et al.  EGFR mutation and resistance of non-small-cell lung cancer to gefitinib. , 2005, The New England journal of medicine.

[99]  C. Cornelisse,et al.  DNA aneuploidy in colorectal adenomas. , 1987, British Journal of Cancer.

[100]  Raymond K. Auerbach,et al.  PeakSeq enables systematic scoring of ChIP-seq experiments relative to controls , 2009, Nature Biotechnology.

[101]  Claude-Alain H. Roten,et al.  Fast and accurate short read alignment with Burrows–Wheeler transform , 2009, Bioinform..

[102]  S. Goodman,et al.  Circulating mutant DNA to assess tumor dynamics , 2008, Nature Medicine.

[103]  M. Torres,et al.  DNA aneuploidy in colorectal adenomas. Role in the adenoma-carcinoma sequence. , 2005, Revista espanola de enfermedades digestivas : organo oficial de la Sociedad Espanola de Patologia Digestiva.