How to analyse the spatiotemporal tumour samples needed to investigate cancer evolution: A case study using paired primary and recurrent glioblastoma

Many traits of cancer progression (e.g., development of metastases or resistance to therapy) are facilitated by tumour evolution: Darwinian selection of subclones with distinct genotypes or phenotypes that enable such progression. Characterising these subclones provide an opportunity to develop drugs to better target their specific properties but requires the accurate identification of somatic mutations shared across multiple spatiotemporal tumours from the same patient. Current best practices for calling somatic mutations are optimised for single samples, and risk being too conservative to identify shared mutations with low prevalence in some samples. We reasoned that datasets from multiple matched tumours can be used for mutual validation and thus propose an adapted two‐stage approach: (1) low‐stringency mutation calling to identify mutations shared across samples irrespective of the weight of evidence in a single sample; (2) high‐stringency mutation calling to further characterise mutations present in a single sample. We applied our approach to three‐independent cohorts of paired primary and recurrent glioblastoma tumours, two of which have previously been analysed using existing approaches, and found that it significantly increased the amount of biologically relevant shared somatic mutations identified. We also found that duplicate removal was detrimental when identifying shared somatic mutations. Our approach is also applicable when multiple datasets e.g. DNA and RNA are available for the same tumour.

[1]  Steven J. M. Jones,et al.  Mutational Analysis Reveals the Origin and Therapy-Driven Evolution of Recurrent Glioma , 2014, Science.

[2]  Jill S Barnholtz-Sloan,et al.  Whole-genome and multisector exome sequencing of primary and post-treatment glioblastoma reveals patterns of tumor evolution , 2015, Genome research.

[3]  Mads Thomassen,et al.  Evaluation of Nine Somatic Variant Callers for Detection of Somatic Mutations in Exome and Targeted Deep Sequencing Data , 2016, PloS one.

[4]  In-Hee Lee,et al.  Spatiotemporal Evolution of the Primary Glioblastoma Genome. , 2015, Cancer cell.

[5]  Peilin Jia,et al.  Detecting somatic point mutations in cancer genome sequencing data: a comparison of mutation callers , 2013, Genome Medicine.

[6]  Marc J. Williams,et al.  Identification of neutral tumor evolution across cancer types , 2016, Nature Genetics.

[7]  Lucy F. Stead,et al.  The clonal relationships between pre‐cancer and cancer revealed by ultra‐deep sequencing , 2015, The Journal of pathology.

[8]  David M. Thomas,et al.  Sequence artefacts in a prospective series of formalin-fixed tumours tested for mutations in hotspot regions by massively parallel sequencing , 2014, BMC Medical Genomics.

[9]  Anqi Xiong,et al.  Heparan sulfate in the regulation of neural differentiation and glioma development , 2014, The FEBS journal.

[10]  Richard B. Schwab,et al.  Identification of high-confidence somatic mutations in whole genome sequence of formalin-fixed breast cancer specimens , 2012, Nucleic acids research.

[11]  Sarah C. Ayling,et al.  The Ensembl gene annotation system , 2016, Database J. Biol. Databases Curation.

[12]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[13]  Philip Quirke,et al.  Accurately Identifying Low‐Allelic Fraction Variants in Single Samples with Next‐Generation Sequencing: Applications in Tumor Subclone Resolution , 2013, Human mutation.

[14]  V. P. Collins,et al.  Intratumor heterogeneity in human glioblastoma reflects cancer evolutionary dynamics , 2013, Proceedings of the National Academy of Sciences.

[15]  Steven A. Roberts,et al.  Mutational heterogeneity in cancer and the search for new cancer-associated genes , 2013 .

[16]  R. Britto,et al.  Upregulation of ASCL1 and inhibition of Notch signaling pathway characterize progressive astrocytoma , 2005, Oncogene.

[17]  Lucy F. Stead,et al.  Elucidating drivers of oral epithelial dysplasia formation and malignant transformation to cancer using RNAseq , 2015, Oncotarget.

[18]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[19]  E. Hoxha,et al.  A novel NFIA-NF&kgr;B feed-forward loop contributes to glioblastoma cell survival , 2016, Neuro-oncology.

[20]  In-Hee Lee,et al.  Clonal evolution of glioblastoma under therapy , 2016, Nature Genetics.

[21]  Jeyakumar Natarajan,et al.  Overview of the interactive task in BioCreative V , 2015, Database J. Biol. Databases Curation.

[22]  L. Pusztai,et al.  Cancer heterogeneity: implications for targeted therapeutics , 2013, British Journal of Cancer.

[23]  M. DePristo,et al.  A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.

[24]  P. A. Futreal,et al.  Genomic architecture and evolution of clear cell renal cell carcinomas defined by multiregion sequencing , 2014, Nature Genetics.

[25]  D. Nam,et al.  WNT signaling in glioblastoma and therapeutic opportunities , 2016, Laboratory Investigation.

[26]  P. Babu,et al.  Implications of mitogen‐activated protein kinase signaling in glioma , 2016, Journal of neuroscience research.