A new era of long-read sequencing for cancer genomics

Cancer is a disease largely caused by genomic aberrations. Utilizing many rapidly emerging sequencing technologies, researchers have studied cancer genomes to understand the molecular statuses of cancer cells and to reveal their vulnerabilities, such as driver mutations or gene expression. Long-read technologies enable us to identify and characterize novel types of cancerous mutations, including complicated structural variants in haplotype resolution. In this review, we introduce three representative platforms for long-read sequencing and research trends of cancer genomics with long-read data. Further, we describe that aberrant transcriptome and epigenome statuses, namely, fusion transcripts, as well as aberrant transcript isoforms and the phase information of DNA methylation, are able to be elucidated by long-read sequencers. Long-read sequencing may shed light on novel types of aberrations in cancer genomics that are being missed by conventional short-read sequencing analyses.

[1]  Winston Timp,et al.  Detecting DNA cytosine methylation using nanopore sequencing , 2017, Nature Methods.

[2]  Matei David,et al.  Nanocall: an open source basecaller for Oxford Nanopore sequencing data , 2016, bioRxiv.

[3]  Heng Li,et al.  Minimap2: pairwise alignment for nucleotide sequences , 2017, Bioinform..

[4]  D. Planchard,et al.  Osimertinib in the treatment of patients with epidermal growth factor receptor T790M mutation-positive metastatic non-small cell lung cancer: clinical trial evidence and experience , 2016, Therapeutic advances in respiratory disease.

[5]  Niranjan Nagarajan,et al.  Fast and sensitive mapping of nanopore sequencing reads with GraphMap , 2016, Nature Communications.

[6]  Brent S. Pedersen,et al.  Nanopore sequencing and assembly of a human genome with ultra-long reads , 2017, Nature Biotechnology.

[7]  M. Esteller Epigenetic gene silencing in cancer: the DNA hypermethylome. , 2007, Human molecular genetics.

[8]  R. Gregory,et al.  The m(6)A Methyltransferase METTL3 Promotes Translation in Human Cancer Cells. , 2016, Molecular cell.

[9]  Sumio Sugano,et al.  Aberrant transcriptional regulations in cancers: genome, transcriptome and epigenome analysis of lung adenocarcinoma cell lines , 2014, Nucleic acids research.

[10]  P. Agius,et al.  Immunogenic neoantigens derived from gene fusions stimulate T cell responses , 2019, Nature Medicine.

[11]  W. Hahn,et al.  TERT promoter mutations and monoallelic activation of TERT in cancer , 2015, Oncogenesis.

[12]  Minh Duc Cao,et al.  Chiron: translating nanopore raw signal directly into nucleotide sequence using deep learning , 2017, bioRxiv.

[13]  J. Herman,et al.  Gene silencing in cancer in association with promoter hypermethylation. , 2003, The New England journal of medicine.

[14]  Christos Proukakis,et al.  Evaluation of the detection of GBA missense mutations and other variants using the Oxford Nanopore MinION , 2019, Molecular genetics & genomic medicine.

[15]  L. Feuk,et al.  Structural variation in the human genome , 2006, Nature Reviews Genetics.

[16]  Lynda Chin,et al.  Highly Recurrent TERT Promoter Mutations in Human Melanoma , 2013, Science.

[17]  Michael C. Schatz,et al.  Accurate detection of complex structural variations using single molecule sequencing , 2017, Nature Methods.

[18]  Junwei Shi,et al.  Promoter-bound METTL3 maintains myeloid leukaemia by m6A-dependent translation control , 2017, Nature.

[19]  Jordan M. Eizenga,et al.  Mapping DNA Methylation with High Throughput Nanopore Sequencing , 2017, Nature Methods.

[20]  C. Sander,et al.  Genome-wide analysis of non-coding regulatory mutations in cancer , 2014, Nature Genetics.

[21]  Yunfan Fan,et al.  Nanopore sequencing detects structural variants in cancer , 2015, bioRxiv.

[22]  Richard E. Green,et al.  Improving nanopore read accuracy with the R2C2 method enables the sequencing of highly multiplexed full-length single-cell cDNA , 2018, Proceedings of the National Academy of Sciences.

[23]  S. Sugano,et al.  Evaluation and application of RNA-Seq by MinION , 2018, DNA research : an international journal for rapid publication of reports on genes and genomes.

[24]  Y. Ebenstein,et al.  Selective nanopore sequencing of human BRCA1 by Cas9-assisted targeting of chromosome segments (CATCH) , 2018, Nucleic acids research.

[25]  Yao Yang,et al.  Quantitative and multiplexed DNA methylation analysis using long-read single-molecule real-time bisulfite sequencing (SMRT-BS) , 2015, BMC Genomics.

[26]  Sara Goodwin,et al.  Oxford Nanopore sequencing, hybrid error correction, and de novo assembly of a eukaryotic genome , 2015, bioRxiv.

[27]  Evan E. Eichler,et al.  Characterizing the Major Structural Variant Alleles of the Human Genome , 2019, Cell.

[28]  Daniel R. Garalde,et al.  Highly parallel direct RNA sequencing on an array of nanopores , 2016, Nature Methods.

[29]  William Jones,et al.  Variation graph toolkit improves read mapping by representing genetic variation in the reference , 2018, Nature Biotechnology.

[30]  M. Fraga,et al.  Chromosomal instability correlates with genome-wide DNA demethylation in human primary colorectal cancers. , 2006, Cancer research.

[31]  Yutaka Suzuki,et al.  MoMI-G: modular multi-scale integrated genome graph browser , 2019, BMC Bioinformatics.

[32]  N. Loman,et al.  A complete bacterial genome assembled de novo using only nanopore sequencing data , 2015, Nature Methods.

[33]  L. Anelli,et al.  Design and MinION testing of a nanopore targeted gene sequencing panel for chronic lymphocytic leukemia , 2018, Scientific Reports.

[34]  Christopher W. Whelan,et al.  Structural Alterations Driving Castration-Resistant Prostate Cancer Revealed by Linked-Read Genome Sequencing , 2018, Cell.

[35]  Thomas Zichner,et al.  DELLY: structural variant discovery by integrated paired-end and split-read analysis , 2012, Bioinform..

[36]  Kai Ye,et al.  Pindel: a pattern growth approach to detect break points of large deletions and medium sized insertions from paired-end short reads , 2009, Bioinform..

[37]  Mark T. W. Ebbert,et al.  Systematic analysis of dark and camouflaged genes: disease-relevant genes hiding in plain sight , 2019, bioRxiv.

[38]  M. Frith,et al.  Adaptive seeds tame genomic sequence comparison. , 2011, Genome research.

[39]  L. Anelli,et al.  TP53 gene mutation analysis in chronic lymphocytic leukemia by nanopore MinION sequencing , 2016, Diagnostic Pathology.

[40]  Fritz J Sedlazeck,et al.  Piercing the dark matter: bioinformatics of long-range sequencing and mapping , 2018, Nature Reviews Genetics.

[41]  Keith A. Boroevich,et al.  Whole-genome mutational landscape and characterization of noncoding and structural mutations in liver cancer , 2016, Nature Genetics.

[42]  L. Sequist,et al.  The Allelic Context of the C797S Mutation Acquired upon Treatment with Third-Generation EGFR Inhibitors Impacts Sensitivity to Subsequent Treatment Strategies , 2015, Clinical Cancer Research.

[43]  Wan-Ping Lee,et al.  Fast and accurate genomic analyses using genome graphs , 2019, Nature Genetics.

[44]  Koichi Goto,et al.  RET fusion gene: Translation to personalized lung cancer therapy , 2013, Cancer science.

[45]  Joachim Weischenfeldt,et al.  SvABA: genome-wide detection of structural variants and indels by local assembly , 2018, Genome research.

[46]  John D McPherson,et al.  Complex rearrangements and oncogene amplifications revealed by long-read DNA and RNA sequencing of a breast cancer cell line , 2017, bioRxiv.

[47]  Gary D Bader,et al.  International network of cancer genome projects , 2010, Nature.

[48]  Y. Marie,et al.  Same-day genomic and epigenomic diagnosis of brain tumors using real-time nanopore sequencing , 2017, Acta Neuropathologica.

[49]  Xiaoyu Chen,et al.  Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications , 2016, Bioinform..

[50]  Kari Stefansson,et al.  Graphtyper enables population-scale genotyping using pangenome graphs , 2017, Nature Genetics.

[51]  A. Kasarskis,et al.  A window into third-generation sequencing. , 2010, Human molecular genetics.

[52]  Yasuko Mori,et al.  Direct RNA sequencing on nanopore arrays redefines the transcriptional complexity of a viral pathogen , 2019, Nature Communications.

[53]  D. Schwartz,et al.  Allele-Specific Quantification of Structural Variations in Cancer Genomes , 2016, bioRxiv.

[54]  Tomáš Vinař,et al.  DeepNano: Deep recurrent neural networks for base calling in MinION nanopore reads , 2016, PloS one.

[55]  Satoru Miyano,et al.  Aberrant PD-L1 expression through 3′-UTR disruption in multiple cancers , 2016, Nature.

[56]  Yutaka Suzuki,et al.  MoMI-G: modular multi-scale integrated genome graph browser , 2019, BMC Bioinformatics.

[57]  Kenta Nakai,et al.  DBTSS/DBKERO for integrated analysis of transcriptional regulation , 2017, Nucleic Acids Res..

[58]  Heng Li,et al.  Fast and accurate long-read assembly with wtdbg2 , 2019, Nature Methods.

[59]  Matthew E. Ritchie,et al.  Using long-read sequencing to detect imprinted DNA methylation , 2018, bioRxiv.

[60]  Hanlee P. Ji,et al.  Linked read sequencing resolves complex genomic rearrangements in gastric cancer metastases , 2017, Genome Medicine.

[61]  Christina A. Cuomo,et al.  Pilon: An Integrated Tool for Comprehensive Microbial Variant Detection and Genome Assembly Improvement , 2014, PloS one.

[62]  W. Pao,et al.  Acquired resistance to TKIs in solid tumours: learning from lung cancer , 2014, Nature Reviews Clinical Oncology.

[63]  Jian Ma,et al.  Allele-Specific Quantification of Structural Variations in Cancer Genomes , 2016, bioRxiv.

[64]  Wenjun Jiang,et al.  Cas9-Assisted Targeting of CHromosome segments CATCH enables one-step targeted cloning of large gene clusters , 2015, Nature Communications.

[65]  Hanlee P. Ji,et al.  Haplotyping germline and cancer genomes using high-throughput linked-read sequencing , 2015, Nature Biotechnology.

[66]  Ji Eun Lee,et al.  De novo Identification of DNA Modifications Enabled by Genome-Guided Nanopore Signal Processing , 2017, bioRxiv.

[67]  Joshua M. Stuart,et al.  The Cancer Genome Atlas Pan-Cancer analysis project , 2013, Nature Genetics.

[68]  S. Sugano,et al.  Identification of potential regulatory mutations using multi-omics analysis and haplotyping of lung adenocarcinoma cell lines , 2018, Scientific Reports.

[69]  D. Schadendorf,et al.  Highly Recurrent TERT Promoter Mutations in Human Melanoma , 2022 .

[70]  Sumio Sugano,et al.  Sequencing and phasing cancer mutations in lung cancers using a long-read portable sequencer , 2017, DNA research : an international journal for rapid publication of reports on genes and genomes.

[71]  Edwin Cuppen,et al.  Mapping and phasing of structural variation in patient genomes using nanopore sequencing , 2017, Nature Communications.

[72]  Michael C. Schatz,et al.  Accurate detection of complex structural variations using single molecule sequencing , 2017 .

[73]  Daniel A. Haber,et al.  Epidermal growth factor receptor mutations in lung cancer , 2007, Nature Reviews Cancer.

[74]  Jiannis Ragoussis,et al.  Benchmarking of the Oxford Nanopore MinION sequencing for quantitative and qualitative assessment of cDNA populations , 2016, Scientific Reports.

[75]  S. Koren,et al.  Canu: scalable and accurate long-read assembly via adaptive k-mer weighting and repeat separation , 2016, bioRxiv.

[76]  Yutaka Suzuki,et al.  Long read sequencing reveals a novel class of structural aberrations in cancers: identification and characterization of cancerous local amplifications , 2019, bioRxiv.