JuLI: accurate detection of DNA fusions in clinical sequencing for precision oncology.

Accurate detection of genomic fusions by high-throughput sequencing in clinical samples with inadequate tumor purity and formalin-fixed, paraffin embedded tissue is an essential task in precise oncology. We developed the fusion detection algorithm Junction Location Identifier for optimization of high-depth clinical sequencing. Novel filtering steps were implemented to minimize false positives in clinical setting. The algorithm was comprehensively validated using high-depth sequencing data from cancer cell lines and clinical samples and genome sequencing data from NA12878. Junction Location Identifier showed improved performance mainly in positive predictive value over state-of-the-art fusion callers in cases with high-depth clinical sequencing and rescued a driver fusion from false negative in plasma cell-free DNA using joint calling.

[1]  Joachim Weischenfeldt,et al.  SvABA: genome-wide detection of structural variants and indels by local assembly , 2018, Genome research.

[2]  L. Ding,et al.  novoBreak: local assembly for breakpoint detection in cancer genomes , 2016, Nature Methods.

[3]  Lianbo Yu,et al.  Impact of Pre-Analytical Variables on Cancer Targeted Gene Sequencing Efficiency , 2015, PloS one.

[4]  R. Palmer,et al.  Mechanistic insight into ALK receptor tyrosine kinase in human cancer biology , 2013, Nature Reviews Cancer.

[5]  Xiaoyu Chen,et al.  Manta: rapid detection of structural variants and indels for germline and cancer sequencing applications , 2016, Bioinform..

[6]  Dae-Soon Son,et al.  Utility of targeted deep sequencing for detecting circulating tumor DNA in pancreatic cancer patients , 2018, Scientific Reports.

[7]  Steven J. M. Jones,et al.  The Molecular Taxonomy of Primary Prostate Cancer , 2015, Cell.

[8]  Sam Angiuoli,et al.  Direct detection of early-stage cancers using circulating tumor DNA , 2017, Science Translational Medicine.

[9]  Timothy J O'Leary,et al.  Paraffin embedding contributes to RNA aggregation, reduced RNA yield, and low RNA quality. , 2011, The Journal of molecular diagnostics : JMD.

[10]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[11]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[12]  John D Pfeifer,et al.  Comparison of clinical targeted next-generation sequence data from formalin-fixed and fresh-frozen tissue specimens. , 2013, The Journal of molecular diagnostics : JMD.

[13]  Alex M. Fichtenholtz,et al.  Development and validation of a clinical cancer genomic profiling test based on massively parallel DNA sequencing , 2013, Nature Biotechnology.

[14]  Lei Wang,et al.  ALK, ROS1 and RET fusions in 1139 lung adenocarcinomas: a comprehensive study of common and fusion pattern-specific clinicopathologic, histologic and cytologic features. , 2014, Lung cancer.

[15]  Yuki Togashi,et al.  RET, ROS1 and ALK fusions in lung cancer , 2012, Nature Medicine.

[16]  Tom H. Pringle,et al.  The human genome browser at UCSC. , 2002, Genome research.

[17]  Soile Tapio,et al.  Nucleic acids from long-term preserved FFPE tissues are suitable for downstream analyses , 2012, Virchows Archiv.

[18]  Glen J. Weiss,et al.  Using circulating cell-free DNA to monitor personalized cancer therapy , 2017, Critical reviews in clinical laboratory sciences.

[19]  Mingming Jia,et al.  COSMIC: exploring the world's knowledge of somatic mutations in human cancer , 2014, Nucleic Acids Res..

[20]  Anil Vachani,et al.  Detection of Therapeutically Targetable Driver and Resistance Mutations in Lung Cancer Patients by Next-Generation Sequencing of Cell-Free Circulating Tumor DNA , 2016, Clinical Cancer Research.

[21]  Cathy H. Wu,et al.  UniProt: the Universal Protein knowledgebase , 2004, Nucleic Acids Res..

[22]  Boram Lee,et al.  Comprehensive analysis of RET and ROS1 rearrangement in lung adenocarcinoma , 2015, Modern Pathology.

[23]  L. Holm,et al.  The Pfam protein families database , 2005, Nucleic Acids Res..

[24]  A. Shaw,et al.  ALK inhibitors in non-small cell lung cancer: crizotinib and beyond. , 2014, Clinical advances in hematology & oncology : H&O.

[25]  Rui Henrique,et al.  Novel 5' fusion partners of ETV1 and ETV4 in prostate cancer. , 2013, Neoplasia.

[26]  Hyun Cheol Chung,et al.  Landscape of Actionable Genetic Alterations Profiled from 1,071 Tumor Samples in Korean Cancer Patients , 2018, Cancer research and treatment : official journal of Korean Cancer Association.

[27]  Jeffrey W. Clark,et al.  Anaplastic lymphoma kinase inhibition in non-small-cell lung cancer. , 2010, The New England journal of medicine.

[28]  Woo Yong Lee,et al.  Prevalence and detection of low-allele-fraction variants in clinical cancer samples , 2017, Nature Communications.

[29]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[30]  Hyun-Tae Shin,et al.  Molecular breakdown: a comprehensive view of anaplastic lymphoma kinase (ALK)‐rearranged non‐small cell lung cancer , 2017, The Journal of pathology.

[31]  Laura Conti,et al.  Tearing down the walls: FDA approves next generation sequencing (NGS) assays for actionable cancer genomic aberrations , 2018, Journal of Experimental & Clinical Cancer Research.

[32]  Geoffrey R. Oxnard,et al.  Bias-Corrected Targeted Next-Generation Sequencing for Rapid, Multiplexed Detection of Actionable Alterations in Cell-Free DNA from Advanced Lung Cancer Patients , 2015, Clinical Cancer Research.

[33]  Jakob Skou Pedersen,et al.  Optimized targeted sequencing of cell-free plasma DNA from bladder cancer patients , 2018, Scientific Reports.

[34]  Thomas Zichner,et al.  DELLY: structural variant discovery by integrated paired-end and split-read analysis , 2012, Bioinform..

[35]  Mauricio O. Carneiro,et al.  Scaling accurate genetic variant discovery to tens of thousands of samples , 2017, bioRxiv.

[36]  Michael Thomas,et al.  Crizotinib versus chemotherapy in advanced ALK-positive lung cancer. , 2013, The New England journal of medicine.

[37]  Markus Riester,et al.  Tracking the Evolution of Resistance to ALK Tyrosine Kinase Inhibitors through Longitudinal Analysis of Circulating Tumor DNA. , 2018, JCO precision oncology.

[38]  Funda Meric-Bernstam,et al.  Bias from removing read duplication in ultra-deep sequencing experiments , 2014, Bioinform..

[39]  Ryan M. Layer,et al.  LUMPY: a probabilistic framework for structural variant discovery , 2012, Genome Biology.

[40]  C. Sawyers,et al.  Efficacy and safety of a specific inhibitor of the BCR-ABL tyrosine kinase in chronic myeloid leukemia. , 2001, The New England journal of medicine.

[41]  T. Speed,et al.  GRIDSS: sensitive and specific genomic rearrangement detection using positional de Bruijn graph assembly. , 2017, Genome research.