CopywriteR: DNA copy number detection from off-target sequence data

Current methods for detection of copy number variants (CNV) and aberrations (CNA) from targeted sequencing data are based on the depth of coverage of captured exons. Accurate CNA determination is complicated by uneven genomic distribution and non-uniform capture efficiency of targeted exons. Here we present CopywriteR, which eludes these problems by exploiting ‘off-target’ sequence reads. CopywriteR allows for extracting uniformly distributed copy number information, can be used without reference, and can be applied to sequencing data obtained from various techniques including chromatin immunoprecipitation and target enrichment on small gene panels. CopywriteR outperforms existing methods and constitutes a widely applicable alternative to available tools.

[1]  Xiaolin Zhu,et al.  An Evaluation of Copy Number Variation Detection Tools from Whole‐Exome Sequencing Data , 2014, Human mutation.

[2]  Jiang Li,et al.  Finding the lost treasures in exome sequencing data. , 2013, Trends in genetics : TIG.

[3]  Z. Xuan,et al.  Genome-wide in situ exon capture for selective resequencing , 2007, Nature Genetics.

[4]  Emily H Turner,et al.  Targeted Capture and Massively Parallel Sequencing of Twelve Human Exomes , 2009, Nature.

[5]  ENCODEConsortium,et al.  An Integrated Encyclopedia of DNA Elements in the Human Genome , 2012, Nature.

[6]  K. Kok,et al.  CGH arrays compared for DNA isolated from formalin‐fixed, paraffin‐embedded material , 2012, Genes, chromosomes & cancer.

[7]  Hanlee P. Ji,et al.  Next-generation DNA sequencing , 2008, Nature Biotechnology.

[8]  P. V. van Diest,et al.  Primary Colorectal Cancers and Their Subsequent Hepatic Metastases Are Genetically Different: Implications for Selection of Patients for Targeted Treatment , 2011, Clinical Cancer Research.

[9]  John Quackenbush,et al.  Exome sequencing-based copy-number variation and loss of heterozygosity detection: ExomeCNV , 2011, Bioinform..

[10]  Steven J. M. Jones,et al.  Comprehensive molecular characterization of human colon and rectal cancer , 2012, Nature.

[11]  B. Giusti,et al.  EXCAVATOR: detecting copy number variants from whole-exome sequencing data , 2013, Genome Biology.

[12]  Todd Richmond,et al.  Detection of Clinically Relevant Copy Number Variants with Whole‐Exome Sequencing , 2013, Human mutation.

[13]  Mark Gerstein,et al.  Millions of years of evolution preserved: a comprehensive catalog of the processed pseudogenes in the human genome. , 2003, Genome research.

[14]  E. Cuppen,et al.  Detailed imaging and genetic analysis reveal a secondary BRAFL505H resistance mutation and extensive intrapatient heterogeneity in metastatic BRAF mutant melanoma patients treated with vemurafenib , 2015, Pigment cell & melanoma research.

[15]  小森和樹 Gene Expression Omnibus利用方法の検討 , 2016 .

[16]  Beth Israel,et al.  Decision letter: Replication Study: A coding-independent function of gene and pseudogene mRNAs regulates tumour biology , 2010 .

[17]  M. Wigler,et al.  Circular binary segmentation for the analysis of array-based DNA copy number data. , 2004, Biostatistics.

[18]  L. Chin,et al.  A comparison of DNA copy number profiling platforms. , 2008, Cancer research.

[19]  M. DePristo,et al.  A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.

[20]  Rory Stark Differential Oestrogen Receptor Binding is Associated with Clinical Outcome in Breast Cancer , 2012, RECOMB.

[21]  John McPherson,et al.  WaveCNV: allele-specific copy number alterations in primary tumors and xenograft models from next-generation sequencing , 2013, Bioinform..

[22]  K. Maclennan,et al.  Using next-generation sequencing for high resolution multiplex analysis of copy number variation from nanogram quantities of DNA from formalin-fixed paraffin-embedded specimens , 2010, Nucleic acids research.

[23]  Rafael A Irizarry,et al.  Exploration, normalization, and genotype calls of high-density oligonucleotide SNP array data. , 2006, Biostatistics.

[24]  J. Uhm Comprehensive genomic characterization defines human glioblastoma genes and core pathways , 2009 .

[25]  Data production leads,et al.  An integrated encyclopedia of DNA elements in the human genome , 2012 .

[26]  Bradley P. Coe,et al.  Copy number variation detection and genotyping from exome sequence data , 2012, Genome research.

[27]  S. Varambally,et al.  Genomic Loss of microRNA-101 Leads to Overexpression of Histone Methyltransferase EZH2 in Cancer , 2008, Science.

[28]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[29]  Michael R. Speicher,et al.  Digital karyotyping , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[30]  Arul M Chinnaiyan,et al.  Detection of somatic copy number alterations in cancer using targeted exome capture sequencing. , 2011, Neoplasia.

[31]  Clifford A. Meyer,et al.  Model-based Analysis of ChIP-Seq (MACS) , 2008, Genome Biology.

[32]  R. Wilson,et al.  Cancer genome sequencing: a review. , 2009, Human molecular genetics.

[33]  Wessel N. van Wieringen,et al.  CGHcall: calling aberrations for array CGH tumor profiles , 2007, Bioinform..

[34]  Lodewyk F. A. Wessels,et al.  A regression model for estimating DNA copy number applied to capture sequencing data , 2012, Bioinform..

[35]  Richard Durbin,et al.  Sequence analysis Fast and accurate short read alignment with Burrows – Wheeler transform , 2009 .

[36]  Pieter Wesseling,et al.  DNA copy number analysis of fresh and formalin-fixed specimens by shallow whole-genome sequencing with identification and exclusion of problematic regions in the genome assembly , 2014, Genome research.

[37]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[38]  E. Banks,et al.  Discovery and statistical genotyping of copy-number variation from whole-exome sequencing depth. , 2012, American journal of human genetics.

[39]  Alberto Magi,et al.  Read count approach for DNA copy number variants detection , 2012, Bioinform..

[40]  James Hadfield,et al.  The pitfalls of platform comparison: DNA copy number array technologies assessed , 2009, BMC Genomics.

[41]  Daniel R. Zerbino,et al.  Ensembl 2014 , 2013, Nucleic Acids Res..

[42]  W. Kuo,et al.  High resolution analysis of DNA copy number variation using comparative genomic hybridization to microarrays , 1998, Nature Genetics.

[43]  Toshihiro Tanaka The International HapMap Project , 2003, Nature.

[44]  C. Meijer,et al.  Focal aberrations indicate EYA2 and hsa‐miR‐375 as oncogene and tumor suppressor in cervical carcinogenesis , 2013, Genes, chromosomes & cancer.

[45]  S. Gabriel,et al.  Pan-cancer patterns of somatic copy-number alteration , 2013, Nature Genetics.

[46]  M. Roizen,et al.  Hallmarks of Cancer: The Next Generation , 2012 .