DeviCNV: detection and visualization of exon-level copy number variants in targeted next-generation sequencing data

BackgroundTargeted next-generation sequencing (NGS) is increasingly being adopted in clinical laboratories for genomic diagnostic tests.ResultsWe developed a new computational method, DeviCNV, intended for the detection of exon-level copy number variants (CNVs) in targeted NGS data. DeviCNV builds linear regression models with bootstrapping for every probe to capture the relationship between read depth of an individual probe and the median of read depth values of all probes in the sample. From the regression models, it estimates the read depth ratio of the observed and predicted read depth with confidence interval for each probe which is applied to a circular binary segmentation (CBS) algorithm to obtain CNV candidates. Then, it assigns confidence scores to those candidates based on the reliability and strength of the CNV signals inferred from the read depth ratios of the probes within them. Finally, it also provides gene-centric plots with confidence levels of CNV candidates for visual inspection. We applied DeviCNV to targeted NGS data generated for newborn screening and demonstrated its ability to detect novel pathogenic CNVs from clinical samples.ConclusionsWe propose a new pragmatic method for detecting CNVs in targeted NGS data with an intuitive visualization and a systematic method to assign confidence scores for candidate CNVs. Since DeviCNV was developed for use in clinical diagnosis, sensitivity is increased by the detection of exon-level CNVs.

[1]  Christian Gilissen,et al.  Unlocking Mendelian disease using exome sequencing , 2011, Genome Biology.

[2]  André Reis,et al.  Psoriasis is associated with increased β-defensin genomic copy number , 2008, Nature Genetics.

[3]  Matthew S. Lebo,et al.  Detecting Copy Number Variation via Next Generation Technology , 2016, Current Genetic Medicine Reports.

[4]  Gonçalo R. Abecasis,et al.  The Sequence Alignment/Map format and SAMtools , 2009, Bioinform..

[5]  X. Estivill,et al.  Copy number variants and genetic traits: closer to the resolution of phenotypic to genotypic variability , 2007, Nature Reviews Genetics.

[6]  Jos Jonkers,et al.  CopywriteR: DNA copy number detection from off-target sequence data , 2015, Genome Biology.

[7]  Ryan M. Layer,et al.  LUMPY: a probabilistic framework for structural variant discovery , 2012, Genome Biology.

[8]  S. Steer,et al.  Evidence for an influence of chemokine ligand 3-like 1 (CCL3L1) gene copy number on susceptibility to rheumatoid arthritis , 2007, Annals of the rheumatic diseases.

[9]  Takuo Ikeda,et al.  Massively Parallel DNA Sequencing Successfully Identifies New Causative Mutations in Deafness Genes in Patients with Cochlear Implantation and EAS , 2013, PloS one.

[10]  J. R. MacDonald,et al.  A copy number variation map of the human genome , 2015, Nature Reviews Genetics.

[11]  Tatiana Popova,et al.  Multi-factor data normalization enables the detection of copy number aberrations in amplicon sequencing data , 2014, Bioinform..

[12]  Iuliana Ionita-Laza,et al.  Genetic association analysis of copy-number variation (CNV) in human disease pathogenesis. , 2009, Genomics.

[13]  Alfons Meindl,et al.  Copy number variant in the candidate tumor suppressor gene MTUS1 and familial breast cancer risk. , 2007, Carcinogenesis.

[14]  E. Banks,et al.  Discovery and statistical genotyping of copy-number variation from whole-exome sequencing depth. , 2012, American journal of human genetics.

[15]  Bernhard Radlwimmer,et al.  A chromosome 8 gene-cluster polymorphism with low human beta-defensin 2 gene copy number predisposes to Crohn disease of the colon. , 2006, American journal of human genetics.

[16]  John Quackenbush,et al.  Exome sequencing-based copy-number variation and loss of heterozygosity detection: ExomeCNV , 2011, Bioinform..

[17]  Silvia Parajes,et al.  High Frequency of Copy Number Variations and Sequence Variants at CYP21A2 Locus: Implication for the Genetic Diagnosis of 21-Hydroxylase Deficiency , 2008, PloS one.

[18]  Nancy R. Zhang,et al.  CODEX: a normalization and copy number variation detection method for whole exome sequencing , 2015, Nucleic acids research.

[19]  P. Stankiewicz,et al.  Whole-genome sequencing in a patient with Charcot-Marie-Tooth neuropathy. , 2010, The New England journal of medicine.

[20]  Eric Talevich,et al.  CNVkit: Genome-Wide Copy Number Detection and Visualization from Targeted DNA Sequencing , 2016, PLoS Comput. Biol..

[21]  Heng Li Aligning sequence reads, clone sequences and assembly contigs with BWA-MEM , 2013, 1303.3997.

[22]  M. Swertz,et al.  CoNVaDING: Single Exon Variation Detection in Targeted NGS Data , 2016, Human mutation.

[23]  M. Wigler,et al.  Circular binary segmentation for the analysis of array-based DNA copy number data. , 2004, Biostatistics.

[24]  Pär Stattin,et al.  Association of a germ-line copy number variation at 2p24.3 and risk for aggressive prostate cancer. , 2009, Cancer research.

[25]  M. DePristo,et al.  The Genome Analysis Toolkit: a MapReduce framework for analyzing next-generation DNA sequencing data. , 2010, Genome research.

[26]  Qingguo Wang,et al.  Computational tools for copy number variation (CNV) detection using next-generation sequencing data: features and perspectives , 2013, BMC Bioinformatics.

[27]  Jason Li,et al.  CONTRA: copy number analysis for targeted resequencing , 2012, Bioinform..

[28]  B. Rovin,et al.  The Influence of CCL 3 L 1 Gene – Containing Segmental Duplications on HIV-1 / AIDS Susceptibility , 2009 .

[29]  P. Zandi,et al.  Whole-genome CNV analysis: advances in computational approaches , 2015, Front. Genet..

[30]  Tom Walsh,et al.  Whole exome sequencing and homozygosity mapping identify mutation in the cell polarity protein GPSM2 as the cause of nonsyndromic hearing loss DFNB82. , 2010, American journal of human genetics.

[31]  M. DePristo,et al.  A framework for variation discovery and genotyping using next-generation DNA sequencing data , 2011, Nature Genetics.

[32]  Yanming Feng,et al.  Capture-based high-coverage NGS: a powerful tool to uncover a wide spectrum of mutation types , 2015, Genetics in Medicine.

[33]  Birgit Funke,et al.  VisCap: inference and visualization of germ-line copy-number variants from targeted clinical sequencing data , 2015, Genetics in Medicine.

[34]  C. Ponting,et al.  Sequencing depth and coverage: key considerations in genomic analyses , 2014, Nature Reviews Genetics.

[35]  S. Mccarroll,et al.  Copy-number variation and association studies of human disease , 2007, Nature Genetics.