Sensitive and accurate detection of copy number variants using read depth of coverage.

Methods for the direct detection of copy number variation (CNV) genome-wide have become effective instruments for identifying genetic risk factors for disease. The application of next-generation sequencing platforms to genetic studies promises to improve sensitivity to detect CNVs as well as inversions, indels, and SNPs. New computational approaches are needed to systematically detect these variants from genome sequence data. Existing sequence-based approaches for CNV detection are primarily based on paired-end read mapping (PEM) as reported previously by Tuzun et al. and Korbel et al. Due to limitations of the PEM approach, some classes of CNVs are difficult to ascertain, including large insertions and variants located within complex genomic regions. To overcome these limitations, we developed a method for CNV detection using read depth of coverage. Event-wise testing (EWT) is a method based on significance testing. In contrast to standard segmentation algorithms that typically operate by performing likelihood evaluation for every point in the genome, EWT works on intervals of data points, rapidly searching for specific classes of events. Overall false-positive rate is controlled by testing the significance of each possible event and adjusting for multiple testing. Deletions and duplications detected in an individual genome by EWT are examined across multiple genomes to identify polymorphism between individuals. We estimated error rates using simulations based on real data, and we applied EWT to the analysis of chromosome 1 from paired-end shotgun sequence data (30x) on five individuals. Our results suggest that analysis of read depth is an effective approach for the detection of CNVs, and it captures structural variants that are refractory to established PEM-based methods.

[1]  M. Wigler,et al.  Detecting gene copy number fluctuations in tumor cells by microarray analysis of genomic representations. , 2000, Genome research.

[2]  Julie R. Korenberg,et al.  Comparative Genome Hybridization , 2002 .

[3]  Christian A. Rees,et al.  Microarray analysis reveals a major direct role of DNA copy number alteration in the transcriptional program of human breast tumors , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[4]  Daniel Pinkel,et al.  Genomic microarrays in human genetic disease and cancer. , 2003, Human molecular genetics.

[5]  Kenny Q. Ye,et al.  Large-Scale Copy Number Polymorphism in the Human Genome , 2004, Science.

[6]  L. Feuk,et al.  Detection of large-scale variation in the human genome , 2004, Nature Genetics.

[7]  E. Eichler,et al.  Fine-scale structural variation of the human genome , 2005, Nature Genetics.

[8]  L. Feuk,et al.  Structural variants: changing the landscape of chromosomes and design of disease studies. , 2006, Human molecular genetics.

[9]  Kenny Q. Ye,et al.  Strong Association of De Novo Copy Number Mutations with Autism , 2007, Science.

[10]  C. Yau,et al.  QuantiSNP: an Objective Bayes Hidden-Markov Model to detect and accurately map copy number variation using SNP genotyping data , 2007, Nucleic acids research.

[11]  Philip M. Kim,et al.  Paired-End Mapping Reveals Extensive Structural Variation in the Human Genome , 2007, Science.

[12]  Joseph T. Glessner,et al.  PennCNV: an integrated hidden Markov model designed for high-resolution copy number variation detection in whole-genome SNP genotyping data. , 2007, Genome research.

[13]  J. Lupski Structural variation in the human genome. , 2007, The New England journal of medicine.

[14]  Howard L. McLeod,et al.  wuHMM: a robust algorithm to detect DNA copy number variation using long oligonucleotide microarray data , 2008, Nucleic acids research.

[15]  P. Visscher,et al.  Rare chromosomal deletions and duplications increase risk of schizophrenia , 2008, Nature.

[16]  Joshua M. Korn,et al.  Integrated detection and population-genetic analysis of SNPs and copy number variation , 2008, Nature Genetics.

[17]  D. Pinto,et al.  Structural variation of chromosomes in autism spectrum disorder. , 2008, American journal of human genetics.

[18]  Thomas W. Mühleisen,et al.  Large recurrent microdeletions associated with schizophrenia , 2008, Nature.

[19]  Nancy F. Hansen,et al.  Accurate Whole Human Genome Sequencing using Reversible Terminator Chemistry , 2008, Nature.

[20]  Joshua M. Korn,et al.  Integrated genotype calling and association analysis of SNPs, common copy number polymorphisms and rare CNVs , 2008, Nature Genetics.

[21]  Joshua M. Korn,et al.  Mapping and sequencing of structural variation from eight human genomes , 2008, Nature.

[22]  R. Durbin,et al.  Mapping Quality Scores Mapping Short Dna Sequencing Reads and Calling Variants Using P

, 2022 .

[23]  E. Eichler,et al.  Systematic assessment of copy number variant detection via genome-wide SNP genotyping , 2008, Nature Genetics.

[24]  E. Mardis The impact of next-generation sequencing technology on genetics. , 2008, Trends in genetics : TIG.

[25]  A. Singleton,et al.  Rare Structural Variants Disrupt Multiple Genes in Neurodevelopmental Pathways in Schizophrenia , 2008, Science.

[26]  Dawei Li,et al.  The diploid genome sequence of an Asian individual , 2008, Nature.

[27]  G. Kirov,et al.  Support for the involvement of large copy number variants in the pathogenesis of schizophrenia. , 2009, Human molecular genetics.

[28]  Mark Gerstein,et al.  MSB: a mean-shift-based approach for the analysis of structural variation in the genome. , 2008, Genome research.

[29]  Elvira Bramon,et al.  Disruption of the neurexin 1 gene is associated with schizophrenia. , 2009, Human molecular genetics.

[30]  John P. Rice,et al.  Singleton deletions throughout the genome increase risk of bipolar disorder , 2009, Molecular Psychiatry.

[31]  J. Stockman Recurrent Rearrangements of Chromosome 1q21.1 and Variable Pediatric Phenotypes , 2010 .