Paired-Duplication Signatures Mark Cryptic Inversions and Other Complex Structural Variation

Copy-number variants (CNVs) have been the predominant focus of genetic studies of structural variation, and chromosomal microarray (CMA) for genome-wide CNV detection is the recommended first-tier genetic diagnostic screen in neurodevelopmental disorders. We compared CNVs observed by CMA to the structural variation detected by whole-genome large-insert sequencing in 259 individuals diagnosed with autism spectrum disorder (ASD) from the Simons Simplex Collection. These analyses revealed a diverse landscape of complex duplications in the human genome. One remarkably common class of complex rearrangement, which we term dupINVdup, involves two closely located duplications ("paired duplications") that flank the breakpoints of an inversion. This complex variant class is cryptic to CMA, but we observed it in 8.1% of all subjects. We also detected other paired-duplication signatures and duplication-mediated complex rearrangements in 15.8% of all ASD subjects. Breakpoint analysis showed that the predominant mechanism of formation of these complex duplication-associated variants was microhomology-mediated repair. On the basis of the striking prevalence of dupINVdups in this cohort, we explored the landscape of all inversion variation among the 235 highest-quality libraries and found abundant complexity among these variants: only 39.3% of inversions were canonical, or simple, inversions without additional rearrangement. Collectively, these findings indicate that dupINVdups, as well as other complex duplication-associated rearrangements, represent relatively common sources of genomic variation that is cryptic to population-based microarray and low-depth whole-genome sequencing. They also suggest that paired-duplication signatures detected by CMA warrant further scrutiny in genetic diagnostic testing given that they might mark complex rearrangements of potential clinical relevance.
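The paired-duplication signature described above (two closely located duplications that may flank an inversion) can be illustrated with a minimal sketch that flags candidate pairs in a list of CNV calls. This is not the authors' pipeline: the `Cnv` record, the `paired_duplications` function, and the `max_gap` distance threshold are hypothetical names and parameters introduced here for illustration, and the abstract does not state what distance counts as "closely located".

```python
from typing import List, NamedTuple, Tuple


class Cnv(NamedTuple):
    """Hypothetical CNV call record; field names are assumptions."""
    chrom: str
    start: int
    end: int
    kind: str  # "DUP" or "DEL"


def paired_duplications(calls: List[Cnv], max_gap: int) -> List[Tuple[Cnv, Cnv]]:
    """Return neighboring duplications on the same chromosome that do not
    overlap and are separated by at most max_gap base pairs.

    max_gap is an illustrative threshold, not a value from the study.
    """
    # Keep only duplications and order them by position.
    dups = sorted(
        (c for c in calls if c.kind == "DUP"),
        key=lambda c: (c.chrom, c.start),
    )
    pairs = []
    # Compare each duplication with the next one in sorted order.
    for left, right in zip(dups, dups[1:]):
        if left.chrom != right.chrom:
            continue
        gap = right.start - left.end
        if 0 < gap <= max_gap:
            pairs.append((left, right))
    return pairs


if __name__ == "__main__":
    example_calls = [
        Cnv("chr1", 100_000, 200_000, "DUP"),
        Cnv("chr1", 210_000, 240_000, "DEL"),
        Cnv("chr1", 250_000, 300_000, "DUP"),
        Cnv("chr1", 5_000_000, 5_100_000, "DUP"),
        Cnv("chr2", 100_000, 200_000, "DUP"),
    ]
    for left, right in paired_duplications(example_calls, max_gap=1_000_000):
        print(left, right)
```

A flagged pair is only a candidate: as the abstract notes, the intervening inversion is cryptic to CMA, so confirming a dupINVdup would require sequencing-based breakpoint evidence.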
