A large, complex structural polymorphism at 16p12.1 underlies microdeletion disease risk

There is a complex relationship between the evolution of segmental duplications and rearrangements associated with human disease. We performed a detailed analysis of one region on chromosome 16p12.1 associated with neurocognitive disease and identified one of the largest structural inconsistencies in the human reference assembly. Various genomic analyses show that all examined humans are homozygously inverted relative to the reference genome for a 1.1-Mb region on 16p12.1. We determined that this assembly discrepancy stems from two common structural configurations with worldwide frequencies of 17.6% (S1) and 82.4% (S2). This polymorphism arose from the rapid integration of segmental duplications, precipitating two local inversions within the human lineage over the last 10 million years. The two human haplotypes differ by 333 kb of additional duplicated sequence present in S2 but not in S1. Notably, we show that the S2 configuration harbors directly oriented duplications, specifically predisposing this chromosome to disease-associated rearrangement.

[1]  Evan E. Eichler,et al.  Positive selection of a gene family during the emergence of humans and African apes , 2001, Nature.

[2]  Peter A. Meric,et al.  Lineage-Specific Biology Revealed by a Finished Genome Assembly of the Mouse , 2009, PLoS biology.

[3]  M. Adams,et al.  Recent Segmental Duplications in the Human Genome , 2002, Science.

[4]  Judith D. Cohn,et al.  The sequence and analysis of duplication-rich human chromosome 16 , 2004, Nature.

[5]  Tomas W. Fitzgerald,et al.  Origins and functional impact of copy number variation in the human genome , 2010, Nature.

[6]  Andrew J Lees,et al.  Microdeletion encompassing MAPT at chromosome 17q21.3 is associated with developmental delay and learning disability , 2006, Nature Genetics.

[7]  Emily H Turner,et al.  Targeted Capture and Massively Parallel Sequencing of Twelve Human Exomes , 2009, Nature.

[8]  Evan E. Eichler,et al.  An assessment of the sequence gaps: Unfinished business in a finished human genome , 2004, Nature Reviews Genetics.

[9]  J. D. Parsons,et al.  Miropeats: graphical DNA sequence comparisons , 1995, Comput. Appl. Biosci..

[10]  D. Cooper,et al.  Molecular mechanisms of chromosomal rearrangement during primate evolution , 2008, Chromosome Research.

[11]  B. Trask,et al.  Segmental duplications: organization and impact within the current human genome project assembly. , 2001, Genome research.

[12]  J. Lupski,et al.  Implications of human genome architecture for rearrangement-based disorders: the genomic basis of disease. , 2004, Human molecular genetics.

[13]  Stephen W. Scherer,et al.  A 1.5 million–base pair inversion polymorphism in families with Williams-Beuren syndrome , 2001, Nature Genetics.

[14]  J. Lupski,et al.  Molecular mechanisms for genomic disorders. , 2003, Annual review of genomics and human genetics.

[15]  Matthew J. Huentelman,et al.  IDENTIFICATION OF GENETIC VARIANTS USING BARCODED MULTIPLEXED SEQUENCING , 2008, Nature Methods.

[16]  J. Weber,et al.  Olfactory receptor-gene clusters, genomic-inversion polymorphisms, and common chromosome rearrangements. , 2001, American journal of human genetics.

[17]  G Hermanson,et al.  High-resolution mapping of human chromosome 11 by in situ hybridization with cosmid clones. , 1990, Science.

[18]  E. Eichler,et al.  Fine-scale structural variation of the human genome , 2005, Nature Genetics.

[19]  Zhaoshi Jiang,et al.  Evolutionary toggling of the MAPT 17q21.31 inversion region , 2008, Nature Genetics.

[20]  J. Lupski Genomic disorders: structural features of the genome can lead to DNA rearrangements and human disease traits. , 1998, Trends in genetics : TIG.

[21]  H. Stefánsson,et al.  A common inversion under selection in Europeans , 2005, Nature Genetics.

[22]  Richard Durbin,et al.  A large genome center's improvements to the Illumina sequencing system , 2008, Nature Methods.

[23]  D. Conrad,et al.  Recurrent 16p11.2 microdeletions in autism. , 2007, Human molecular genetics.

[24]  E. Eichler,et al.  Ancestral reconstruction of segmental duplications reveals punctuated cores of human genome evolution , 2007, Nature Genetics.

[25]  M. Hurles,et al.  Large, rare chromosomal deletions associated with severe early-onset obesity , 2010, Nature.

[26]  U. Surti,et al.  Discovery of a previously unrecognized microdeletion syndrome of 16p11.2–p12.2 , 2007, Nature Genetics.

[27]  Fikret Erdogan,et al.  Array CGH identifies reciprocal 16p13.1 duplications and deletions that predispose to autism and/or mental retardation , 2007, Human mutation.

[28]  R. Pfundt,et al.  A new chromosome 17q21.31 microdeletion syndrome associated with a common inversion polymorphism , 2006, Nature Genetics.

[29]  H. Brunner Annual Review of Genomics and Human Genetics , 2001, European Journal of Human Genetics.

[30]  Joshua M. Korn,et al.  Integrated detection and population-genetic analysis of SNPs and copy number variation , 2008, Nature Genetics.

[31]  E. Eichler,et al.  Structure of chromosomal duplicons and their role in mediating human genomic disorders. , 2000, Genome research.

[32]  L Peltonen,et al.  Mechanically stretched chromosomes as targets for high-resolution FISH mapping. , 1995, Genome research.

[33]  H. Mefford,et al.  Recurrent reciprocal deletions and duplications of 16p13.11: the deletion is a risk factor for MR/MCA while the duplication may be a rare benign variant , 2008, Journal of Medical Genetics.

[34]  Joshua M. Korn,et al.  Association between microdeletion and microduplication at 16p11.2 and autism. , 2008, The New England journal of medicine.

[35]  D. Haussler,et al.  Integration of cytogenetic landmarks into the draft sequence of the human genome , 2001, Nature.

[36]  David C. Schwartz,et al.  High-resolution human genome structure by single-molecule analysis , 2010, Proceedings of the National Academy of Sciences.

[37]  Andrew J Sharp,et al.  Discovery of previously unidentified genomic disorders from the duplication architecture of the human genome , 2006, Nature Genetics.

[38]  Mario Cáceres,et al.  A recurrent inversion on the eutherian X chromosome , 2007, Proceedings of the National Academy of Sciences.

[39]  J. R. MacDonald,et al.  Genome-wide detection of segmental duplications and potential assembly errors in the human genome sequence , 2003, Genome Biology.

[40]  Pawel Stankiewicz,et al.  Genomic Disorders: Molecular Mechanisms for Rearrangements and Conveyed Phenotypes , 2005, PLoS genetics.

[41]  Pui-Yan Kwok,et al.  Paternal origins of complete hydatidiform moles proven by whole genome single-nucleotide polymorphism haplotyping. , 2002, Genomics.

[42]  Arcadi Navarro,et al.  A burst of segmental duplications in the genome of the African great ape ancestor , 2009, Nature.

[43]  Richa Agarwala,et al.  A rhesus macaque radiation hybrid map and comparative analysis with the human genome. , 2005, Genomics.

[44]  International Human Genome Sequencing Consortium Initial sequencing and analysis of the human genome , 2001, Nature.

[45]  Junjun Zhang,et al.  Human Chromosome 7: DNA Sequence and Biology , 2003, Science.

[46]  P. Stankiewicz,et al.  Genome architecture, rearrangements and genomic disorders. , 2002, Trends in genetics : TIG.

[47]  Joshua M. Korn,et al.  Mapping and sequencing of structural variation from eight human genomes , 2008, Nature.

[48]  Miron Livny,et al.  Validation of rice genome sequence by optical mapping , 2007, BMC Genomics.

[49]  Zhaoshi Jiang,et al.  Characterization of six human disease-associated inversion polymorphisms , 2009, Human molecular genetics.

[50]  E. Eichler,et al.  DupMasker: a tool for annotating primate segmental duplications. , 2008, Genome research.

[51]  Richard M Myers,et al.  Population analysis of large copy number variants and hotspots of human genetic disease. , 2009, American journal of human genetics.

[52]  Deborah L. Levy,et al.  A recurrent 16p12.1 microdeletion suggests a two-hit model for severe developmental delay , 2010, Nature Genetics.

[53]  David C. Schwartz,et al.  A Single Molecule Scaffold for the Maize Genome , 2009, PLoS genetics.