The structure and evolution of centromeric transition regions within the human genome

An understanding of how centromeric transition regions are organized is a critical aspect of chromosome structure and function; however, the sequence context of these regions has been difficult to resolve on the basis of the draft genome sequence. We present a detailed analysis of the structure and assembly of all human pericentromeric regions (5 megabases). Most chromosome arms (35 out of 43) show a gradient of dwindling transcriptional diversity accompanied by an increasing number of interchromosomal duplications in proximity to the centromere. At least 30% of the centromeric transition region structure originates from euchromatic gene-containing segments of DNA that were duplicatively transposed towards pericentromeric regions at a rate of six–seven events per million years during primate evolution. This process has led to the formation of a minimum of 28 new transcripts by exon exaptation and exon shuffling, many of which are primarily expressed in the testis. The distribution of these duplicated segments is nonrandom among pericentromeric regions, suggesting that some regions have served as preferential acceptors of euchromatic DNA.

[1]  U. Francke An International System for Human Cytogenetic Nomenclature — High-Resolution Banding (1981): ISCN (1981) , 1981 .

[2]  J. Yunis,et al.  The origin of man: a chromosomal pictorial legacy. , 1982, Science.

[3]  M. Goodman,et al.  Molecular phylogeny of the New World monkeys (Platyrrhini, primates) based on two unlinked nuclear genes: IRBP intron 1 and epsilon-globin sequences. , 1996, American journal of physical anthropology.

[4]  M. Ferguson-Smith,et al.  Human centromeric DNAs , 1997, Human Genetics.

[5]  E. Eichler,et al.  Masquerading repeats: paralogous pitfalls of the human genome. , 1998, Genome research.

[6]  M. Marra,et al.  Genetic definition and sequence analysis of Arabidopsis centromeres. , 1999, Science.

[7]  N. Archidiacono,et al.  Sequences flanking the centromere of human chromosome 10 are a complex patchwork of arm-specific sequences, stable duplications and unstable sequences with homologies to telomeric and other centromeric locations. , 1999, Human molecular genetics.

[8]  E. Eichler,et al.  The mosaic structure of human pericentromeric DNA: a strategy for characterizing complex regions of the human genome. , 2000, Genome research.

[9]  M. Adams,et al.  Molecular structure and evolution of an alpha satellite/non-alpha satellite junction at 16p11. , 2000, Human molecular genetics.

[10]  E. Eichler,et al.  Recent duplication, domain accretion and the dynamic mutation of the human genome. , 2001, Trends in genetics : TIG.

[11]  Valery Shepelev,et al.  Alpha-satellite DNA of primates: old and new families , 2001, Chromosoma.

[12]  E. Eichler,et al.  Lessons from the human genome: transitions between euchromatin and heterochromatin. , 2001, Human molecular genetics.

[13]  N. Archidiacono,et al.  Centromere emergence in evolution. , 2001, Genome research.

[14]  E. Winzeler,et al.  Genomic and Genetic Definition of a Functional Human Centromere , 2001, Science.

[15]  B. Trask,et al.  Segmental duplications: organization and impact within the current human genome project assembly. , 2001, Genome research.

[16]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[17]  Stuart Schwartz,et al.  Human-specific duplication and mosaic transcripts: the recent paralogous structure of chromosome 22. , 2002, American journal of human genetics.

[18]  N. Archidiacono,et al.  Human paralogs of KIAA0187 were created through independent pericentromeric-directed and chromosome-specific duplication mechanisms. , 2002, Genome research.

[19]  M. Adams,et al.  Recent Segmental Duplications in the Human Genome , 2002, Science.

[20]  M. Jackson Duplicate, decouple, disperse: the evolutionary transience of human centromeric regions. , 2003, Current opinion in genetics & development.

[21]  E. Green,et al.  Pericentromeric duplications in the laboratory mouse. , 2003, Genome research.

[22]  James M. Eldred,et al.  The DNA sequence of human chromosome 7 , 2003, Nature.

[23]  Mario Ventura,et al.  Chromosome 6 phylogeny in primates and centromere repositioning. , 2003, Molecular biology and evolution.

[24]  Jonathan M. Mudge,et al.  Neocentromeres in 15q24-26 map to duplicons which flanked an ancestral centromere in 15q25. , 2003, Genome research.

[25]  D. Haussler,et al.  Hotspots of mammalian chromosomal evolution , 2004, Genome Biology.

[26]  Jonathan M. Mudge,et al.  Genomic sequence and transcriptional profile of the boundary between pericentromeric satellites and genes on human chromosome arm 10p. , 2003, Genome research.

[27]  E. Eichler,et al.  Analysis of primate genomic variation reveals a repeat-driven expansion of the human genome. , 2003, Genome research.

[28]  B. Roe,et al.  Using a pericentromeric interspersed repeat to recapitulate the phylogeny and expansion of human centromeric segmental duplications. , 2003, Molecular biology and evolution.

[29]  E. Eichler,et al.  An Alu transposition model for the origin and expansion of human segmental duplications. , 2003, American journal of human genetics.

[30]  J. Rossier,et al.  The 200-kb segmental duplication on human chromosome 21 originates from a pericentromeric dissemination involving human chromosomes 2, 18 and 13. , 2003, Gene.

[31]  Patrick Gaudray,et al.  Segmental duplications in euchromatic regions of human chromosome 5: a source of evolutionary instability and transcriptional innovation. , 2003, Genome research.

[32]  Daniel Pinkel,et al.  Large-scale variation among human and great ape genomes determined by array comparative genomic hybridization. , 2003, Genome research.

[33]  S. Henikoff,et al.  Sequencing of a rice centromere uncovers active genes , 2004, Nature Genetics.

[34]  Randall A. Bolanos,et al.  Whole-genome shotgun assembly and comparison of human genome assemblies , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[35]  H. Willard,et al.  Analysis of the centromeric regions of the human genome assembly. , 2004, Trends in genetics : TIG.

[36]  E. Eichler,et al.  Molecular evolution of the human chromosome 15 pericentromeric region , 2004, Cytogenetic and Genome Research.

[37]  M. Rocchi,et al.  An alphoid DNA sequence conserved in all human and great ape chromosomes: evidence for ancient centromeric sequences at human chromosomal regions 2q21 and 9q13 , 1993, Human Genetics.

[38]  J. Grimwood,et al.  Closing the gaps on human chromosome 19 revealed genes with a high density of repetitive tandemly arrayed elements. , 2004, Genome research.

[39]  E. Eichler,et al.  Recent segmental duplications in the working draft assembly of the brown Norway rat. , 2004, Genome research.

[40]  Huntington F. Willard,et al.  Chromosome-specific subsets of human alpha satellite DNA: Analysis of sequence divergence within and between chromosomal subsets and evidence for an ancestral pentameric repeat , 2005, Journal of Molecular Evolution.

[41]  W. Willis,et al.  The Origin of Man : A Chromosomal Pictorial Legacy , 2014 .