Evolutionary fate of retroposed gene copies in the human genome.

Given that retroposed copies of genes are presumed to lack the regulatory elements required for their expression, retroposition has long been considered a mechanism without functional relevance. However, through an in silico assay for transcriptional activity, we identify here >1,000 transcribed retrocopies in the human genome, of which at least approximately 120 have evolved into bona fide genes. Among these, approximately 50 retrogenes have evolved functions in testes, more than half of which were recruited as functional autosomal counterparts of X-linked genes during spermatogenesis. Generally, retrogenes emerge "out of the testis," because they are often initially transcribed in testis and later evolve stronger and sometimes more diverse spatial expression patterns. We find a significant excess of transcribed retrocopies close to other genes or within introns, suggesting that retrocopies can exploit the regulatory elements and/or open chromatin of neighboring genes to become transcribed. In direct support of this hypothesis, we identify 36 retrocopy-host gene fusions, including primate-specific chimeric genes. Strikingly, 27 intergenic retrogenes have acquired untranslated exons de novo during evolution to achieve high expression levels. Notably, our screen for highly transcribed retrocopies also uncovered a retrogene linked to a human recessive disorder, gelatinous drop-like corneal dystrophy, a form of blindness. These functional implications for retroposition notwithstanding, we find that the insertion of retrocopies into genes is generally deleterious, because it may interfere with the transcription of host genes. Our results demonstrate that natural selection has been fundamental in shaping the retrocopy repertoire of the human genome.
