The evolutionary fate of MULE-mediated duplications of host gene fragments in rice.

DNA transposons are known to frequently capture duplicated fragments of host genes. The evolutionary impact of this phenomenon depends on how frequently the fragments retain protein-coding function as opposed to becoming pseudogenes. Gene fragment duplication by Mutator-like elements (MULEs) has previously been documented in maize, Arabidopsis, and rice. Here we present a rigorous genome-wide analysis of MULEs in the model plant Oryza sativa (domesticated rice). We identify 8274 MULEs with intact termini and target-site duplications (TSDs) and show that 1337 of them contain duplicated host gene fragments. Through a detailed examination of the 5% of duplicated gene fragments that are transcribed, we demonstrate that virtually all cases contain pseudogenic features such as fragmented conserved protein domains, frameshifts, and premature stop codons. In addition, we show that the distribution of the ratio of nonsynonymous to synonymous amino acid substitution rates for the duplications agrees with the expected distribution for pseudogenes. We conclude that MULE-mediated host gene duplication results in the formation of pseudogenes, not novel functional protein-coding genes; however, the transcribed duplications possess characteristics consistent with a potential role in the regulation of host gene expression.

[1]  Takuji Sasaki,et al.  The map-based sequence of the rice genome , 2005, Nature.

[2]  John B. Anderson,et al.  CDD: a Conserved Domain Database for protein classification , 2004, Nucleic Acids Res..

[3]  Cathy H. Wu,et al.  The Universal Protein Resource (UniProt) , 2004, Nucleic Acids Res..

[4]  Asger Hobolth,et al.  Comparative analysis of protein coding sequences from human, mouse and the domesticated pig , 2005, BMC Biology.

[5]  Franck Vazquez,et al.  Endogenous trans-acting siRNAs regulate the accumulation of Arabidopsis mRNAs. , 2004, Molecular cell.

[6]  Sean R. Eddy,et al.  Pack-MULE transposable elements mediate gene evolution in plants , 2004, Nature.

[7]  T. Tuschl,et al.  Mechanisms of gene silencing by double-stranded RNA , 2004, Nature.

[8]  P. Rouzé,et al.  Detection of 91 potential conserved plant microRNAs in Arabidopsis thaliana and Oryza sativa identifies important target genes. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[9]  Frédérique Bitton,et al.  Genome-Wide Analysis of Arabidopsis Pentatricopeptide Repeat Proteins Reveals Their Essential Role in Organelle Biogenesis , 2004, The Plant Cell Online.

[10]  E. Nitasaka,et al.  Characterization of Tpn1 family in the Japanese morning glory: En/Spm-related transposable elements capturing host genes. , 2004, Plant & cell physiology.

[11]  H. Kazazian Mobile Elements: Drivers of Genome Evolution , 2004, Science.

[12]  D. Bartel MicroRNAs Genomics, Biogenesis, Mechanism, and Function , 2004, Cell.

[13]  Richard M. Bruskiewich,et al.  Transposable element annotation of the rice genome , 2004, Bioinform..

[14]  Patrice Koehl,et al.  The ASTRAL Compendium in 2004 , 2003, Nucleic Acids Res..

[15]  C. Robin Buell,et al.  The TIGR Plant Repeat Databases: a collective resource for the identification of repetitive sequences in plants , 2004, Nucleic Acids Res..

[16]  M. Suyama,et al.  A genome-wide survey of human pseudogenes. , 2003, Genome research.

[17]  Mark Gerstein,et al.  Millions of years of evolution preserved: a comprehensive catalog of the processed pseudogenes in the human genome. , 2003, Genome research.

[18]  F. Ayala,et al.  Pseudogenes: are they "junk" or functional DNA? , 2003, Annual review of genetics.

[19]  Mark Gerstein,et al.  A "polyORFomic" analysis of prokaryote genomes using disabled-homology filtering reveals conserved but undiscovered short ORFs. , 2003, Journal of molecular biology.

[20]  Javier F. Palatnik,et al.  Control of leaf morphogenesis by microRNAs , 2003, Nature.

[21]  F. Kaper,et al.  Hop, an active Mutator-like element in the genome of the fungus Fusarium oxysporum. , 2003, Molecular biology and evolution.

[22]  J. Kawai,et al.  Collection, Mapping, and Annotation of Over 28,000 cDNA Clones from japonica Rice , 2003, Science.

[23]  Atsushi Yoshiki,et al.  An expressed pseudogene regulates the messenger-RNA stability of its homologous coding gene , 2003, Nature.

[24]  Colin N. Dewey,et al.  Initial sequencing and comparative analysis of the mouse genome. , 2002 .

[25]  W. J. Kent,et al.  BLAT--the BLAST-like alignment tool. , 2002, Genome research.

[26]  Mark Gerstein,et al.  Molecular fossils in the human genome: identification and analysis of the pseudogenes in chromosomes 21 and 22. , 2002, Genome research.

[27]  D. Hartl,et al.  A maximum likelihood method for analyzing pseudogene evolution: implications for silent site evolution in humans and rodents. , 2002, Molecular biology and evolution.

[28]  V. Walbot,et al.  The MuDR transposon terminal inverted repeat contains a complex plant promoter directing distinct somatic and germinal programs. , 2008, The Plant journal : for cell and molecular biology.

[29]  T. Bureau,et al.  Survey of transposable elements from rice genomic sequences. , 2008, The Plant journal : for cell and molecular biology.

[30]  S. Wright,et al.  Mutator-like elements in Arabidopsis thaliana. Structure, diversity and evolution. , 2000, Genetics.

[31]  S Wright,et al.  Transposon diversity in Arabidopsis thaliana. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[32]  V. Solovyev,et al.  Ab initio gene finding in Drosophila genomic DNA. , 2000, Genome research.

[33]  M. O'Shea,et al.  Neuronal Expression of Neural Nitric Oxide Synthase (nNOS) Protein Is Suppressed by an Antisense RNA Transcribed from an NOS Pseudogene , 1999, The Journal of Neuroscience.

[34]  Thomas L. Madden,et al.  BLAST 2 Sequences, a new tool for comparing protein and nucleotide sequences. , 1999, FEMS microbiology letters.

[35]  Sean R. Eddy,et al.  Profile hidden Markov models , 1998, Bioinform..

[36]  Ziheng Yang,et al.  PAML: a program package for phylogenetic analysis by maximum likelihood , 1997, Comput. Appl. Biosci..

[37]  S. Wessler,et al.  Transduction of a cellular gene by a plant retroelement , 1994, Cell.

[38]  Johng K. Lim,et al.  Gross chromosome rearrangements mediated by transposable elements in Drosophila melanogaster , 1994, BioEssays : news and reviews in molecular, cellular and developmental biology.

[39]  E. Myers,et al.  Basic local alignment search tool. , 1990, Journal of molecular biology.

[40]  E. Ralston,et al.  Chromosome-breaking structure in maize involving a fractured Ac element. , 1989, Proceedings of the National Academy of Sciences of the United States of America.

[41]  V. Chandler,et al.  Characterization of a highly conserved sequence related to mutator transposable elements in maize. , 1988, Molecular biology and evolution.

[42]  M. Nei,et al.  Pseudogenes as a paradigm of neutral evolution , 1981, Nature.