Prevalence of intron gain over intron loss in the evolution of paralogous gene families.

The mechanisms and evolutionary dynamics of intron insertion and loss in eukaryotic genes remain poorly understood. Reconstruction of parsimonious scenarios of gene structure evolution in paralogous gene families in animals and plants revealed numerous gains and losses of introns. In all analyzed lineages, the number of acquired new introns was substantially greater than the number of lost ancestral introns. This trend held even for lineages in which vertical evolution of genes involved more intron losses than gains, suggesting that gene duplication boosts intron insertion. However, dating gene duplications and the associated intron gains and losses based on the molecular clock assumption showed that very few, if any, introns were gained during the last approximately 100 million years of animal and plant evolution, in agreement with previous conclusions reached through analysis of orthologous gene sets. These results are generally compatible with the emerging notion of intensive insertion and loss of introns during transitional epochs in contrast to the relative quiet of the intervening evolutionary spans.

[1]  Hidemi Watanabe,et al.  A genomic timescale for the origin of eukaryotes , 2001, BMC Evolutionary Biology.

[2]  Michael Lynch,et al.  The evolution of spliceosomal introns. , 2002, Current opinion in genetics & development.

[3]  Christian E. V. Storm,et al.  Automatic clustering of orthologs and in-paralogs from pairwise species comparisons. , 2001, Journal of molecular biology.

[4]  Temple F. Smith,et al.  Comparison of the complete protein sets of worm and yeast: orthology and divergence. , 1998, Science.

[5]  W. Fitch Homology a personal view on some of the problems. , 2000, Trends in genetics : TIG.

[6]  Russell F. Doolittle,et al.  Intron Distribution in Ancient Paralogs Supports Random Insertion and Not Random Loss , 1997, Journal of Molecular Evolution.

[7]  A Yoshida,et al.  Exon/intron structure of aldehyde dehydrogenase genes supports the "introns-late" theory. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[8]  A. Lamond RNA splicing: Running rings around RNA , 1999, Nature.

[9]  Dr. Susumu Ohno Evolution by Gene Duplication , 1970, Springer Berlin Heidelberg.

[10]  S. Aubourg,et al.  Evolution of intron/exon structure of DEAD helicase family genes in Arabidopsis, Caenorhabditis, and Drosophila. , 2001, Genome research.

[11]  D. Labie,et al.  Molecular Evolution , 1991, Nature.

[12]  Sudhir Kumar,et al.  Divergence time estimates for the early history of animal phyla and the origin of plants, animals and fungi , 1999, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[13]  D. Penny,et al.  The modern molecular clock , 2003, Nature Reviews Genetics.

[14]  Sudhir Kumar,et al.  Genomic clocks and evolutionary timescales. , 2003, Trends in genetics : TIG.

[15]  E V Koonin,et al.  Lineage-specific gene expansions in bacterial and archaeal genomes. , 2001, Genome research.

[16]  Alexei Fedorov,et al.  Large-scale comparison of intron positions among animal, plant, and fungal genes , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[17]  W. Gilbert,et al.  Large-scale comparison of intron positions in mammalian genes shows intron loss but no gain , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[18]  E. Koonin,et al.  Selection in the evolution of gene duplications , 2002, Genome Biology.

[19]  J. Haldane,et al.  The Part Played by Recurrent Mutation in Evolution , 1933, The American Naturalist.

[20]  Darren A. Natale,et al.  The COG database: an updated version includes eukaryotes , 2003, BMC Bioinformatics.

[21]  Mark A McPeek,et al.  Estimating metazoan divergence times with a molecular clock. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[22]  M. Lynch,et al.  The Origins of Genome Complexity , 2003, Science.

[23]  W. Doolittle,et al.  Reconstructing/Deconstructing the Earliest Eukaryotes How Comparative Genomics Can Help , 2001, Cell.

[24]  Igor B. Rogozin,et al.  Evidence of Splice Signal Migration from Exon to Intron during Intron Evolution , 2003, Current Biology.

[25]  N. Dibb,et al.  Proto-splice site model of intron origin. , 1991, Journal of theoretical biology.

[26]  T. D. Schneider,et al.  Features of spliceosome evolution and function inferred from an analysis of the information at human splice sites. , 1992, Journal of molecular biology.

[27]  G Glusman,et al.  Sequence, structure, and evolution of a complete human olfactory receptor gene cluster. , 2000, Genomics.

[28]  S. Aubourg,et al.  Introns in, introns out in plant gene families: a genomic approach of the dynamics of gene structure , 2004, Journal of Structural and Functional Genomics.

[29]  M. Lynch,et al.  The evolutionary fate and consequences of duplicate genes. , 2000, Science.

[30]  A. Force,et al.  The probability of duplicate gene preservation by subfunctionalization. , 2000, Genetics.

[31]  E. Koonin,et al.  Remarkable Interkingdom Conservation of Intron Positions and Massive, Lineage-Specific Intron Loss and Gain in Eukaryotic Evolution , 2003, Current Biology.

[32]  Alexei Fedorov,et al.  Mystery of intron gain. , 2003, Genome research.

[33]  I. Ebersberger,et al.  A variable intron distribution in globin genes of Chironomus: evidence for recent intron gain. , 1997, Gene.

[34]  A. Newman,et al.  Exon Junction Sequences as Cryptic Splice Sites Implications for Intron Origin , 2004, Current Biology.

[35]  Jodie J. Yin,et al.  A comprehensive evolutionary classification of proteins encoded in complete eukaryotic genomes , 2004, Genome Biology.

[36]  Eugene V Koonin,et al.  A Non-Adaptationist Perspective on Evolution of Genomic Complexity or the Continued Dethroning of Man , 2004, Cell cycle.

[37]  A. Newman,et al.  Evidence that introns arose at proto‐splice sites. , 1989, The EMBO journal.

[38]  J. Farris Phylogenetic Analysis Under Dollo's Law , 1977 .

[39]  J. Felsenstein Inferring phylogenies from protein sequences by parsimony, distance, and likelihood methods. , 1996, Methods in enzymology.

[40]  W. Fitch Distinguishing homologous from analogous proteins. , 1970, Systematic zoology.

[41]  F. Ayala,et al.  Origin of the metazoan phyla: molecular clocks confirm paleontological estimates. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[42]  D. Lipman,et al.  A genomic perspective on protein families. , 1997, Science.

[43]  F. Ayala,et al.  A new Drosophila spliceosomal intron position is common in plants , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[44]  J. Rogers,et al.  How were introns inserted into nuclear genes? , 1989, Trends in genetics : TIG.

[45]  E. Koonin,et al.  Orthology, paralogy and proposed classification for paralog subtypes. , 2002, Trends in genetics : TIG.

[46]  L. Pauling,et al.  Molecules as documents of evolutionary history. , 1965, Journal of theoretical biology.

[47]  J. Logsdon,et al.  The recent origins of spliceosomal introns revisited. , 1998, Current opinion in genetics & development.

[48]  A. Hughes The evolution of functionally novel proteins after gene duplication , 1994, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[49]  Stephane Aris-Brosou,et al.  Bayesian models of episodic evolution support a late precambrian explosive diversification of the Metazoa. , 2003, Molecular biology and evolution.

[50]  J. Thompson,et al.  CLUSTAL W: improving the sensitivity of progressive multiple sequence alignment through sequence weighting, position-specific gap penalties and weight matrix choice. , 1994, Nucleic acids research.

[51]  M. Kimura,et al.  The neutral theory of molecular evolution. , 1983, Scientific American.

[52]  E. Koonin,et al.  The role of lineage-specific gene family expansion in the evolution of eukaryotes. , 2002, Genome research.