An Expectation-Maximization Algorithm for Analysis of Evolution of Exon-Intron Structure of Eukaryotic Genes

We propose a detailed model of the evolution of exon-intron structure in eukaryotic genes. The model accounts for gene-specific intron gain and loss rates, branch-specific gain and loss coefficients, invariant sites incapable of intron gain, and gamma-distributed variability of both gain and loss rates across sites. We develop an expectation-maximization algorithm to estimate the parameters of this model and study its performance on simulated data.
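
As a rough illustration of two of the ingredients named above (gain and loss rates, and invariant sites incapable of intron gain), the following minimal sketch treats intron presence at one site as a two-state continuous-time Markov process on a single branch. It is not the model or the algorithm of the paper: the function names, the parameter `p_invariant`, and the single-branch setting are assumptions made for illustration, and gamma rate variation and the expectation-maximization procedure are omitted.

```python
import math


def transition_probs(gain, loss, t):
    """Return the 2x2 transition matrix over a branch of length t.

    States: 0 = intron absent, 1 = intron present.
    gain and loss are hypothetical per-site rates (illustrative only).
    """
    total = gain + loss
    if total == 0.0:
        # No events can occur, so the state never changes.
        return [[1.0, 0.0], [0.0, 1.0]]
    decay = math.exp(-total * t)
    p_gain = gain / total * (1.0 - decay)  # P(0 -> 1)
    p_loss = loss / total * (1.0 - decay)  # P(1 -> 0)
    return [[1.0 - p_gain, p_gain], [p_loss, 1.0 - p_loss]]


def prob_gain_with_invariant(gain, loss, t, p_invariant):
    """Probability that an intron-free site carries an intron after time t.

    p_invariant is the assumed fraction of intron-free sites that can
    never gain an intron; only the remaining sites follow the Markov process.
    """
    return (1.0 - p_invariant) * transition_probs(gain, loss, t)[0][1]


if __name__ == "__main__":
    print(transition_probs(0.2, 0.5, 1.0))
    print(prob_gain_with_invariant(0.2, 0.5, 1.0, p_invariant=0.3))
```

On a very long branch the gain probability in this sketch approaches gain / (gain + loss), scaled down by the fraction of sites that are not invariant.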
