Wagner and Dollo: a stochastic duet by composing two parsimonious solos.

New contributions toward generalizing evolutionary models expand greatly our ability to analyze complex evolutionary characters and advance phylogeny reconstruction. In this article, we extend the binary stochastic Dollo model to allow for multi-state characters. In doing so, we align previously incompatible Wagner and Dollo parsimony principles under a common probabilistic framework by embedding arbitrary continuous-time Markov chains into the binary stochastic Dollo model. This approach enables us to analyze character traits that exhibit both Dollo and Wagner characteristics throughout their evolutionary histories. Utilizing Bayesian inference, we apply our novel model to analyze intron conservation patterns and the evolution of alternatively spliced exons. The generalized framework we develop demonstrates potential in distinguishing between phylogenetic hypotheses and providing robust estimates of evolutionary rates. Moreover, for the two applications analyzed here, our framework is the first to provide an adequate stochastic process for the data. We discuss possible extensions to the framework from both theoretical and applied perspectives.

[1]  J. Farris,et al.  Quantitative Phyletics and the Evolution of Anurans , 1969 .

[2]  F. Crick Central Dogma of Molecular Biology , 1970, Nature.

[3]  S. Gould,et al.  Dollo on Dollo's law: Irreversibility and the status of evolutionary laws , 1970, Journal of the history of biology.

[4]  W. J. Quesne The Uniquely Evolved Character Concept and its Cladistic Application , 1974 .

[5]  J. Farris Phylogenetic Analysis Under Dollo's Law , 1977 .

[6]  C. Loan,et al.  Nineteen Dubious Ways to Compute the Exponential of a Matrix , 1978 .

[7]  J. Felsenstein,et al.  Invariants of phylogenies in a simple case with discrete states , 1987 .

[8]  M. Newton Approximate Bayesian-inference With the Weighted Likelihood Bootstrap , 1994 .

[9]  R. Raff,et al.  Evidence for a clade of nematodes, arthropods and other moulting animals , 1997, Nature.

[10]  Andrew Gelman,et al.  General methods for monitoring convergence of iterative simulations , 1998 .

[11]  B. Hausdorf Early evolution of the bilateria. , 2000, Systematic biology.

[12]  Bradley P. Carlin,et al.  Markov Chain Monte Carlo Methods for Computing Bayes Factors , 2001 .

[13]  M. Suchard,et al.  Bayesian selection of continuous-time Markov chain evolutionary models. , 2001, Molecular biology and evolution.

[14]  E. Koonin,et al.  Remarkable Interkingdom Conservation of Intron Positions and Massive, Lineage-Specific Intron Loss and Gain in Eukaryotic Evolution , 2003, Current Biology.

[15]  Christopher J. Lee,et al.  Alternative splicing in the human, mouse and rat genomes is associated with an increased frequency of exon creation and/or loss , 2003, Nature Genetics.

[16]  Cleve B. Moler,et al.  Nineteen Dubious Ways to Compute the Exponential of a Matrix, Twenty-Five Years Later , 1978, SIAM Rev..

[17]  E. Koonin,et al.  Coelomata and not Ecdysozoa: evidence from genome-wide phylogenetic analysis. , 2003, Genome research.

[18]  H. Philippe,et al.  Multigene analyses of bilaterian animals corroborate the monophyly of Ecdysozoa, Lophotrochozoa, and Protostomia. , 2005, Molecular biology and evolution.

[19]  Christopher J. Lee,et al.  Protein Modularity of Alternatively Spliced Exons Is Associated with Tissue-Specific Regulation of Alternative Splicing , 2005, PLoS genetics.

[20]  T. Warnow,et al.  A STOCHASTIC MODEL OF LANGUAGE EVOLUTION THAT INCORPORATES HOMOPLASY AND BORROWING , 2005 .

[21]  W. Gilbert,et al.  Resolution of a deep animal divergence by the pattern of intron conservation. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[22]  J. Felsenstein Evolutionary trees from DNA sequences: A maximum likelihood approach , 2005, Journal of Molecular Evolution.

[23]  Geoff K. Nicholls,et al.  Quantifying uncertainty in a stochastic model of vocabulary evolution , 2006 .

[24]  P. Forster,et al.  Phylogenetic Methods and the Prehistory of Languages , 2006 .

[25]  Tony O’Hagan Bayes factors , 2006 .

[26]  Geoff K. Nicholls,et al.  Dated ancestral trees from binary trait data and their application to the diversification of languages , 2007, 0711.1874.

[27]  Christopher J. Lee,et al.  Global analysis of exon creation versus loss and the role of alternative splicing in 17 vertebrate genomes. , 2007, RNA.

[28]  E. Koonin,et al.  Analysis of rare amino acid replacements supports the Coelomata clade. , 2007, Molecular biology and evolution.

[29]  A. Rambaut,et al.  BEAST: Bayesian evolutionary analysis by sampling trees , 2007, BMC Evolutionary Biology.

[30]  Marc A Suchard,et al.  Counting labeled transitions in continuous-time Markov models of evolution , 2007, Journal of mathematical biology.

[31]  Marco A. R. Ferreira,et al.  Bayesian analysis of elapsed times in continuous‐time Markov chains , 2008 .