Infinitely Long Branches and an Informal Test of Common Ancestry

The evidence for universal common ancestry (UCA) is vast and persuasive, and a phylogenetic test was proposed for quantifying its odds against independently originated sequences based on the comparison between one and several trees [1]. This test was successfully applied to a well-supported homologous sequence alignment, being however criticized once simulations showed that even alignments without any phylogenetic structure could mislead its conclusions [2]. Despite claims to the contrary [3], we believe that the counterexample successfully showed a drawback of the test, of relying on good alignments. Here we present a simplified version of this counterexample, which can be interpreted as a tree with arbitrarily long branches, and where the test again fails. We also present another simulation showing circumstances whereby any sufficiently similar alignment will favor UCA irrespective of the true independent origins for the sequences. We therefore conclude that the test should not be trusted unless convergence has already been ruled out a priori. Finally, we present a class of frequentist tests that perform better than the purportedly formal UCA test.

[1]  David L. Swofford,et al.  THE TOPOLOGY-DEPENDENT PERMUTATION TEST FOR MONOPHYLY DOES NOT TEST FOR MONOPHYLY , 1996 .

[2]  Robert C. Edgar,et al.  MUSCLE: multiple sequence alignment with high accuracy and high throughput. , 2004, Nucleic acids research.

[3]  Ziheng Yang,et al.  INDELible: A Flexible Simulator of Biological Sequence Evolution , 2009, Molecular biology and evolution.

[4]  M. Hasegawa,et al.  Was the universal common ancestry proved? , 2010, Nature.

[5]  David R. Anderson,et al.  Model selection and multimodel inference : a practical information-theoretic approach , 2003 .

[6]  M. Wilkinson,et al.  Type 1 error rates of the parsimony permutation tail probability test. , 2002, Systematic biology.

[7]  Olivier Gascuel,et al.  Empirical profile mixture models for phylogenetic reconstruction , 2008, Bioinform..

[8]  M. Hasegawa,et al.  Some Problems in Proving the Existence of the Universal Common Ancestor of Life on Earth , 2012, TheScientificWorldJournal.

[9]  Jun S. Liu,et al.  BALSA: Bayesian algorithm for local sequence alignment. , 2002, Nucleic acids research.

[10]  D. Posada,et al.  Model selection and model averaging in phylogenetics: advantages of akaike information criterion and bayesian approaches over likelihood ratio tests. , 2004, Systematic biology.

[11]  H. Philippe,et al.  A Bayesian mixture model for across-site heterogeneities in the amino-acid replacement process. , 2004, Molecular biology and evolution.

[12]  D. Penny,et al.  Beyond Reasonable Doubt: Evolution from DNA Sequences , 2013, PloS one.

[13]  J. Felsenstein,et al.  Inching toward reality: An improved likelihood model of sequence evolution , 2004, Journal of Molecular Evolution.

[14]  D. Theobald On universal common ancestry, sequence similarity, and phylogenetic structure: the sins of P-values and the virtues of Bayesian evidence , 2011, Biology Direct.

[15]  Eugene V Koonin,et al.  The common ancestry of life , 2010, Biology Direct.

[16]  Michael D. Hendy,et al.  A Framework for the Quantitative Study of Evolutionary Trees , 1989 .

[17]  Leonardo de Oliveira Martins,et al.  Proving universal common ancestry with similar sequences. , 2012, Trends in evolutionary biology.

[18]  David R. Anderson,et al.  Multimodel Inference , 2004 .

[19]  Douglas L. Theobald,et al.  A formal test of the theory of universal common ancestry , 2010, Nature.

[20]  David Bryant,et al.  Hadamard Phylogenetic Methods and the n-taxon Process , 2008, Bulletin of mathematical biology.

[21]  Daniel P. Faith,et al.  COULD A CLADOGRAM THIS SHORT HAVE ARISEN BY CHANCE ALONE?: ON PERMUTATION TESTS FOR CLADISTIC STRUCTURE , 1991 .

[22]  W. Bruno Modeling residue usage in aligned protein sequences via maximum likelihood. , 1996, Molecular biology and evolution.

[23]  J. Felsenstein Evolutionary trees from DNA sequences: A maximum likelihood approach , 2005, Journal of Molecular Evolution.

[24]  D. Posada,et al.  Testing for Universal Common Ancestry , 2014, Systematic biology.