Implementing and testing Bayesian and maximum-likelihood supertree methods in phylogenetics

Since their advent, supertrees have been increasingly used in large-scale evolutionary studies requiring a phylogenetic framework and substantial efforts have been devoted to developing a wide variety of supertree methods (SMs). Recent advances in supertree theory have allowed the implementation of maximum likelihood (ML) and Bayesian SMs, based on using an exponential distribution to model incongruence between input trees and the supertree. Such approaches are expected to have advantages over commonly used non-parametric SMs, e.g. matrix representation with parsimony (MRP). We investigated new implementations of ML and Bayesian SMs and compared these with some currently available alternative approaches. Comparisons include hypothetical examples previously used to investigate biases of SMs with respect to input tree shape and size, and empirical studies based either on trees harvested from the literature or on trees inferred from phylogenomic scale data. Our results provide no evidence of size or shape biases and demonstrate that the Bayesian method is a viable alternative to MRP and other non-parametric methods. Computation of input tree likelihoods allows the adoption of standard tests of tree topologies (e.g. the approximately unbiased test). The Bayesian approach is particularly useful in providing support values for supertree clades in the form of posterior probabilities.

[1]  O. Bininda-Emonds,et al.  Novel versus unsupported clades: assessing the qualitative support for clades in MRP supertrees. , 2003, Systematic biology.

[2]  François-Joseph Lapointe,et al.  Properties of supertree methods in the consensus setting. , 2007, Systematic biology.

[3]  M. Ragan Phylogenetic inference based on matrix representation of trees. , 1992, Molecular phylogenetics and evolution.

[4]  A. Purvis A composite estimate of primate phylogeny. , 1995, Philosophical transactions of the Royal Society of London. Series B, Biological sciences.

[5]  A. Rodrigo On combining cladograms , 1996 .

[6]  A. D. Gordon Consensus supertrees: The synthesis of rooted trees containing overlapping sets of labeled leaves , 1986 .

[7]  Hidetoshi Shimodaira An approximately unbiased test of phylogenetic tree selection. , 2002, Systematic biology.

[8]  Mike A. Steel,et al.  Computing the Distribution of a Tree Metric , 2009, IEEE ACM Trans. Comput. Biol. Bioinform..

[9]  Mark Wilkinson,et al.  Majority-rule supertrees. , 2007, Systematic biology.

[10]  James O. McInerney,et al.  Some Desiderata for Liberal Supertrees , 2004 .

[11]  Oliver Eulenstein,et al.  The shape of supertrees to come: tree shape related properties of fourteen supertree methods. , 2005, Systematic biology.

[12]  David Fernández-Baca,et al.  Robinson-Foulds Supertrees , 2010, Algorithms for Molecular Biology.

[13]  J. L. Gittleman,et al.  A complete phylogeny of the whales, dolphins and even‐toed hoofed mammals (Cetartiodactyla) , 2005, Biological reviews of the Cambridge Philosophical Society.

[14]  Mark Wilkinson,et al.  Measuring support and finding unsupported relationships in supertrees. , 2005, Systematic biology.

[15]  Davide Pisani,et al.  Supertrees disentangle the chimerical origin of eukaryotic genomes. , 2007, Molecular biology and evolution.

[16]  Maxim Teslenko,et al.  MrBayes 3.2: Efficient Bayesian Phylogenetic Inference and Model Choice Across a Large Model Space , 2012, Systematic biology.

[17]  Fred R. McMorris,et al.  Consensusn-trees , 1981 .

[18]  M. Chase,et al.  Complete generic-level phylogenetic analyses of palms (Arecaceae) with comparisons of supertree and supermatrix approaches. , 2009, Systematic biology.

[19]  Roderic D. M. Page,et al.  Modified Mincut Supertrees , 2002, WABI.

[20]  Peter G Foster,et al.  Modeling compositional heterogeneity. , 2004, Systematic biology.

[21]  Mike Steel,et al.  Maximum likelihood supertrees. , 2007, Systematic biology.

[22]  Mark Wilkinson,et al.  Matrix representation with parsimony, taxonomic congruence, and total evidence. , 2002, Systematic biology.

[23]  Andy Purvis,et al.  A Modification to Baum and Ragan's Method for Combining Phylogenetic Trees , 1995 .

[24]  J. G. Burleigh,et al.  Supertree bootstrapping methods for assessing phylogenetic variation among genes in genome-scale data sets. , 2006, Systematic biology.

[25]  Thylogale,et al.  THE AVERAGE CONSENSUS PROCEDURE: COMBINATION OF WEIGHTED TREES CONTAINING IDENTICAL OR OVERLAPPING SETS OF TAXA , 2009 .

[26]  A. Kupczok Split-based computation of majority-rule supertrees , 2011, BMC Evolutionary Biology.

[27]  F. Ronquist Matrix representation of trees, redundancy, and weighting , 1996 .

[28]  Thérèse A. Holton,et al.  Deep Genomic-Scale Analyses of the Metazoa Reject Coelomata: Evidence from Single- and Multigene Families Analyzed Under a Supertree and Supermatrix Paradigm , 2010, Genome biology and evolution.

[29]  James O. McInerney,et al.  Clann: investigating phylogenetic information through supertree analyses , 2005, Bioinform..

[30]  François-Joseph Lapointe,et al.  THE AVERAGE CONSENSUS PROCEDURE: COMBINATION OF WEIGHTED TREES CONTAINING IDENTICAL OR OVERLAPPING SETS OF TAXA , 1997 .

[31]  G Perrière,et al.  Bacterial molecular phylogeny using supertree approach. , 2001, Genome informatics. International Conference on Genome Informatics.

[32]  David Fernández-Baca,et al.  Flipping: A supertree construction method , 2001, Bioconsensus.

[33]  B. Baum Combining trees as a way of combining data sets for phylogenetic inference, and the desirability of combining gene trees , 1992 .

[34]  Mark D. Wilkinson,et al.  L.U.St: a tool for approximated maximum likelihood supertree reconstruction , 2014, BMC Bioinformatics.

[35]  Mark Wilkinson,et al.  Conservative supertrees. , 2011, Systematic biology.

[36]  D. Robinson,et al.  Comparison of phylogenetic trees , 1981 .

[37]  M. Wilkinson Common Cladistic Information and its Consensus Representation: Reduced Adams and Reduced Cladistic Consensus Trees and Profiles , 1994 .

[38]  M. Donoghue,et al.  Increasing data transparency and estimating phylogenetic uncertainty in supertrees: Approaches using nonparametric bootstrapping. , 2006, Systematic biology.

[39]  Masami Hasegawa,et al.  CONSEL: for assessing the confidence of phylogenetic tree selection , 2001, Bioinform..

[40]  Katrin Nyakatura,et al.  Updating the evolutionary history of Carnivora (Mammalia): a new species-level supertree complete with divergence time estimates , 2012, BMC Biology.

[41]  Simon A. A. Travers,et al.  Does a tree–like phylogeny only exist at the tips in the prokaryotes? , 2004, Proceedings of the Royal Society of London. Series B: Biological Sciences.

[42]  Marcello Ruta,et al.  Dinosaurs and the Cretaceous Terrestrial Revolution , 2008, Proceedings of the Royal Society B: Biological Sciences.

[43]  Wilkinson,et al.  Testing the phylogenetic stability of early tetrapods , 1999, Journal of theoretical biology.

[44]  Charles Semple,et al.  A supertree method for rooted trees , 2000, Discret. Appl. Math..

[45]  J. Cotton,et al.  Supertrees join the mainstream of phylogenetics. , 2009, Trends in ecology & evolution.