Distance-Based Phylogenetic Methods Around a Polytomy

Distance-based phylogenetic algorithms attempt to solve the NP-hard least-squares phylogeny problem by mapping an arbitrary dissimilarity map representing biological data to a tree metric. The set of all dissimilarity maps is a Euclidean space properly containing the space of all tree metrics as a polyhedral fan. Outputs of distance-based tree reconstruction algorithms such as UPGMA and neighbor-joining are points in the maximal cones in the fan. Tree metrics with polytomies lie at the intersections of maximal cones. A phylogenetic algorithm divides the space of all dissimilarity maps into regions based upon which combinatorial tree is reconstructed by the algorithm. Comparison of phylogenetic methods can be done by comparing the geometry of these regions. We use polyhedral geometry to compare the local nature of the subdivisions induced by least-squares phylogeny, UPGMA, and neighbor-joining when the true tree has a single polytomy with exactly four neighbors. Our results suggest that in some circumstances, UPGMA and neighbor-joining poorly match least-squares phylogeny.

[1]  W. H. Day Computational complexity of inferring phylogenies from dissimilarity matrices. , 1987, Bulletin of mathematical biology.

[2]  P. Buneman A Note on the Metric Properties of Trees , 1974 .

[3]  H. Ross Principles of Numerical Taxonomy , 1964 .

[4]  Louis J. Billera,et al.  Geometry of the Space of Phylogenetic Trees , 2001, Adv. Appl. Math..

[5]  P. Buneman The Recovery of Trees from Measures of Dissimilarity , 1971 .

[6]  Lior Pachter,et al.  On the optimality of the neighbor-joining algorithm , 2007, Algorithms for Molecular Biology.

[7]  N. Saitou,et al.  The neighbor-joining method: a new method for reconstructing phylogenetic trees. , 1987, Molecular biology and evolution.

[8]  O Gascuel,et al.  BIONJ: an improved version of the NJ algorithm based on a simple model of sequence data. , 1997, Molecular biology and evolution.

[9]  O. Gascuel,et al.  Neighbor-joining revealed. , 2006, Molecular biology and evolution.

[10]  Caroline J. Klivans,et al.  The Bergman complex of a matroid and phylogenetic trees , 2006, J. Comb. Theory, Ser. B.

[11]  G. Ziegler Lectures on Polytopes , 1994 .

[12]  A. Halpern,et al.  Weighted neighbor joining: a likelihood-based approach to distance-based phylogeny reconstruction. , 2000, Molecular biology and evolution.

[13]  Ruriko Yoshida,et al.  Optimality of the Neighbor Joining Algorithm and Faces of the Balanced Minimum Evolution Polytope , 2010, Bulletin of mathematical biology.

[14]  Serkan Hosten,et al.  Least Squares Methods for Equidistant Tree Reconstruction , 2008, 0808.3979.

[15]  O. Gascuel,et al.  Theoretical foundation of the balanced minimum evolution method of phylogenetic inference and its relationship to weighted least-squares tree fitting. , 2003, Molecular biology and evolution.

[16]  P. Sneath,et al.  Numerical Taxonomy , 1962, Nature.

[17]  David E. Speyer,et al.  The tropical Grassmannian , 2003, math/0304218.

[18]  Y. Pauplin Direct Calculation of a Tree Length Using a Distance Matrix , 2000, Journal of Molecular Evolution.

[19]  Seth Sullivant,et al.  Polyhedral combinatorics of UPGMA cones , 2013, Adv. Appl. Math..