Structural parsimony: Reductions in sequence space

Computational phylogenetics has historically neglected strict theoretical approaches that exploit the mathematical models beneath which it abstracts away the nuances of evolution. In particular, parsimony is conceptually simple and amenable to rigorous treatment, and has a clear analogue in graph theory, the Steiner tree. We present and refine the notion of sequence space as the soil from which all graph-theoretical methods arise, studying its structural properties and complexity with an eye on maximum parsimony. We therefrom introduce a basic set of very efficient implicit reductions that discard information with a fixed effect on the optimality of the solution, and show how it can be applied to large, real datasets.

[1]  A. Volgenant,et al.  An edge elimination test for the steiner problem in graphs , 1989 .

[2]  Christian Wulff-Nilsen,et al.  A novel approach to phylogenetic trees: d-Dimensional geometric Steiner trees , 2009, Networks.

[3]  Richard W. Hamming,et al.  Error detecting and error correcting codes , 1950 .

[4]  Walter M. Fitch,et al.  On the Problem of Discovering the Most Parsimonious Tree , 1977, The American Naturalist.

[5]  R. Graham,et al.  The steiner problem in phylogeny is NP-complete , 1982 .

[6]  Dharma P. Agrawal,et al.  Generalized Hypercube and Hyperbus Structures for a Computer Network , 1984, IEEE Transactions on Computers.

[7]  R. Sokal,et al.  A METHOD FOR DEDUCING BRANCHING SEQUENCES IN PHYLOGENY , 1965 .

[8]  F. Hwang,et al.  The Steiner Tree Problem , 2012 .

[9]  L. R. Foulds,et al.  A graph theoretic approach to the development of minimal phylogenetic trees , 1979, Journal of Molecular Evolution.

[10]  M. Eigen,et al.  Statistical geometry in sequence space: a method of quantitative comparative sequence analysis. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[11]  John E. Beasley An algorithm for the steiner problem in graphs , 1984, Networks.

[12]  W. Fitch Toward Defining the Course of Evolution: Minimum Change for a Specific Tree Topology , 1971 .

[13]  John Maynard Smith,et al.  Natural Selection and the Concept of a Protein Space , 1970, Nature.

[14]  Juraj Hromkovic,et al.  Reoptimization of Steiner trees: Changing the terminal set , 2009, Theor. Comput. Sci..

[15]  Thorsten Koch,et al.  Solving Steiner tree problems in graphs to optimality , 1998, Networks.

[16]  Ludwig Nastansky,et al.  Cost-minimal trees in directed acyclic graphs , 1974, Z. Oper. Research.