Solving the tree containment problem in linear time for nearly stable phylogenetic networks

Abstract A phylogenetic network is a rooted acyclic digraph whose leaves are uniquely labeled with a set of taxa. The tree containment problem asks whether or not a phylogenetic network displays a phylogenetic tree over the same set of labeled leaves. It is a fundamental problem arising from validation of phylogenetic network models. The tree containment problem is NP -complete in general. To identify network classes on which the problem is polynomial time solvable, we introduce two classes of networks by generalizations of tree-child networks through vertex stability, namely nearly stable networks and genetically stable networks. Here, we study the combinatorial properties of these two classes of phylogenetic networks. We also develop a linear-time algorithm for solving the tree containment problem on binary nearly stable networks.

[1]  Gabriel Cardona,et al.  Comparison of Tree-Child Phylogenetic Networks , 2007, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[2]  Ge Xia,et al.  Seeing the trees and their branches in the network is hard , 2007, Theor. Comput. Sci..

[3]  G. Carlsson,et al.  Topology of viral evolution , 2013, Proceedings of the National Academy of Sciences.

[4]  L. Nakhleh,et al.  Computational approaches to species phylogeny inference and gene tree reconciliation. , 2013, Trends in ecology & evolution.

[5]  E. Rocha,et al.  Horizontal Transfer, Not Duplication, Drives the Expansion of Protein Families in Prokaryotes , 2011, PLoS genetics.

[6]  Bhaskar DasGupta,et al.  A decomposition theorem and two algorithms for reticulation-visible networks , 2017, Inf. Comput..

[7]  Dan Gusfield,et al.  ReCombinatorics: The Algorithmics of Ancestral Recombination Graphs and Explicit Phylogenetic Networks , 2014 .

[8]  Mike A. Steel,et al.  Which Phylogenetic Networks are Merely Trees with Additional Arcs? , 2015, Systematic biology.

[9]  Louxin Zhang,et al.  Bounding the Size of a Network Defined By Visibility Property , 2015, ArXiv.

[10]  Bengt Oxelman,et al.  Inferring Species Networks from Gene Trees in High-Polyploid North American and Hawaiian Violets (Viola, Violaceae) , 2011, Systematic biology.

[11]  Louxin Zhang,et al.  Locating a Tree in a Phylogenetic Network in Quadratic Time , 2015, RECOMB.

[12]  Paul A. Jenkins,et al.  Genealogy-Based Methods for Inference of Historical Recombination and Gene Flow and Their Application in Saccharomyces cerevisiae , 2012, PloS one.

[13]  Daniel H. Huson,et al.  Phylogenetic Networks - Concepts, Algorithms and Applications , 2011 .

[14]  J. A. Bondy,et al.  Graph Theory , 2008, Graduate Texts in Mathematics.

[15]  Charles Semple,et al.  Reticulation-Visible Networks , 2015, Adv. Appl. Math..

[16]  Jittat Fakcharoenphol,et al.  A faster algorithm for the tree containment problem for binary nearly stable phylogenetic networks , 2015, 2015 12th International Joint Conference on Computer Science and Software Engineering (JCSSE).

[17]  Leo van Iersel,et al.  Locating a tree in a phylogenetic network , 2010, Inf. Process. Lett..

[18]  Gabriel Cardona,et al.  A distance metric for a class of tree-sibling phylogenetic networks , 2008, Bioinform..

[19]  Éva Tardos,et al.  Algorithm design , 2005 .

[20]  D. Posada,et al.  Characterization of Reticulate Networks Based on the Coalescent with Recombination , 2008, Molecular biology and evolution.

[21]  Luay Nakhleh,et al.  Phylogenetic networks , 2004 .

[22]  Kaizhong Zhang,et al.  Perfect phylogenetic networks with recombination , 2001, J. Comput. Biol..

[23]  Simone Linz,et al.  Phylogenetic Networks that Display a Tree Twice , 2014, Bulletin of mathematical biology.

[24]  Louxin Zhang,et al.  Solving the Tree Containment Problem for Genetically Stable Networks in Quadratic Time , 2015, IWOCA.

[25]  Mike A. Steel,et al.  Phylogeny - discrete and random processes in evolution , 2016, CBMS-NSF regional conference series in applied mathematics.

[26]  Louxin Zhang,et al.  A program for verification of phylogenetic network models , 2016, Bioinform..