Solving Phylogenetic Network Containment Problems using Cherry-picking Sequences

Phylogenetic networks are used to represent evolutionary scenarios in biology and linguistics. To find the most probable scenario, it may be necessary to compare candidate networks, to distinguish different networks, and to see when one network is contained in another. In this paper, we introduce cherry-picking networks, a class of networks that can be reduced by a sequence of two graph operations. We show that some networks are uniquely determined by the sequences that reduce them---we call these the reconstructible cherry-picking networks, and further show that given two cherry-picking networks within the same reconstructible class, one is contained in the other if a sequence for the latter network reduces the former network. By restricting our scope to tree-child networks, we show that the converse of the above statement holds, thereby showing that {\sc Network Containment}, the problem of checking whether a network is contained in another, can be solved in linear time for tree-child networks. We implement this algorithm in Python and show that the linear-time theoretical bound on the input size is achievable in practice. Lastly, we provide a linear time algorithm for deciding whether two tree-child networks are isomorphic.

[1]  Ge Xia,et al.  Seeing the trees and their branches in the network is hard , 2007, Theor. Comput. Sci..

[2]  Gabriel Cardona,et al.  Comparison of Tree-Child Phylogenetic Networks , 2009, TCBB.

[3]  Charles Semple,et al.  A class of phylogenetic networks reconstructable from ancestral profiles. , 2019, Mathematical biosciences.

[4]  Mike A. Steel,et al.  Which Phylogenetic Networks are Merely Trees with Additional Arcs? , 2015, Systematic biology.

[5]  Charles Semple,et al.  Determining phylogenetic networks from inter-taxa distances , 2016, Journal of mathematical biology.

[6]  Simone Linz,et al.  Cherry Picking: A Characterization of the Temporal Hybridization Number for a Set of Phylogenies , 2013, Bulletin of Mathematical Biology.

[7]  D. Morrison,et al.  Networks in phylogenetic analysis: new tools for population biology. , 2005, International journal for parasitology.

[8]  Steven Kelk,et al.  Networks: expanding evolutionary thinking. , 2013, Trends in genetics : TIG.

[9]  Louxin Zhang,et al.  Solving the Tree Containment Problem for Genetically Stable Networks in Quadratic Time , 2015, IWOCA.

[10]  Leo van Iersel,et al.  Deciding the existence of a cherry-picking sequence is hard on two trees , 2017, Discret. Appl. Math..

[11]  Mark Jones,et al.  Combining Networks Using Cherry Picking Sequences , 2019, AlCoB.

[12]  Mark Jones,et al.  Reconstructing Tree-Child Networks from Reticulate-Edge-Deleted Subnetworks , 2018, Bulletin of Mathematical Biology.

[13]  Simone Linz,et al.  Attaching leaves and picking cherries to characterise the hybridisation number for a set of phylogenies , 2017, Adv. Appl. Math..

[14]  Louxin Zhang,et al.  Solving the tree containment problem in linear time for nearly stable phylogenetic networks , 2017, Discret. Appl. Math..

[15]  Leo van Iersel,et al.  Locating a tree in a phylogenetic network , 2010, Inf. Process. Lett..