Generalized Fitch graphs II: Sets of binary relations that are explained by edge-labeled trees

Fitch graphs $G=(X,E)$ are digraphs that are explained by $\{\emptyset, 1\}$-edge-labeled rooted trees $T$ with leaf set $X$: there is an arc $(x,y) \in E$ if and only if the unique path in $T$ that connects the last common ancestor $\mathrm{lca}(x,y)$ of $x$ and $y$ with $y$ contains at least one edge with label "1". In practice, Fitch graphs represent xenology relations, i.e., pairs of genes $x$ and $y$ for which a horizontal gene transfer happened along the path from $\mathrm{lca}(x,y)$ to $y$. In this contribution, we generalize the concept of Fitch graphs and consider trees $T$ that are equipped with edge-labeling $\lambda: E\to \mathcal{P}(M)$ that assigns to each edge a subset $M'\subseteq M$ of colors. Given such a tree, we can derive a map $\varepsilon_{(T,\lambda)}$ (or equivalently a set of not necessarily disjoint binary relations), such that $i\in \varepsilon_{(T,\lambda)}(x,y)$ (or equivalently $(x,y)\in R_i$) with $x,y\in X$, if and only if there is at least one edge with color $i$ from $\mathrm{lca}(x,y)$ to $y$. The central question considered here: Is a given map $\varepsilon$ a Fitch map, i.e., is there there an edge-labeled tree $(T,\lambda)$ with $\varepsilon_{(T,\lambda)} = \varepsilon$, and thus explains $\varepsilon$? Here, we provide a characterization of Fitch maps in terms of certain neighborhoods and forbidden submaps. Further restrictions of Fitch maps are considered. Moreover, we show that the least-resolved tree explaining a Fitch map is unique (up to isomorphism). In addition, we provide a polynomial-time algorithm to decide whether $\varepsilon$ is a Fitch map and, in the affirmative case, to construct the (up to isomorphism) unique least-resolved tree $(T^*,\lambda^*)$ that explains $\varepsilon$.

[1]  Katharina T. Huber,et al.  Three-Way Symbolic Tree-Maps and Ultrametrics , 2017, J. Classif..

[2]  Yangjing Long,et al.  Reconstructing Unrooted Phylogenetic Trees from Symbolic Ternary Metrics , 2018, Bulletin of mathematical biology.

[3]  Carsten R. Seemann,et al.  Alternative characterizations of Fitch’s xenology relation , 2018, Journal of Mathematical Biology.

[4]  Peter F. Stadler,et al.  Reconstructing gene trees from Fitch’s xenology relation , 2017, Journal of Mathematical Biology.

[5]  Christophe Paul,et al.  Fully dynamic recognition algorithm and certificate for directed cographs , 2006, Discret. Appl. Math..

[6]  Tero Harju,et al.  Structure and organization , 2014 .

[7]  Mike Steel,et al.  Closure operations in phylogenetics. , 2007, Mathematical biosciences.

[8]  Tiziana Calamoneri,et al.  Pairwise Compatibility Graphs: A Survey , 2016, SIAM Rev..

[9]  Tero Harju,et al.  Characterization and Complexity of Uniformly Non Primitive Labeled 2-Structures , 1996, Theor. Comput. Sci..

[10]  M. Steel,et al.  Extension Operations on Sets of Leaf-Labeled Trees , 1995 .

[11]  Michael R. Fellows,et al.  Leaf Powers and Their Properties: Using the Trees , 2008, ISAAC.

[12]  Andrzej Ehrenfeucht,et al.  Theory of 2-Structures, Part II: Representation Through Labeled Tree Families , 1990, Theor. Comput. Sci..

[13]  Nadia El-Mabrouk,et al.  Orthology and paralogy constraints: satisfiability and consistency , 2014, BMC Genomics.

[14]  Sonja J. Prohaska,et al.  Proteinortho: Detection of (Co-)orthologs in large-scale analysis , 2011, BMC Bioinformatics.

[15]  Peter F. Stadler,et al.  A Short Note on Undirected Fitch Graphs , 2017, Art Discret. Appl. Math..

[16]  G. Douglas,et al.  Current and Promising Approaches to Identify Horizontal Gene Transfer Events in Metagenomes , 2019, Genome biology and evolution.

[17]  P. Stadler,et al.  Inferring phylogenetic trees from the knowledge of rare evolutionary events , 2017, Journal of Mathematical Biology.

[18]  Andreas W. M. Dress,et al.  Recovering Symbolically Dated, Rooted Trees from Symbolic Ultrametrics , 1998 .

[19]  Marc Hellmuth,et al.  Generalized Fitch Graphs: Edge-labeled Graphs that are explained by Edge-labeled Trees , 2018, Discret. Appl. Math..

[20]  Jeroniza Nunes Marchaukoski,et al.  New Tools in Orthology Analysis: A Brief Review of Promising Perspectives , 2017, Front. Genet..

[21]  Marc Hellmuth,et al.  The matroid structure of representative triple sets and triple-closure computation , 2017, Eur. J. Comb..

[22]  Derek G. Corneil,et al.  Complement reducible graphs , 1981, Discret. Appl. Math..

[23]  Ross M. McConnell,et al.  Linear-time modular decomposition of directed graphs , 2005, Discret. Appl. Math..

[24]  W. Fitch Homology a personal view on some of the problems. , 2000, Trends in genetics : TIG.

[25]  H. A. Jung,et al.  On a class of posets and the corresponding comparability graphs , 1978, J. Comb. Theory B.

[26]  Peter F. Stadler,et al.  Generalized Fitch Graphs III: Symmetrized Fitch maps and Sets of Symmetric Binary Relations that are explained by Unrooted Edge-labeled Trees , 2020, ArXiv.

[27]  Katharina T. Huber,et al.  Beyond Representing Orthology Relations by Trees , 2016, Algorithmica.

[28]  Martin Middendorf,et al.  Phylogenomics with paralogs , 2015, Proceedings of the National Academy of Sciences.

[29]  Peter F. Stadler,et al.  The mathematics of xenology: di-cographs, symbolic ultrametrics, 2-structures and tree-representable systems of binary relations , 2016, Journal of Mathematical Biology.

[30]  Christophe Dessimoz,et al.  Inferring Horizontal Gene Transfer , 2015, PLoS Comput. Biol..

[31]  Christophe Paul,et al.  Fully dynamic recognition algorithm and certificate for directed cographs , 2004, Graph-Theoretic Concepts in Computer Science.

[32]  Dulce I. Valdivia,et al.  Best match graphs , 2019, Journal of Mathematical Biology.

[33]  Andrzej Ehrenfeucht,et al.  Theory of 2-Structures, Part I: Clans, Basic Subclasses, and Morphisms , 1990, Theor. Comput. Sci..

[34]  Wing-Kin Sung Algorithms in Bioinformatics: A Practical Introduction , 2020 .

[35]  Katharina T. Huber,et al.  Orthology relations, symbolic ultrametrics, and cographs , 2013, Journal of mathematical biology.