On the impact of uncertain gene tree rooting on duplication-transfer-loss reconciliation

BackgroundDuplication-Transfer-Loss (DTL) reconciliation is a powerful and increasingly popular technique for studying the evolution of microbial gene families. DTL reconciliation requires the use of rooted gene trees to perform the reconciliation with the species tree, and the standard technique for rooting gene trees is to assign a root that results in the minimum reconciliation cost across all rootings of that gene tree. However, even though it is well understood that many gene trees have multiple optimal roots, only a single optimal root is randomly chosen to create the rooted gene tree and perform the reconciliation. This remains an important overlooked and unaddressed problem in DTL reconciliation, leading to incorrect evolutionary inferences. In this work, we perform an in-depth analysis of the impact of uncertain gene tree rooting on the computed DTL reconciliation and provide the first computational tools to quantify and negate the impact of gene tree rooting uncertainty on DTL reconciliation.ResultsOur analysis of a large data set of over 4500 gene families from 100 species shows that a large fraction of gene trees have multiple optimal rootings, that these multiple roots often, but not always, appear closely clustered together in the same region of the gene tree, that many aspects of the reconciliation remain conserved across the multiple rootings, that gene tree error has a profound impact on the prevalence and structure of multiple optimal rootings, and that there are specific interesting patterns in the reconciliation of those gene trees that have multiple optimal roots.ConclusionsOur results show that unrooted gene trees can be meaningfully reconciled and high-quality evolutionary information can be obtained from them even after accounting for multiple optimal rootings. In addition, the techniques and tools introduced in this paper make it possible to systematically avoid incorrect evolutionary inferences caused by incorrect or uncertain gene tree rooting. These tools have been implemented in the phylogenetic reconciliation software package RANGER-DTL 2.0, freely available from http://compbio.engr.uconn.edu/software/RANGER-DTL/.

[1]  Jerzy Tiuryn,et al.  Unrooted Tree Reconciliation: A Unified Approach , 2013, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[2]  Manolis Kellis,et al.  Improved gene tree error correction in the presence of horizontal gene transfer , 2014, Bioinform..

[3]  Sophie S Abby,et al.  Phylogenetic modeling of lateral gene transfer reconstructs the pattern and relative timing of speciations , 2012, Proceedings of the National Academy of Sciences.

[4]  Michael T. Hallett,et al.  Simultaneous Identification of Duplications and Lateral Gene Transfers , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[5]  Ping Xu,et al.  Isolation and characterization of an ABC-transporter cDNA clone from wheat (Triticum aestivum L.) , 2009, Molecular Biology.

[6]  Manolis Kellis,et al.  Efficient algorithms for the reconciliation problem with gene duplication, horizontal transfer and loss , 2012, Bioinform..

[7]  K. Gorbunov,et al.  [Reconstructing genes evolution along a species tree]. , 2009, Molekuliarnaia biologiia.

[8]  Dannie Durand,et al.  Inferring duplications, losses, transfers and incomplete lineage sorting with nonbinary species trees , 2012, Bioinform..

[9]  Vincent Berry,et al.  An Efficient Algorithm for Gene/Species Trees Parsimonious Reconciliation with Losses, Duplications and Transfers , 2010, RECOMB-CG.

[10]  Alexandros Stamatakis,et al.  RAxML-VI-HPC: maximum likelihood-based phylogenetic analyses with thousands of taxa and mixed models , 2006, Bioinform..

[11]  Yann Ponty,et al.  ecceTERA: comprehensive gene tree-species tree reconciliation using parsimony , 2016, Bioinform..

[12]  Lawrence A. David,et al.  Rapid evolutionary innovation during an Archaean genetic expansion , 2011, Nature.

[13]  V. A. Lyubetsky,et al.  Reconstructing the evolution of genes along the species tree , 2009, Molecular Biology.

[14]  Vincent Berry,et al.  Representing a Set of Reconciliations in a Compact Way , 2013, J. Bioinform. Comput. Biol..

[15]  Marie-France Sagot,et al.  Robustness of the Parsimonious Reconciliation Method in Cophylogeny , 2016, AlCoB.

[16]  Ran Libeskind-Hadas,et al.  On the Computational Complexity of the Reticulate Cophylogeny Reconstruction Problem , 2009, J. Comput. Biol..

[17]  Joel Sjöstrand,et al.  A Bayesian method for analyzing lateral gene transfer. , 2014, Systematic biology.

[18]  Ran Libeskind-Hadas,et al.  The Cophylogeny Reconstruction Problem Is NP-Complete , 2011, J. Comput. Biol..

[19]  Fred R. McMorris,et al.  A View of Some Consensus Methods for Trees , 1983 .

[20]  Ran Libeskind-Hadas,et al.  Pareto-optimal phylogenetic tree reconciliation , 2014, Bioinform..

[21]  Manolis Kellis,et al.  Reconciliation Revisited: Handling Multiple Optima when Reconciling with Duplication, Transfer, and Loss , 2013, J. Comput. Biol..

[22]  Gorbunov KIu,et al.  Reconstructing genes evolution along a species tree , 2009 .

[23]  Misagh Kordi,et al.  Exact Algorithms for Duplication-Transfer-Loss Reconciliation with Non-Binary Gene Trees , 2016, BCB.

[24]  Frank Rutschmann,et al.  Molecular dating of phylogenetic trees : A brief review of current methods that estimate divergence times , 2022 .