A New Paradigm for Identifying Reconciliation-Scenario Altering Mutations Conferring Environmental Adaptation

An important goal in microbial computational genomics is to identify crucial events in the evolution of a gene that severely alter the duplication, loss and mobilization patterns of the gene within the genomes in which it disseminates. In this paper, we formalize this microbiological goal as a new pattern-matching problem in the domain of Gene tree and Species tree reconciliation, denoted "Reconciliation-Scenario Altering Mutation (RSAM) Discovery". We propose an $O(m\cdot n\cdot k)$ time algorithm to solve this new problem, where $m$ and $n$ are the number of vertices of the input Gene tree and Species tree, respectively, and $k$ is a user-specified parameter that bounds from above the number of optimal solutions of interest. The algorithm first constructs a hypergraph representing the $k$ highest scoring reconciliation scenarios between the given Gene tree and Species tree, and then interrogates this hypergraph for subtrees matching a pre-specified RSAM Pattern. Our algorithm is optimal in the sense that the number of hypernodes in the hypergraph can be lower bounded by $\Omega(m\cdot n\cdot k)$. We implement the new algorithm as a tool, called RSAM-finder, and demonstrate its application to -the identification of RSAMs in toxins and drug resistance elements across a dataset spanning hundreds of species.

[1]  Ran Libeskind-Hadas,et al.  On the Computational Complexity of the Reticulate Cophylogeny Reconstruction Problem , 2009, J. Comput. Biol..

[2]  Michael Y. Galperin,et al.  The COG database: a tool for genome-scale analysis of protein functions and evolution , 2000, Nucleic Acids Res..

[3]  Cédric Chauve,et al.  Space of Gene/Species Trees Reconciliations and Parsimonious Models , 2009, J. Comput. Biol..

[4]  P. Nordmann,et al.  Integron Mobilization Unit as a Source of Mobility of Antibiotic Resistance Genes , 2008, Antimicrobial Agents and Chemotherapy.

[5]  Vincent Berry,et al.  A geography‐aware reconciliation method to investigate diversification patterns in host/parasite interactions , 2018, Molecular ecology resources.

[6]  K. Bush,et al.  Past and Present Perspectives on β-Lactamases , 2018, Antimicrobial Agents and Chemotherapy.

[7]  Cédric Chauve,et al.  An Efficient Method for Exploring the Space of Gene Tree/Species Tree Reconciliations in a Probabilistic Framework , 2012, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[8]  Nikos Kyrpides,et al.  Genomes OnLine Database (GOLD) v.6: data updates and feature enhancements , 2016, Nucleic Acids Res..

[9]  Maureen A. O’Malley,et al.  Prokaryotic evolution and the tree of life are two different things , 2009, Biology Direct.

[10]  Manolis Kellis,et al.  Reconciliation Revisited: Handling Multiple Optima when Reconciling with Duplication, Transfer, and Loss , 2013, J. Comput. Biol..

[11]  Robert Patro,et al.  Predicting protein interactions via parsimonious network history inference , 2013, Bioinform..

[12]  A. Grossman,et al.  A conserved anti‐repressor controls horizontal gene transfer by proteolysis , 2008, Molecular microbiology.

[13]  Mathias W. Pletz,et al.  In silico serine β-lactamases analysis reveals a huge potential resistome in environmental and pathogenic species , 2017, Scientific Reports.

[14]  Vincent Ranwez,et al.  A fast method for calculating reliable event supports in tree reconciliations via Pareto optimality , 2015, BMC Bioinformatics.

[15]  Michael T. Hallett,et al.  Simultaneous Identification of Duplications and Lateral Gene Transfers , 2011, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[16]  Fabian Sievers,et al.  Clustal Omega for making accurate alignments of many protein sequences , 2018, Protein science : a publication of the Protein Society.

[17]  Stephen H. Bryant,et al.  CD-Search: protein domain annotations on the fly , 2004, Nucleic Acids Res..

[18]  Lawrence A. David,et al.  Rapid evolutionary innovation during an Archaean genetic expansion , 2011, Nature.

[19]  L. Van Melderen,et al.  Bacterial Toxin–Antitoxin Systems: More Than Selfish Entities? , 2009, PLoS genetics.

[20]  Pierluigi Crescenzi,et al.  EUCALYPT: efficient tree reconciliation enumerator , 2015, Algorithms for Molecular Biology.

[21]  David Chiang,et al.  Better k-best Parsing , 2005, IWPT.

[22]  M. Charleston,et al.  Jungles: a new solution to the host/parasite phylogeny reconciliation problem. , 1998, Mathematical biosciences.

[23]  Wilfred W. Li,et al.  MEME: discovering and analyzing DNA and protein sequence motifs , 2006, Nucleic Acids Res..

[24]  Ali Tofigh,et al.  Using Trees to Capture Reticulate Evolution : Lateral Gene Transfers and Cancer Progression , 2009 .

[25]  P. Bork,et al.  ETE 3: Reconstruction, Analysis, and Visualization of Phylogenomic Data , 2016, Molecular biology and evolution.

[26]  Daniel Merkle,et al.  A parameter-adaptive dynamic programming approach for inferring cophylogenies , 2010, BMC Bioinformatics.

[27]  L. Alksne,et al.  Expression of the AsbA1, OXA-12, and AsbM1 beta-lactamases in Aeromonas jandaei AER 14 is coordinated by a two-component regulon , 1997, Journal of bacteriology.

[28]  M. P. Montanari,et al.  Borderline methicillin-susceptible Staphylococcus aureus strains have more in common than reduced susceptibility to penicillinase-resistant penicillins , 1996, Antimicrobial agents and chemotherapy.

[29]  Naomi McCallion,et al.  Outbreaks of extended spectrum beta-lactamase-producing Enterobacteriaceae in neonatal intensive care units: a systematic review , 2015, Archives of Disease in Childhood: Fetal and Neonatal Edition.

[30]  Sergei Vakulenko,et al.  Class D β-lactamases do exist in Gram-positive bacteria , 2015, Nature chemical biology.

[31]  Jeet Sukumaran,et al.  DendroPy: a Python library for phylogenetic computing , 2010, Bioinform..

[32]  Katharina T. Huber,et al.  ape 3.0: New tools for distance-based phylogenetics and evolutionary analysis in R , 2012, Bioinform..

[33]  Vincent Berry,et al.  Representing a Set of Reconciliations in a Compact Way , 2013, J. Bioinform. Comput. Biol..

[34]  Dannie Durand,et al.  Inferring duplications, losses, transfers and incomplete lineage sorting with nonbinary species trees , 2012, Bioinform..

[35]  F. Navarro,et al.  Acquisition and horizontal diffusion of beta-lactam resistance among clinically relevant microorganisms. , 2006, International microbiology : the official journal of the Spanish Society for Microbiology.

[36]  Manolis Kellis,et al.  Efficient algorithms for the reconciliation problem with gene duplication, horizontal transfer and loss , 2012, Bioinform..

[37]  Damian Szklarczyk,et al.  The STRING database in 2017: quality-controlled protein–protein association networks, made broadly accessible , 2016, Nucleic Acids Res..