Phylogenetic Properties of RNA Viruses

A new word, phylodynamics, was coined to emphasize the interconnection between phylogenetic properties, as observed for instance in a phylogenetic tree, and the epidemic dynamics of viruses, where selection, mediated by the host immune response, and transmission play a crucial role. The challenges faced when investigating the evolution of RNA viruses call for a virtuous loop of data collection, data analysis and modeling. This already resulted both in the collection of massive sequences databases and in the formulation of hypotheses on the main mechanisms driving qualitative differences observed in the (reconstructed) evolutionary patterns of different RNA viruses. Qualitatively, it has been observed that selection driven by the host immune response induces an uneven survival ability among co-existing strains. As a consequence, the imbalance level of the phylogenetic tree is manifestly more pronounced if compared to the case when the interaction with the host immune system does not play a central role in the evolutive dynamics. While many imbalance metrics have been introduced, reliable methods to discriminate in a quantitative way different level of imbalance are still lacking. In our work, we reconstruct and analyze the phylogenetic trees of six RNA viruses, with a special emphasis on the human Influenza A virus, due to its relevance for vaccine preparation as well as for the theoretical challenges it poses due to its peculiar evolutionary dynamics. We focus in particular on topological properties. We point out the limitation featured by standard imbalance metrics, and we introduce a new methodology with which we assign the correct imbalance level of the phylogenetic trees, in agreement with the phylodynamics of the viruses. Our thorough quantitative analysis allows for a deeper understanding of the evolutionary dynamics of the considered RNA viruses, which is crucial in order to provide a valuable framework for a quantitative assessment of theoretical predictions.

[1]  H. Oshitani,et al.  Origin of measles virus: divergence from rinderpest virus between the 11th and 12th centuries , 2010, Virology Journal.

[2]  T. Jukes CHAPTER 24 – Evolution of Protein Molecules , 1969 .

[3]  M. Slatkin,et al.  SEARCHING FOR EVOLUTIONARY PATTERNS IN THE SHAPE OF A PHYLOGENETIC TREE , 1993, Evolution; international journal of organic evolution.

[4]  Vittorio Loreto,et al.  A stochastic local search algorithm for distance-based phylogeny reconstruction. , 2010, Molecular biology and evolution.

[5]  S. Elena,et al.  Subclonal components of consensus fitness in an RNA virus clone , 1994, Journal of virology.

[6]  M. Uhlén,et al.  Accurate reconstruction of a known HIV-1 transmission history by phylogenetic tree analysis. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[7]  T. Tatusova,et al.  The Influenza Virus Resource at the National Center for Biotechnology Information , 2007, Journal of Virology.

[8]  Michael Stich,et al.  Collective properties of evolving molecular quasispecies , 2007, BMC Evolutionary Biology.

[9]  Francesca Tria,et al.  A minimal stochastic model for influenza evolution , 2005, q-bio/0505035.

[10]  S. Jeffery Evolution of Protein Molecules , 1979 .

[11]  W. Fitch,et al.  Effects of passage history and sampling bias on phylogenetic reconstruction of human influenza A evolution. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[12]  M. Eigen Selforganization of matter and the evolution of biological macromolecules , 1971, Naturwissenschaften.

[13]  C. Viboud,et al.  Explorer The genomic and epidemiological dynamics of human influenza A virus , 2016 .

[14]  Vittorio Loreto,et al.  On the Accuracy of Language Trees , 2011, PloS one.

[15]  Vittorio Loreto,et al.  Distance-based phylogenetic algorithms: New insights and applications , 2010 .

[16]  R. Webster,et al.  Evolution and ecology of influenza A viruses. , 1992, Current topics in microbiology and immunology.

[17]  Vittorio Loreto,et al.  A Fast Noise Reduction Driven Distance-Based Phylogenetic Algorithm , 2010, International Conference on Bioinformatics & Computational Biology.

[18]  W. Fitch,et al.  Predicting the evolution of human influenza A. , 1999, Science.

[19]  Elizabeth C. Theil,et al.  Epochal Evolution Shapes the Phylodynamics of Interpandemic Influenza A (H3N2) in Humans , 2006, Science.

[20]  Olivier François,et al.  Which random processes describe the tree of life? A large-scale study of phylogenetic tree imbalance. , 2006, Systematic biology.

[21]  E. Domingo,et al.  Resistance of virus to extinction on bottleneck passages: Study of a decaying and fluctuating pattern of fitness loss , 2003, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Simon Whelan,et al.  Pandit: a database of protein and associated nucleotide domains with inferred trees , 2003, Bioinform..

[23]  G. Yule,et al.  A Mathematical Theory of Evolution, Based on the Conclusions of Dr. J. C. Willis, F.R.S. , 1925 .

[24]  C. Kuiken,et al.  HIV sequence databases. , 2003, AIDS reviews.

[25]  Bryan T Grenfell,et al.  Dynamics and selection of many-strain pathogens , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[26]  G. Yule,et al.  Some Statistics of Evolution and Geographical Distribution in Plants and Animals, and their Significance. , 1922, Nature.

[27]  Emilio Hernández-García,et al.  Universal Scaling in the Branching of the Tree of Life , 2008, PloS one.

[28]  Andy Purvis,et al.  Power of eight tree shape statistics to detect nonrandom diversification: a comparison by simulation of two models of cladogenesis. , 2002, Systematic biology.

[29]  Edward C. Holmes,et al.  Discovering the Phylodynamics of RNA Viruses , 2009, PLoS Comput. Biol..

[30]  Frederick A Matsen,et al.  A geometric approach to tree shape statistics. , 2005, Systematic biology.

[31]  Frederick Albert Matsen IV,et al.  Optimization Over a Class of Tree Shape Statistics , 2006, IEEE/ACM Transactions on Computational Biology and Bioinformatics.

[32]  D. Aldous PROBABILITY DISTRIBUTIONS ON CLADOGRAMS , 1996 .

[33]  Arne Ø. Mooers,et al.  Inferring Evolutionary Process from Phylogenetic Tree Shape , 1997, The Quarterly Review of Biology.

[34]  O. Pybus,et al.  Unifying the Epidemiological and Evolutionary Dynamics of Pathogens , 2004, Science.

[35]  David L. Wheeler,et al.  GenBank , 2015, Nucleic Acids Res..

[36]  V. Eguíluz,et al.  Scaling properties of protein family phylogenies , 2011, BMC Evolutionary Biology.

[37]  W. Fitch,et al.  Positive selection on the H3 hemagglutinin gene of human influenza virus A. , 1999, Molecular biology and evolution.

[38]  N. Ferguson,et al.  Ecological and immunological determinants of influenza evolution , 2003, Nature.

[39]  M. Eigen,et al.  Molecular quasi-species. , 1988 .

[40]  Susanna C. Manrubia,et al.  Topological properties of phylogenetic trees in evolutionary models , 2009 .

[41]  D. Aldous Stochastic models and descriptive statistics for phylogenetic trees, from Yule to today , 2001 .

[42]  W. Fitch,et al.  Long term trends in the evolution of H(3) HA1 human influenza type A. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[43]  M. J. Sackin,et al.  “Good” and “Bad” Phenograms , 1972 .

[44]  Kazutaka Katoh,et al.  Multiple alignment of DNA sequences with MAFFT. , 2009, Methods in molecular biology.

[45]  Daniel J. Ford Probabilities on cladograms: introduction to the alpha model , 2005, math/0511246.

[46]  Emilio Hernández-García,et al.  An Age Dependent Branching Model for Macroevolution , 2012 .

[47]  A. Purvis,et al.  Phylogeny imbalance: taxonomic level matters. , 2002, Systematic biology.

[48]  Giuseppe Fusco,et al.  A new method for evaluating the shape of large phylogenies , 1995 .

[49]  Charles Weissmann,et al.  Nucleotide sequence heterogeneity of an RNA phage population , 1978, Cell.