Yule-generated trees constrained by node imbalance.

The Yule process generates a class of binary trees which is fundamental to population genetic models and other applications in evolutionary biology. In this paper, we introduce a family of sub-classes of ranked trees, called Ω-trees, which are characterized by imbalance of internal nodes. The degree of imbalance is defined by an integer 0 ≤ ω. For caterpillars, the extreme case of unbalanced trees, ω = 0. Under models of neutral evolution, for instance the Yule model, trees with small ω are unlikely to occur by chance. Indeed, imbalance can be a signature of permanent selection pressure, such as observable in the genealogies of certain pathogens. From a mathematical point of view it is interesting to observe that the space of Ω-trees maintains several statistical invariants although it is drastically reduced in size compared to the space of unconstrained Yule trees. Using generating functions, we study here some basic combinatorial properties of Ω-trees. We focus on the distribution of the number of subtrees with two leaves. We show that expectation and variance of this distribution match those for unconstrained trees already for very small values of ω.

[1]  Oskar Hallatschek,et al.  Genealogies of rapidly adapting populations , 2012, Proceedings of the National Academy of Sciences.

[2]  E. Harding The probabilities of rooted tree-shapes generated by random bifurcation , 1971, Advances in Applied Probability.

[3]  M. Steel,et al.  Distributions of cherries for two models of trees. , 2000, Mathematical biosciences.

[4]  Philippe Flajolet,et al.  Analytic Combinatorics , 2009 .

[5]  Haipeng Li,et al.  Coalescent Tree Imbalance and a Simple Test for Selective Sweeps Based on Microsatellite Variation , 2013, PLoS Comput. Biol..

[6]  C. J-F,et al.  THE COALESCENT , 1980 .

[7]  Justin C. Fay,et al.  Hitchhiking under positive Darwinian selection. , 2000, Genetics.

[8]  Olivier François,et al.  Which random processes describe the tree of life? A large-scale study of phylogenetic tree imbalance. , 2006, Systematic biology.

[9]  M. J. Sackin,et al.  “Good” and “Bad” Phenograms , 1972 .

[10]  D. Aldous PROBABILITY DISTRIBUTIONS ON CLADOGRAMS , 1996 .

[11]  Mark Kirkpatrick,et al.  SHAPE OF A PHYLOGENETIC TREE , 1993 .

[12]  Filippo Disanto,et al.  Exact enumeration of cherries and pitchforks in ranked trees under the coalescent model. , 2011, Mathematical biosciences.

[13]  Richard R. Hudson,et al.  Generating samples under a Wright-Fisher neutral model of genetic variation , 2002, Bioinform..

[14]  J. H. M. Wedderburn The Functional Equation g(x 2 ) = 2 αx + [g(x)] 2 , 1922 .

[15]  Noah A. Rosenberg,et al.  Counting Coalescent Histories , 2007, J. Comput. Biol..

[16]  F. Tajima Evolutionary relationship of DNA sequences in finite populations. , 1983, Genetics.

[17]  D. Aldous Stochastic models and descriptive statistics for phylogenetic trees, from Yule to today , 2001 .

[18]  Arne Ø. Mooers,et al.  Inferring Evolutionary Process from Phylogenetic Tree Shape , 1997, The Quarterly Review of Biology.

[19]  O. Pybus,et al.  Unifying the Epidemiological and Evolutionary Dynamics of Pathogens , 2004, Science.

[20]  Olivier François,et al.  On statistical tests of phylogenetic tree imbalance: the Sackin and other indices revisited. , 2005, Mathematical biosciences.

[21]  M Steel,et al.  Properties of phylogenetic trees generated by Yule-type speciation models. , 2001, Mathematical biosciences.

[22]  A. Mooers,et al.  Using tree shape. , 2002, Systematic biology.

[23]  G. Yule,et al.  A Mathematical Theory of Evolution Based on the Conclusions of Dr. J. C. Willis, F.R.S. , 1925 .

[24]  G. Yule,et al.  A Mathematical Theory of Evolution, Based on the Conclusions of Dr. J. C. Willis, F.R.S. , 1925 .

[25]  Noah A. Rosenberg,et al.  The Mean and Variance of the Numbers of r-Pronged Nodes and r-Caterpillars in Yule-Generated Genealogical Trees , 2006 .

[26]  Haipeng Li,et al.  A new test for detecting recent positive selection that is free from the confounding impacts of demography. , 2011, Molecular biology and evolution.

[27]  M. Steel,et al.  Clades, clans, and reciprocal monophyly under neutral evolutionary models. , 2011, Theoretical population biology.