Estimating hybridization in the presence of coalescence using phylogenetic intraspecific sampling

Background: A well-known characteristic of multi-locus data is that each locus has its own phylogenetic history, which may differ substantially from the overall phylogenetic history of the species. Although the possibility that this discordance arises through incomplete lineage sorting is often incorporated in models for the species-level phylogeny, it is much less common for hybridization to also be formally included in such models.

Results: We have modified the evolutionary model of Meng and Kubatko (2009) to incorporate intraspecific sampling of multiple individuals. The modified model allows estimation of speciation times and of the times of hybridization events, and provides a test for hybridization in the presence of incomplete lineage sorting. We have also utilized a more efficient algorithm for obtaining our estimates. Using simulations, we demonstrate that our approach performs well under conditions motivated by an empirical data set for Sistrurus rattlesnakes in which putative hybridization has occurred. We further demonstrate that the method accurately detects the signature of hybridization in the data, whereas this signal may be obscured when species-tree inference methods that ignore hybridization are used.

Conclusions: Our approach is shown to be powerful in detecting hybridization when it is present. When we apply it to the Sistrurus data, we find no evidence of hybridization; instead, it appears that putative hybrid snakes in Missouri are most likely pure S. catenatus tergeminus in origin, which has significant conservation implications.

[1]  A. Hobolth,et al.  Genomic Relationships and Speciation Times of Human, Chimpanzee, and Gorilla Inferred from a Coalescent Hidden Markov Model , 2006, PLoS genetics.

[2]  Erik Bloomquist,et al.  Inferring species-level phylogenies and taxonomic distinctiveness using multilocus data in Sistrurus rattlesnakes. , 2011, Systematic biology.

[3]  Loren H. Rieseberg,et al.  Hybrid Origins of Plant Species , 1997 .

[4]  Luay Nakhleh,et al.  Identifiability Issues in Phylogeny-Based Detection of Horizontal Gene Transfer , 2006, Comparative Genomics.

[5]  Laura Salter Kubatko,et al.  Detecting hybrid speciation in the presence of incomplete lineage sorting using gene tree incongruence: a model. , 2009, Theoretical population biology.

[6]  D. Futuyma,et al.  Hybrid zones and the evolutionary process , 1995 .

[7]  N. Rosenberg,et al.  Discordance of Species Trees with Their Most Likely Gene Trees , 2006, PLoS genetics.

[8]  A. Drummond,et al.  Bayesian Inference of Species Trees from Multilocus Data , 2009, Molecular biology and evolution.

[9]  Andrew Rambaut,et al.  Seq-Gen: an application for the Monte Carlo simulation of DNA sequence evolution along phylogenetic trees , 1997, Comput. Appl. Biosci..

[10]  H. Gibbs,et al.  Genetic identity of endangered massasauga rattlesnakes (Sistrurus sp.) in Missouri , 2011, Conservation Genetics.

[11]  Temple F. Smith,et al.  Reconstruction of ancient molecular phylogeny. , 1996, Molecular phylogenetics and evolution.

[12]  K. Liang,et al.  Asymptotic Properties of Maximum Likelihood Estimators and Likelihood Ratio Tests under Nonstandard Conditions , 1987 .

[13]  S. Edwards IS A NEW AND GENERAL THEORY OF MOLECULAR SYSTEMATICS EMERGING? , 2009, Evolution; international journal of organic evolution.

[14]  Luay Nakhleh,et al.  Coalescent histories on phylogenetic networks and detection of hybridization despite incomplete lineage sorting. , 2011, Systematic biology.

[15]  J. Kingman On the genealogy of large populations , 1982, Journal of Applied Probability.

[16]  Laura Kubatko,et al.  Estimating species trees : practical and theoretical aspects , 2010 .

[17]  M. Nei,et al.  Relationships between gene trees and species trees. , 1988, Molecular biology and evolution.

[18]  John P. Huelsenbeck,et al.  MrBayes 3: Bayesian phylogenetic inference under mixed models , 2003, Bioinform..

[19]  Richard R. Hudson,et al.  Generating samples under a Wright-Fisher neutral model of genetic variation , 2002, Bioinform..

[20]  Laura Salter Kubatko,et al.  STEM: species tree estimation using maximum likelihood for gene trees under coalescence , 2009, Bioinform..

[21]  Patricia A. McLenachan,et al.  A Statistical Approach for Distinguishing Hybridization and Incomplete Lineage Sorting , 2009, The American Naturalist.

[22]  C. J-F,et al.  THE COALESCENT , 1980 .

[23]  D. Swofford PAUP*: Phylogenetic analysis using parsimony (*and other methods), Version 4.0b10 , 2002 .

[24]  L. M. Klauber,et al.  The Rattlesnakes, Genera Sistrurus and Crotalus: A Study in Zoogeography and Evolution , 1940 .

[25]  C. Simon,et al.  Differentiating between hypotheses of lineage sorting and introgression in New Zealand alpine cicadas (Maoricicada Dugdale). , 2006, Systematic biology.

[26]  L. Nakhleh Evolutionary Phylogenetic Networks: Models and Issues , 2010 .

[27]  M. P. Cummings,et al.  PAUP* Phylogenetic analysis using parsimony (*and other methods) Version 4 , 2000 .

[28]  J. Kingman On the genealogy of large populations , 1982 .

[29]  Noah A Rosenberg,et al.  THE SHAPES OF NEUTRAL GENE GENEALOGIES IN TWO SPECIES: PROBABILITIES OF MONOPHYLY, PARAPHYLY, AND POLYPHYLY IN A COALESCENT MODEL , 2003, Evolution; international journal of organic evolution.

[30]  J. Mallet Hybrid speciation , 2007, Nature.

[31]  L. Kubatko Identifying hybridization events in the presence of coalescence via model selection. , 2009, Systematic biology.

[32]  Luay Nakhleh,et al.  Species Tree Inference by Minimizing Deep Coalescences , 2009, PLoS Comput. Biol..

[33]  D. Pearl,et al.  Species trees from gene trees: reconstructing Bayesian posterior distributions of a species phylogeny using estimated gene tree distributions. , 2007, Systematic biology.

[34]  H. Gibbs,et al.  Identification of single copy nuclear DNA markers for North American pit vipers , 2010, Molecular ecology resources.

[35]  Laura S Kubatko,et al.  Estimating species trees using approximate Bayesian computation. , 2011, Molecular phylogenetics and evolution.

[36]  Loren H. Rieseberg,et al.  Introgression and Its Consequences in Plants , 1993 .