Multispecies coalescent delimits structure, not species

Significance Despite its widespread application to the species delimitation problem, our study demonstrates that what the multispecies coalescent actually delimits is structure. The current implementations of species delimitation under the multispecies coalescent do not provide any way for distinguishing between structure due to population-level processes and that due to species boundaries. The overinflation of species due to the misidentification of general genetic structure for species boundaries has profound implications for our understanding of the generation and dynamics of biodiversity, because any ecological or evolutionary studies that rely on species as their fundamental units will be impacted, as well as the very existence of this biodiversity, because conservation planning is undermined due to isolated populations incorrectly being treated as distinct species. The multispecies coalescent model underlies many approaches used for species delimitation. In previous work assessing the performance of species delimitation under this model, speciation was treated as an instantaneous event rather than as an extended process involving distinct phases of speciation initiation (structuring) and completion. Here, we use data under simulations that explicitly model speciation as an extended process rather than an instantaneous event and carry out species delimitation inference on these data under the multispecies coalescent. We show that the multispecies coalescent diagnoses genetic structure, not species, and that it does not statistically distinguish structure associated with population isolation vs. species boundaries. Because of the misidentification of population structure as putative species, our work raises questions about the practice of genome-based species discovery, with cascading consequences in other fields. Specifically, all fields that rely on species as units of analysis, from conservation biology to studies of macroevolutionary dynamics, will be impacted by inflated estimates of the number of species, especially as genomic resources provide unprecedented power for detecting increasingly finer-scaled genetic structure under the multispecies coalescent. As such, our work also represents a general call for systematic study to reconsider a reliance on genomic data alone. Until new methods are developed that can discriminate between structure due to population-level processes and that due to species boundaries, genomic-based results should only be considered a hypothesis that requires validation of delimited species with multiple data types, such as phenotypic and ecological information.

[1]  Sean Nee,et al.  Birth-Death Models in Macroevolution , 2006 .

[2]  J. Hey,et al.  POPULATION GENETICS AND OBJECTIVITY IN SPECIES DIAGNOSIS , 2012, Evolution; international journal of organic evolution.

[3]  D. Wheatcroft,et al.  THE ROLE OF ECOLOGICAL CONSTRAINT IN DRIVING THE EVOLUTION OF AVIAN SONG FREQUENCY ACROSS A LATITUDINAL GRADIENT , 2012, Evolution; international journal of organic evolution.

[4]  J. Avise Phylogeography: The History and Formation of Species , 2000 .

[5]  Ziheng Yang The BPP program for species tree estimation and species delimitation , 2015 .

[6]  Luke J. Harmon,et al.  Goldilocks Meets Santa Rosalia: An Ephemeral Speciation Model Explains Patterns of Diversification Across Time Scales , 2012, Evolutionary Biology.

[7]  K. de Queiroz,et al.  Ernst Mayr and the modern concept of species , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[8]  M. Fujita,et al.  Multilocus phylogeography reveals nested endemism in a gecko across the monsoonal tropics of Australia , 2016, Molecular ecology.

[9]  Claudia R. Solís-Lemus,et al.  Bayesian species delimitation combining multiple genes and traits in a unified framework , 2015, Evolution; international journal of organic evolution.

[10]  W. Jetz,et al.  The global diversity of birds in space and time , 2012, Nature.

[11]  Jonathan D. Ballou,et al.  Implications of different species concepts for conserving biodiversity , 2012 .

[12]  J. G. Burleigh,et al.  Synthesis of phylogeny and taxonomy into a comprehensive tree of life , 2014, Proceedings of the National Academy of Sciences.

[13]  J. L. Gittleman,et al.  The Impact of Species Concept on Biodiversity Studies , 2004, The Quarterly Review of Biology.

[14]  Md. Shamsuzzoha Bayzid,et al.  Whole-genome analyses resolve early branches in the tree of life of modern birds , 2014, Science.

[15]  Jeet Sukumaran,et al.  DendroPy: a Python library for phylogenetic computing , 2010, Bioinform..

[16]  Craig Moritz,et al.  Coalescent-based species delimitation in an integrative taxonomy. , 2012, Trends in ecology & evolution.

[17]  K. Queiroz**,et al.  Ernst Mayr and the modern concept of species , 2005 .

[18]  B. Rannala,et al.  Bayesian species delimitation using multilocus sequence data , 2010, Proceedings of the National Academy of Sciences.

[19]  F. Leliaert,et al.  Extensive cryptic species diversity and fine-scale endemism in the marine red alga Portieria in the Philippines , 2013, Proceedings of the Royal Society B: Biological Sciences.

[20]  David Bryant,et al.  Simulating gene trees under the multispecies coalescent and time-dependent migration , 2013, BMC Evolutionary Biology.

[21]  L. Knowles,et al.  The Species versus Subspecies Conundrum: Quantitative Delimitation from Integrating Multiple Data Types within a Single Bayesian Approach in Hercules Beetles. , 2016, Systematic biology.

[22]  R. Jansson,et al.  PERSISTENCE OF WITHIN‐SPECIES LINEAGES: A NEGLECTED CONTROL OF SPECIATION RATES , 2014, Evolution; international journal of organic evolution.

[23]  David B. Wake,et al.  EVOLUTIONARY RELATIONSHIPS WITHIN THE ENSATINA ESCHSCHOLTZII COMPLEX CONFIRM THE RING SPECIES INTERPRETATION , 1992 .

[24]  L. Knowles,et al.  Upstream analyses create problems with DNA-based species delimitation. , 2014, Systematic biology.

[25]  Tianqi Zhu,et al.  Evaluation of a bayesian coalescent method of species delimitation. , 2011, Systematic biology.

[26]  Bryan C. Carstens,et al.  How to fail at species delimitation , 2013, Molecular ecology.

[27]  R. Vilà,et al.  Cryptic matters: overlooked species generate most butterfly beta‐diversity , 2015 .

[28]  Bryan C. Carstens,et al.  ESTIMATING A GEOGRAPHICALLY EXPLICIT MODEL OF POPULATION DIVERGENCE , 2007, Evolution; international journal of organic evolution.

[29]  A. Lambert,et al.  The reconstructed tree in the lineage-based model of protracted speciation , 2013, Journal of mathematical biology.

[30]  James Mallet,et al.  Taxonomic inflation: its influence on macroecology and conservation. , 2004, Trends in ecology & evolution.

[31]  Bryan C Carstens,et al.  Estimating species phylogeny from gene-tree probabilities despite incomplete lineage sorting: an example from Melanoplus grasshoppers. , 2007, Systematic biology.

[32]  K. de Queiroz,et al.  Different species problems and their resolution. , 2005, BioEssays : news and reviews in molecular, cellular and developmental biology.

[33]  C. Botero,et al.  Environmental harshness is positively correlated with intraspecific divergence in mammals and birds , 2014, Molecular ecology.

[34]  Rampal S. Etienne,et al.  Prolonging the Past Counteracts the Pull of the Present: Protracted Speciation Can Explain Observed Slowdowns in Diversification , 2012, Systematic biology.

[35]  Rampal S Etienne,et al.  ESTIMATING THE DURATION OF SPECIATION FROM PHYLOGENIES , 2014, Evolution; international journal of organic evolution.

[36]  Ziheng Yang,et al.  Unguided Species Delimitation Using DNA Sequence Data from Multiple Loci , 2014, Molecular biology and evolution.

[37]  P. Nosil,et al.  Ecological explanations for (incomplete) speciation. , 2009, Trends in ecology & evolution.

[38]  J. Bergsten,et al.  Bayesian species delimitation reveals generalist and specialist parasitic wasps on Galerucella beetles (Chrysomelidae): sorting by herbivore or plant host , 2013, BMC Evolutionary Biology.