INVITED REVIEW: What is a population? An empirical evaluation of some genetic methods for identifying the number of gene pools and their degree of connectivity

We review commonly used population definitions under both the ecological paradigm (which emphasizes demographic cohesion) and the evolutionary paradigm (which emphasizes reproductive cohesion) and find that none are truly operational. We suggest several quantitative criteria that might be used to determine when groups of individuals are different enough to be considered ‘populations’. Units for these criteria are migration rate (m) for the ecological paradigm and migrants per generation (Nm) for the evolutionary paradigm. These criteria are then evaluated by applying analytical methods to simulated genetic data for a finite island model. Under the standard parameter set that includes L = 20 High mutation (microsatellite‐like) loci and samples of S = 50 individuals from each of n = 4 subpopulations, power to detect departures from panmixia was very high (∼100%; P < 0.001) even with high gene flow (Nm = 25). A new method, comparing the number of correct population assignments with the random expectation, performed as well as a multilocus contingency test and warrants further consideration. Use of Low mutation (allozyme‐like) markers reduced power more than did halving S or L. Under the standard parameter set, power to detect restricted gene flow below a certain level X (H0: Nm < X) can also be high, provided that true Nm ≤ 0.5X. Developing the appropriate test criterion, however, requires assumptions about several key parameters that are difficult to estimate in most natural populations. Methods that cluster individuals without using a priori sampling information detected the true number of populations only under conditions of moderate or low gene flow (Nm ≤ 5), and power dropped sharply with smaller samples of loci and individuals. A simple algorithm based on a multilocus contingency test of allele frequencies in pairs of samples has high power to detect the true number of populations even with Nm = 25 but requires more rigorous statistical evaluation. The ecological paradigm remains challenging for evaluations using genetic markers, because the transition from demographic dependence to independence occurs in a region of high migration where genetic methods have relatively little power. Some recent theoretical developments and continued advances in computational power provide hope that this situation may change in the future.

[1]  John M. Drake,et al.  Population Viability Analysis , 2019, Encyclopedia of Theoretical Ecology.

[2]  M. Whitlock Combining probability from independent tests: the weighted Z‐method is superior to Fisher's approach , 2005, Journal of evolutionary biology.

[3]  Arnaud Estoup,et al.  A Spatial Statistical Model for Landscape Genetics , 2005, Genetics.

[4]  G. Evanno,et al.  Detecting the number of clusters of individuals using the software structure: a simulation study , 2005, Molecular ecology.

[5]  D. J. Funk,et al.  REPRODUCTIVE ISOLATION CAUSED BY NATURAL SELECTION AGAINST IMMIGRANTS FROM DIVERGENT HABITATS , 2005 .

[6]  LEAKY PREZYGOTIC ISOLATION AND POROUS GENOMES: RAPID INTROGRESSION OF MATERNALLY INHERITED DNA , 2005, Evolution; international journal of organic evolution.

[7]  D. J. Funk,et al.  REPRODUCTIVE ISOLATION CAUSED BY NATURAL SELECTION AGAINST IMMIGRANTS FROM DIVERGENT HABITATS , 2005 .

[8]  Stephanie Manel,et al.  Assignment methods: matching biological questions with appropriate techniques. , 2005, Trends in ecology & evolution.

[9]  Koen J. F. Verhoeven,et al.  Implementing false discovery rate control: increasing your power , 2005 .

[10]  M. Slatkin Seeing ghosts: the effect of unsampled populations on migration rates estimated for sampled populations , 2004, Molecular ecology.

[11]  J. Cornuet,et al.  GENECLASS2: a software for genetic assignment and first-generation migrant detection. , 2004, The Journal of heredity.

[12]  Jukka Corander,et al.  BAPS 2: enhanced possibilities for the analysis of genetic population structure , 2004, Bioinform..

[13]  F. Allendorf,et al.  Intercrosses and the U.S. Endangered Species Act: Should Hybridized Populations be Included as Westslope Cutthroat Trout? , 2004 .

[14]  F. Rousset Inferences from Spatial Population Genetics , 2004 .

[15]  S. Kalinowski Genetic polymorphism and mixed-stock fisheries analysis , 2004 .

[16]  Luis V. García,et al.  Escaping the Bonferroni iron claw in ecological studies , 2004 .

[17]  R. Nielsen,et al.  Multilocus Methods for Estimating Population Sizes, Migration Rates and Divergence Time, With Applications to the Divergence of Drosophila pseudoobscura and D. persimilis , 2004, Genetics.

[18]  Jinliang Wang,et al.  Application of the One‐Migrant‐per‐Generation Rule to Conservation and Management , 2004 .

[19]  Zaid Abdo,et al.  Evaluating the performance of likelihood methods for detecting population structure and migration , 2004, Molecular ecology.

[20]  P. Beerli,et al.  Effect of unsampled populations on the estimation of population sizes and migration rates between sampled populations , 2004, Molecular ecology.

[21]  G. Luikart,et al.  SNPs in ecology, evolution and conservation , 2004 .

[22]  Arnaud Estoup,et al.  Genetic assignment methods for the direct, real‐time estimation of migration rate: a simulation‐based exploration of accuracy and power , 2004, Molecular ecology.

[23]  P. Ehrlich,et al.  Population biology of the checkerspot butterfly, Euphydryas chalcedona structure of the Jasper Ridge colony , 2004, Oecologia.

[24]  Jody Hey,et al.  Understanding and confronting species uncertainty in biology and conservation , 2003 .

[25]  Paul Beier,et al.  Population viability analysis. , 2016 .

[26]  M. Stephens,et al.  Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. , 2003, Genetics.

[27]  D. Balding Likelihood-based inference for genetic correlation coefficients. , 2003, Theoretical population biology.

[28]  Bruce Rannala,et al.  Bayesian inference of recent migration rates using multilocus genotypes. , 2003, Genetics.

[29]  G. Hewitt,et al.  Nuclear DNA analyses in genetic studies of populations: practice, problems and prospects , 2003, Molecular ecology.

[30]  M. Sillanpää,et al.  Bayesian analysis of genetic differentiation between populations. , 2003, Genetics.

[31]  M. Whitlock,et al.  Estimating effective population size and migration rates from genetic samples over space and time. , 2003, Genetics.

[32]  F. Balloux,et al.  The estimation of population differentiation with microsatellite markers , 2002, Molecular ecology.

[33]  M. Kirkpatrick,et al.  A Selective Advantage to Immigrant Genes in a Daphnia Metapopulation , 2002, Science.

[34]  W. Munns,et al.  What is a population , 2002 .

[35]  S. Kalinowski,et al.  How many alleles per locus should be used to estimate genetic distances? , 2002, Heredity.

[36]  B S Weir,et al.  Estimating F-statistics. , 2002, Annual review of genetics.

[37]  N. Ryman,et al.  Statistical power when testing for genetic differentiation , 2001, Molecular ecology.

[38]  K J Dawson,et al.  A Bayesian approach to the identification of panmictic populations and the assignment of individuals. , 2001, Genetical research.

[39]  Y. Benjamini,et al.  THE CONTROL OF THE FALSE DISCOVERY RATE IN MULTIPLE TESTING UNDER DEPENDENCY , 2001 .

[40]  F. Balloux EASYPOP (version 1.7): a computer program for population genetics simulations. , 2001, The Journal of heredity.

[41]  Peter Beerli,et al.  Maximum likelihood estimation of a migration matrix and effective population sizes in n subpopulations by using a coalescent approach , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[42]  F. Balloux,et al.  SEX‐BIASED DISPERSAL IN A MIGRATORY BAT: A CHARACTERIZATION USING SEX‐SPECIFIC DEMOGRAPHIC PARAMETERS , 2001, Evolution; international journal of organic evolution.

[43]  M. Macnair Genetics of Populations (2nd edn) , 2000, Heredity.

[44]  K. Crandall,et al.  Considering evolutionary processes in conservation biology. , 2000, Trends in ecology & evolution.

[45]  P. Donnelly,et al.  Inference of population structure using multilocus genotype data. , 2000, Genetics.

[46]  F. Bonhomme,et al.  Properties of bias and variance of two multiallelic estimators of F(ST). , 2000, Theoretical population biology.

[47]  Rudolf Meier,et al.  Species concepts and phylogenetic theory : a debate , 2000 .

[48]  M. Ford,et al.  Viable salmonid populations and the recovery of evolutionarily significant units , 2000 .

[49]  G Luikart,et al.  New methods employing multilocus genotypes to select or exclude populations as origins of individuals. , 1999, Genetics.

[50]  R. Lande,et al.  Spatial Scale of Population Synchrony: Environmental Correlation versus Dispersal and Density Regulation , 1999, The American Naturalist.

[51]  J. Goudet An Improved Procedure for Testing the Effects of Key Innovations on Rate of Speciation , 1999, The American Naturalist.

[52]  P. Hedrick PERSPECTIVE: HIGHLY VARIABLE LOCI AND THEIR INTERPRETATION IN EVOLUTION AND CONSERVATION , 1999, Evolution; international journal of organic evolution.

[53]  J. Goudet,et al.  Hierarchical analyses of genetic differentiation in a hybrid zone of Sorex araneus (Insectivora: Soricidae) , 1999 .

[54]  A. Bohonak,et al.  Dispersal, Gene Flow, and Population Structure , 1999, The Quarterly Review of Biology.

[55]  M. Whitlock,et al.  Indirect measures of gene flow and migration: FST≠1/(4Nm+1) , 1999, Heredity.

[56]  R. Waples Separating the wheat from the chaff: patterns of genetic differentiation in high gene flow species , 1998 .

[57]  B. Rannala,et al.  Detecting immigration by using multilocus genotypes. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[58]  F. Allendorf,et al.  The One‐Migrant‐per‐Generation Rule in Conservation and Management , 1996 .

[59]  M Raymond,et al.  Testing differentiation in diploid populations. , 1996, Genetics.

[60]  L. Rieseberg,et al.  Molecular marker incongruence in plant hybrid zones and phylogenetic trees , 1996 .

[61]  B Rannala,et al.  Estimating gene flow in island populations. , 1996, Genetical research.

[62]  J. Goudet FSTAT (Version 1.2): A Computer Program to Calculate F-Statistics , 1995 .

[63]  J. Avise Mitochondrial DNA Polymorphism and a Connection Between Genetics and Demography of Relevance to Conservation , 1995 .

[64]  I. Stirling,et al.  Microsatellite analysis of population structure in Canadian polar bears , 1995, Molecular ecology.

[65]  François Rousset,et al.  GENEPOP (version 1.2): population genetic software for exact tests and ecumenicism , 1995 .

[66]  Y. Benjamini,et al.  Controlling the false discovery rate: a practical and powerful approach to multiple testing , 1995 .

[67]  C. Moritz Defining 'Evolutionarily Significant Units' for conservation. , 1994, Trends in ecology & evolution.

[68]  Alan Hastings,et al.  Complex interactions between dispersal and dynamics: Lessons from coupled logistic equations , 1993 .

[69]  C. Cockerham,et al.  ESTIMATION OF GENE FLOW FROM F‐STATISTICS , 1993, Evolution; international journal of organic evolution.

[70]  Kevin C. Nixon,et al.  Populations, Genetic Variation, and the Delimitation of Phylogenetic Species , 1992 .

[71]  W. Rice ANALYZING TABLES OF STATISTICAL TESTS , 1989, Evolution; international journal of organic evolution.

[72]  A. J. Noordwijk Futuyma, D. J. 1986. Evolutionary Biology 2nd edition, Sinauer Associates Inc. Sunderland, Mass. , 1988 .

[73]  B. Weir,et al.  Correlations, descent measures: drift with migration and mutation. , 1987, Proceedings of the National Academy of Sciences of the United States of America.

[74]  M. Nei Molecular Evolutionary Genetics , 1987 .

[75]  L. Slobodkin The Ecological Web The Ecological Web: More on the Distribution and Abundance of Animals H. G. Andrewartha L. C. Birch , 1986 .

[76]  B. Weir,et al.  ESTIMATING F‐STATISTICS FOR THE ANALYSIS OF POPULATION STRUCTURE , 1984, Evolution; international journal of organic evolution.

[77]  J. Crow,et al.  Group selection for a polygenic behavioral trait: estimating the degree of population subdivision. , 1984, Proceedings of the National Academy of Sciences of the United States of America.

[78]  W. G. Hill,et al.  Deviations from Hardy-Weinberg proportions: sampling variances and use in estimation of inbreeding coefficients. , 1984, Genetics.

[79]  P. Hedrick Genetics of populations , 1983 .

[80]  H. Booke The Conundrum of the Stock Concept—Are Nature and Nurture Definable in Fishery Science? , 1981 .

[81]  D. Hartl,et al.  Principles of population genetics , 1981 .

[82]  P. J. Boer The significance of Dispersal Power for the Survival of Species, with Special Reference to the Carabid Beetles in a Cultivated Countryside , 1979 .

[83]  マグローヒル科学技術用語大辞典編集委員会,et al.  マグローヒル科学技術用語大辞典 = McGraw-Hill dictionary of scientific and technical terms , 1979 .

[84]  Sewall Wright,et al.  Variability within and among natural populations , 1978 .

[85]  P. J. Boer Dispersal power and survival. Carabids in a cultivated coutryside , 1977 .

[86]  E. C. Pielou Population and Community Ecology. , 1976 .

[87]  Sybil P. Parker,et al.  McGraw-Hill dictionary of scientific and technical terms , 1976 .

[88]  C. Krebs Ecology: The Experimental Analysis of Distribution and Abundance , 1973 .

[89]  T. Dobzhansky Genetics of the Evolutionary Process , 1970 .

[90]  O. L. Davies,et al.  Statistical Methods. 6th Edition. , 1968 .

[91]  G. Williams Adaptation and Natural Selection , 2018 .

[92]  T. A. Goudge What Is a Population? , 1955, Philosophy of Science.

[93]  S. Wright,et al.  Isolation by Distance. , 1943, Genetics.

[94]  S. Wright,et al.  Evolution in Mendelian Populations. , 1931, Genetics.