Bayesian clustering algorithms ascertaining spatial population structure: a new computer program and a comparison study

On the basis of simulated data, this study compares the relative performances of the Bayesian clustering computer programs STRUCTURE, GENELAND, GENECLUST and a new program named TESS. While these four programs can detect population genetic structure from multilocus genotypes, only the last three ones include simultaneous analysis from geographical data. The programs are compared with respect to their abilities to infer the number of populations, to estimate membership probabilities, and to detect genetic discontinuities and clinal variation. The results suggest that combining analyses using TESS and STRUCTURE offers a convenient way to address inference of spatial population structure.

[1]  Matthew Stephens,et al.  Assigning African elephant DNA to geographic region of origin: applications to the ivory trade. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[2]  Gilles Celeux,et al.  EM procedures using mean field-like approximations for Markov model-based image segmentation , 2003, Pattern Recognit..

[3]  Sophie Ancelet,et al.  Bayesian Clustering Using Hidden Markov Random Fields in Spatial Population Genetics , 2006, Genetics.

[4]  S. Pääbo,et al.  Evidence for gradients of human genetic diversity within and among continents. , 2004, Genome research.

[5]  M. Stephens,et al.  Inference of population structure using multilocus genotype data: dominant markers and null alleles , 2007, Molecular ecology notes.

[6]  Guha Dharmarajan,et al.  Relative performance of Bayesian clustering software for inferring population substructure and individual assignment at low levels of population differentiation , 2006, Conservation Genetics.

[7]  Stephanie Manel,et al.  Assignment methods: matching biological questions with appropriate techniques. , 2005, Trends in ecology & evolution.

[8]  B. Weir,et al.  ESTIMATING F‐STATISTICS FOR THE ANALYSIS OF POPULATION STRUCTURE , 1984, Evolution; international journal of organic evolution.

[9]  M. Feldman,et al.  Clines, Clusters, and the Effect of Study Design on the Inference of Human Population Structure , 2005, PLoS genetics.

[10]  G. Evanno,et al.  Detecting the number of clusters of individuals using the software structure: a simulation study , 2005, Molecular ecology.

[11]  K J Dawson,et al.  A Bayesian approach to the identification of panmictic populations and the assignment of individuals. , 2001, Genetical research.

[12]  M. Feldman,et al.  Genetic Structure of Human Populations , 2002, Science.

[13]  P. Green,et al.  On Bayesian Analysis of Mixtures with an Unknown Number of Components (with discussion) , 1997 .

[14]  Noah A Rosenberg,et al.  Low Levels of Genetic Divergence across Geographically and Linguistically Diverse Populations from India , 2006, PLoS genetics.

[15]  M. Sillanpää,et al.  Bayesian analysis of genetic differentiation between populations. , 2003, Genetics.

[16]  Olivier François,et al.  fastruct: model‐based clustering made faster , 2006 .

[17]  B. Rannala,et al.  The Bayesian revolution in genetics , 2004, Nature Reviews Genetics.

[18]  A Coulon,et al.  Genetic structure is influenced by landscape features: empirical evidence from a roe deer population , 2006, Molecular ecology.

[19]  Arnaud Estoup,et al.  A Spatial Statistical Model for Landscape Genetics , 2005, Genetics.

[20]  P. Donnelly,et al.  Inference of population structure using multilocus genotype data. , 2000, Genetics.

[21]  David B. Dunson,et al.  Bayesian Data Analysis , 2010 .

[22]  L. Excoffier,et al.  Computer programs for population genetics data analysis: a survival guide , 2006, Nature Reviews Genetics.

[23]  Noah A. Rosenberg,et al.  CLUMPP: a cluster matching and permutation program for dealing with label switching and multimodality in analysis of population structure , 2007, Bioinform..

[24]  D. Rubin,et al.  Maximum likelihood from incomplete data via the EM - algorithm plus discussions on the paper , 1977 .