diveRsity: An R package for the estimation and exploration of population genetics parameters and their associated errors

Summary We present a new R package, diveRsity, for the calculation of various diversity statistics, including common diversity partitioning statistics (θ, GST) and population differentiation statistics (DJost, GST ', χ2 test for population heterogeneity), among others. The package calculates these estimators along with their respective bootstrapped confidence intervals for loci, sample population pairwise and global levels. Various plotting tools are also provided for a visual evaluation of estimated values, allowing users to critically assess the validity and significance of statistical tests from a biological perspective. diveRsity has a set of unique features, which facilitate the use of an informed framework for assessing the validity of the use of traditional F-statistics for the inference of demography, with reference to specific marker types, particularly focusing on highly polymorphic microsatellite loci. However, the package can be readily used for other co-dominant marker types (e.g. allozymes, SNPs). Detailed examples of usage and descriptions of package capabilities are provided. The examples demonstrate useful strategies for the exploration of data and interpretation of results generated by diveRsity. Additional online resources for the package are also described, including a GUI web app version intended for those with more limited experience using R for statistical analysis.

[1]  R. Ward,et al.  Informativeness of genetic markers for inference of ancestry. , 2003, American journal of human genetics.

[2]  P. Hedrick,et al.  Assessing population structure: FST and related measures , 2011, Molecular ecology resources.

[3]  P. McGinnity,et al.  Beaufort trout MicroPlex: a high-throughput multiplex platform comprising 38 informative microsatellite loci for use in resident and anadromous (sea trout) brown trout Salmo trutta genetic studies. , 2013, Journal of fish biology.

[4]  L. Waits,et al.  Using a reference population yardstick to calibrate and compare genetic diversity reported in different studies: an example from the brown bear , 2012, Heredity.

[5]  G. Gerlach,et al.  Calculations of population differentiation based on GST and D: forget GST but not all of statistics! , 2010, Molecular ecology.

[6]  Nicholas G. Crawford,et al.  smogd: software for the measurement of genetic diversity , 2010, Molecular ecology resources.

[7]  E. Wagenmakers A practical solution to the pervasive problems ofp values , 2007, Psychonomic bulletin & review.

[8]  B. Weir,et al.  ESTIMATING F‐STATISTICS FOR THE ANALYSIS OF POPULATION STRUCTURE , 1984, Evolution; international journal of organic evolution.

[9]  J. Goudet HIERFSTAT , a package for R to compute and test hierarchical F -statistics , 2005 .

[10]  M. Nei,et al.  Estimation of fixation indices and gene diversities , 1983, Annals of human genetics.

[11]  Maria Blettner,et al.  Confidence Interval or P-Value? , 2009 .

[12]  P. Hedrick PERSPECTIVE: HIGHLY VARIABLE LOCI AND THEIR INTERPRETATION IN EVOLUTION AND CONSERVATION , 1999, Evolution; international journal of organic evolution.

[13]  L. Excoffier,et al.  Computer programs for population genetics data analysis: a survival guide , 2006, Nature Reviews Genetics.

[14]  G. Hommel,et al.  Confidence interval or p-value?: part 4 of a series on evaluation of scientific publications. , 2009, Deutsches Arzteblatt international.

[15]  M. Whitlock and D do not replace FST , 2011, Molecular ecology.

[16]  François Rousset,et al.  GENEPOP (version 1.2): population genetic software for exact tests and ecumenicism , 1995 .

[17]  Anne Chao User ’ s Guide for Program SPADE ( Species Prediction And Diversity Estimation ) , 2010 .

[18]  D. Tennenhouse Common misconceptions. , 1979, Ophthalmology.

[19]  D. Winter mmod: an R library for the calculation of population differentiation statistics , 2012, Molecular ecology resources.

[20]  P. Hedrick A STANDARDIZED GENETIC DIFFERENTIATION MEASURE , 2005, Evolution; international journal of organic evolution.

[21]  P. Meirmans,et al.  genotype and genodive: two programs for the analysis of genetic diversity of asexual organisms , 2004 .

[22]  F. Balloux EASYPOP (version 1.7): a computer program for population genetics simulations. , 2001, The Journal of heredity.

[23]  R. Toonen,et al.  Common misconceptions in molecular ecology: echoes of the modern synthesis , 2012, Molecular ecology.

[24]  L. Jost GST and its relatives do not measure differentiation , 2008, Molecular ecology.