Microsatellites are molecular clocks that support accurate inferences about history.

Microsatellite length mutations are often modeled using the generalized stepwise mutation process, which is a type of random walk. If this model is sufficiently accurate, one can estimate the coalescence time between alleles of a locus after a mathematical transformation of the allele lengths. When large-scale microsatellite genotyping first became possible, there was substantial interest in using this approach to make inferences about time and demography, but that interest has waned because it has not been possible to empirically validate the clock by comparing it with data in which the mutation process is well understood. We analyzed data from 783 microsatellite loci in human populations and 292 loci in chimpanzee populations, and compared them with up to one gigabase of aligned sequence data, where the molecular clock based upon nucleotide substitutions is believed to be reliable. We empirically demonstrate a remarkable linearity (r(2) > 0.95) between the microsatellite average square distance statistic and sequence divergence. We demonstrate that microsatellites are accurate molecular clocks for coalescent times of at least 2 million years (My). We apply this insight to confirm that the African populations San, Biaka Pygmy, and Mbuti Pygmy have the deepest coalescent times among populations in the Human Genome Diversity Project. Furthermore, we show that microsatellites support unbiased estimates of population differentiation (F(ST)) that are less subject to ascertainment bias than single nucleotide polymorphism (SNP) F(ST). These results raise the prospect of using microsatellite data sets to determine parameters of population history. When genotyped along with SNPs, microsatellite data can also be used to correct for SNP ascertainment bias.

[1]  M. Nalls,et al.  Reduced Neutrophil Count in People of African Descent Is Due To a Regulatory Variant in the Duffy Antigen Receptor for Chemokines Gene , 2009, PLoS genetics.

[2]  J. Mullikin,et al.  Nature Genetics: doi:10.1038/ng.303Supplementary Methods , 2022 .

[3]  M. Feldman,et al.  Population differentiation and migration: coalescence times in a two-sex island model for autosomal and X-linked loci. , 2008, Theoretical population biology.

[4]  Noah A. Rosenberg,et al.  ADZE: a rarefaction approach for counting alleles private to combinations of populations , 2008, Bioinform..

[5]  D. Reich,et al.  Analysis of Chimpanzee History Based on Genome Sequence Alignments , 2008, PLoS genetics.

[6]  N. Freimer,et al.  Geographic Patterns of Genome Admixture in Latin American Mestizos , 2008, PLoS genetics.

[7]  M. Feldman,et al.  Worldwide Human Relationships Inferred from Genome-Wide Patterns of Variation , 2008 .

[8]  Jonathan Scott Friedlaender,et al.  The Genetic Structure of Pacific Islanders , 2008, PLoS genetics.

[9]  Mattias Jakobsson,et al.  Genetic Variation and Population Structure in Native Americans , 2007, PLoS genetics.

[10]  J. Mullikin,et al.  Measurement of the human allele frequency spectrum demonstrates greater genetic drift in East Asians than in Europeans , 2007, Nature Genetics.

[11]  David Reich,et al.  Genetic Structure of Chimpanzee Populations , 2007, PLoS genetics.

[12]  Motoo Kimura,et al.  A model of mutation appropriate to estimate the number of electrophoretically detectable alleles in a finite population*. , 1973, Genetical research.

[13]  Noah A Rosenberg,et al.  Low Levels of Genetic Divergence across Geographically and Linguistically Diverse Populations from India , 2006, PLoS genetics.

[14]  D. Reich,et al.  Population Structure and Eigenanalysis , 2006, PLoS genetics.

[15]  N. Rosenberg,et al.  Standardized Subsets of the HGDP‐CEPH Human Genome Diversity Cell Line Panel, Accounting for Atypical and Duplicated Samples and Pairs of Close Relatives , 2006, Annals of human genetics.

[16]  D. Conrad,et al.  A worldwide survey of haplotype variation and linkage disequilibrium in the human genome , 2006, Nature Genetics.

[17]  Eric S. Lander,et al.  Genetic evidence for complex speciation of humans and chimpanzees , 2006, Nature.

[18]  B. Nickel,et al.  Demographic History and Genetic Differentiation in Apes , 2006, Current Biology.

[19]  M. Feldman,et al.  Clines, Clusters, and the Effect of Study Design on the Inference of Human Population Structure , 2005, PLoS genetics.

[20]  Carlos D Bustamante,et al.  Ascertainment bias in studies of human genome-wide polymorphism. , 2005, Genome research.

[21]  Sohini Ramachandran,et al.  Support from the relationship of genetic and geographic distance in human populations for a serial founder effect originating in Africa. , 2005, Proceedings of the National Academy of Sciences of the United States of America.

[22]  Jean L. Chang,et al.  Initial sequence of the chimpanzee genome and comparison with the human genome , 2005, Nature.

[23]  H. Ellegren Microsatellites: simple sequences with complex evolution , 2004, Nature Reviews Genetics.

[24]  M. Feldman,et al.  Features of evolution and expansion of modern humans, inferred from genomewide microsatellite markers. , 2003, American journal of human genetics.

[25]  M. Feldman,et al.  The application of molecular genetic approaches to the study of human evolution , 2003, Nature Genetics.

[26]  M. Feldman,et al.  Genetic Structure of Human Populations , 2002, Science.

[27]  Noah A Rosenberg,et al.  The probability of topological concordance of gene trees and species trees. , 2002, Theoretical population biology.

[28]  Richard R. Hudson,et al.  Generating samples under a Wright-Fisher neutral model of genetic variation , 2002, Bioinform..

[29]  J. Mullikin,et al.  SSAHA: a fast search method for large DNA databases. , 2001, Genome research.

[30]  L. Zhivotovsky,et al.  Estimating divergence time with the use of microsatellite genetic distances: impacts of population growth and gene flow. , 2001, Molecular biology and evolution.

[31]  J. V. Moran,et al.  Initial sequencing and analysis of the human genome. , 2001, Nature.

[32]  James L. Weber,et al.  7 Genotyping for human whole-genome scans: Past, present, and future , 2001 .

[33]  John P. Rice,et al.  Genotyping for human whole-genome scans: past, present, and future. , 2001, Advances in genetics.

[34]  International Human Genome Sequencing Consortium Initial sequencing and analysis of the human genome , 2001, Nature.

[35]  H. Ellegren Microsatellite mutations in the germline: implications for evolutionary inference. , 2000, Trends in genetics : TIG.

[36]  Eric S. Lander,et al.  An SNP map of the human genome generated by reduced representation shotgun sequencing , 2000, Nature.

[37]  Mei Peng,et al.  The direction of microsatellite mutations is dependent upon allele length , 2000, Nature Genetics.

[38]  E. Knapik,et al.  Zebrafish genetic map with 2000 microsatellite markers. , 1999, Genomics.

[39]  R. Durrett,et al.  Equilibrium distributions of microsatellite repeat length resulting from a balance between slippage events and point mutations. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[40]  M. Ganal,et al.  A microsatellite map of wheat. , 1998, Genetics.

[41]  D. Goldstein,et al.  Genetic evidence for a Paleolithic human population expansion in Africa. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[42]  M Kimmel,et al.  Signatures of population expansion in microsatellite repeat data. , 1998, Genetics.

[43]  L. Waits,et al.  An empirical evaluation of genetic distance statistics using microsatellite data from bear (Ursidae) populations. , 1997, Genetics.

[44]  D. Pollock,et al.  Launching microsatellites: a review of mutation processes and methods of phylogenetic interference. , 1997, The Journal of heredity.

[45]  J. Weber,et al.  Human whole-genome shotgun sequencing. , 1997, Genome research.

[46]  M. Feldman,et al.  Microsatellite genetic distances with range constraints: analytic description and problems of estimation. , 1997, Genetics.

[47]  M. Kimmel,et al.  Measures of variation at DNA repeat loci under a general stepwise mutation model. , 1996, Theoretical population biology.

[48]  F. Weissing,et al.  Constraints on allele size at microsatellite loci: implications for genetic differentiation. , 1996, Genetics.

[49]  Cécile Fizames,et al.  A comprehensive genetic map of the human genome based on 5,264 microsatellites , 1996, Nature.

[50]  D. Rubinsztein,et al.  Microsatellites are subject to directional evolution , 1996, Nature Genetics.

[51]  M W Feldman,et al.  Genetic absolute dating based on microsatellites and the origin of modern humans. , 1995, Proceedings of the National Academy of Sciences of the United States of America.

[52]  M W Feldman,et al.  An evaluation of genetic distances for use with microsatellite loci. , 1994, Genetics.

[53]  M Slatkin,et al.  A measure of population subdivision based on microsatellite allele frequencies. , 1995, Genetics.

[54]  N. Freimer,et al.  Mutational processes of simple-sequence repeat loci in human populations. , 1994, Proceedings of the National Academy of Sciences of the United States of America.

[55]  L. Cavalli-Sforza,et al.  High resolution of human evolutionary trees with polymorphic microsatellites , 1994, Nature.

[56]  J. Weber,et al.  Mutation of human short tandem repeats. , 1993, Human molecular genetics.

[57]  N. Freimer,et al.  Allele frequencies at microsatellite loci: the stepwise mutation model revisited. , 1993, Genetics.

[58]  M. Slatkin Inbreeding coefficients and coalescence times. , 1991, Genetical research.

[59]  G. Levinson,et al.  High frequencies of short frameshifts in poly-CA/TG tandem repeats borne by bacteriophage M13 in Escherichia coli K-12 , 1987, Nucleic Acids Res..

[60]  B. Efron,et al.  A Leisurely Look at the Bootstrap, the Jackknife, and , 1983 .