Theory of the effects of population structure and sampling on patterns of linkage disequilibrium applied to genomic data from humans.

We develop predictions for the correlation of heterozygosity and for linkage disequilibrium between two loci using a simple model of population structure that includes migration among local populations, or demes. We compare the results for a sample of size two from the same deme (a single-deme sample) to those for a sample of size two from two different demes (a scattered sample). The correlation in heterozygosity for a scattered sample is surprisingly insensitive to both the migration rate and the number of demes. In contrast, the correlation in heterozygosity for a single-deme sample is sensitive to both, and an increase in the number of demes has an effect qualitatively similar to that of a decrease in the migration rate: both increase the correlation in heterozygosity. The same conclusions hold for a commonly used measure of linkage disequilibrium, r^2. We compare the predictions of the theory to genomic data from humans and show that subdivision might account for a substantial portion of the genetic associations observed within the human genome, even though migration rates among local populations of humans are relatively large. Correlations due to subdivision rather than to physical linkage can be large even in a single-deme sample. If long-term migration has been important in shaping patterns of human polymorphism, the common practice of disease mapping using linkage disequilibrium in "isolated" local populations may therefore be subject to error.
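
For readers unfamiliar with the linkage disequilibrium measure mentioned above, the following is a minimal sketch of the standard two-locus definition, r^2 = D^2 / (p_A (1 - p_A) p_B (1 - p_B)) with D = p_AB - p_A p_B. The function name and argument names are illustrative assumptions, not notation taken from the paper, and the sketch computes the statistic from given frequencies only; it does not implement the paper's structured-population model.

```python
def r_squared(p_ab: float, p_a: float, p_b: float) -> float:
    """Standard r^2 between two biallelic loci.

    p_ab: frequency of the haplotype carrying allele A and allele B
    p_a:  frequency of allele A at the first locus
    p_b:  frequency of allele B at the second locus
    (All names are hypothetical, chosen for this illustration.)
    """
    denom = p_a * (1.0 - p_a) * p_b * (1.0 - p_b)
    if denom == 0.0:
        # r^2 is undefined when either locus is monomorphic.
        raise ValueError("both loci must be polymorphic")
    d = p_ab - p_a * p_b  # coefficient of linkage disequilibrium, D
    return d * d / denom


if __name__ == "__main__":
    # Complete association: allele A always occurs together with allele B.
    print(r_squared(0.5, 0.5, 0.5))
    # Independence: the haplotype frequency equals the product of allele frequencies.
    print(r_squared(0.25, 0.5, 0.5))
```

A value near 1 indicates that alleles at the two loci are almost perfectly associated, and a value near 0 indicates that they are close to independent.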
