Genetic Similarities Within and Between Human Populations

The proportion of human genetic variation due to differences between populations is modest, and individuals from different populations can be genetically more similar than individuals from the same population. Yet sufficient genetic data can permit accurate classification of individuals into populations. Both findings can be obtained from the same data set, using the same number of polymorphic loci. This article explains why. Our analysis focuses on the frequency, ω, with which a pair of random individuals from two different populations is genetically more similar than a pair of individuals randomly selected from any single population. We compare ω to the error rates of several classification methods, using data sets that vary in number of loci, average allele frequency, populations sampled, and polymorphism ascertainment strategy. We demonstrate that classification methods achieve higher discriminatory power than ω because of their use of aggregate properties of populations. The number of loci analyzed is the most critical variable: with 100 polymorphisms, accurate classification is possible, but ω remains sizable, even when using populations as distinct as sub-Saharan Africans and Europeans. Phenotypes controlled by a dozen or fewer loci can therefore be expected to show substantial overlap between human populations. This provides empirical justification for caution when using population labels in biomedical settings, with broad implications for personalized medicine, pharmacogenetics, and the meaning of race.
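The contrast between ω and classification error can be illustrated with a small simulation. The following is a minimal sketch under stated assumptions, not the analysis used in the article: the allele frequencies, the fixed per-locus frequency shift of 0.15, the Manhattan genotype distance, the nearest-centroid classifier, and all function names are hypothetical choices made for illustration only.

```python
import random

def simulate_population(freqs, n, rng):
    """Draw n diploid genotypes; each locus holds 0, 1, or 2 copies of an allele."""
    return [[(rng.random() < p) + (rng.random() < p) for p in freqs]
            for _ in range(n)]

def distance(a, b):
    """Manhattan distance between two genotype vectors (an assumed similarity measure)."""
    return sum(abs(x - y) for x, y in zip(a, b))

def estimate_omega(pop1, pop2, trials, rng):
    """Fraction of trials in which a between-population pair is strictly closer
    than a within-population pair. Ties count as 'not more similar' (assumption)."""
    hits = 0
    for _ in range(trials):
        between = distance(rng.choice(pop1), rng.choice(pop2))
        a, b = rng.sample(rng.choice((pop1, pop2)), 2)
        if between < distance(a, b):
            hits += 1
    return hits / trials

def centroid(pop):
    """Mean genotype per locus: an aggregate property of the population."""
    return [sum(col) / len(pop) for col in zip(*pop)]

def sq_dist(ind, center):
    return sum((x - c) ** 2 for x, c in zip(ind, center))

def centroid_error(train1, train2, test1, test2):
    """Misclassification rate when each test individual is assigned to the nearer centroid."""
    c1, c2 = centroid(train1), centroid(train2)
    wrong = sum(sq_dist(ind, c1) >= sq_dist(ind, c2) for ind in test1)
    wrong += sum(sq_dist(ind, c2) >= sq_dist(ind, c1) for ind in test2)
    return wrong / (len(test1) + len(test2))

rng = random.Random(0)
n_loci = 100
# Hypothetical allele frequencies: population 2 differs by 0.15 at every locus.
freqs1 = [rng.uniform(0.2, 0.8) for _ in range(n_loci)]
freqs2 = [p + rng.choice((-0.15, 0.15)) for p in freqs1]

train1 = simulate_population(freqs1, 200, rng)
train2 = simulate_population(freqs2, 200, rng)
test1 = simulate_population(freqs1, 200, rng)
test2 = simulate_population(freqs2, 200, rng)

omega = estimate_omega(train1, train2, 2000, rng)
error_rate = centroid_error(train1, train2, test1, test2)
print(f"omega = {omega:.3f}, centroid classification error = {error_rate:.3f}")
```

In this toy setting the classifier compares each individual with population means, so the noise in any single individual's genotype is averaged out, whereas ω compares individuals pairwise and keeps that noise on both sides. Varying `n_loci` shows how each quantity responds to the number of loci.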
