Identification and Analysis of Genomic Regions with Large Between‐Population Differentiation in Humans

The primary aim of genetic association and linkage studies is to identify genetic variants that contribute to phenotypic variation within human populations. Because the overwhelming majority of human genetic variation is found within populations, these methods are expected to be effective and can likely be extrapolated from one human population to another. However, they may lack power to detect the genetic variants that contribute to phenotypes that differ greatly between human populations. Phenotypes that show large differences between populations are expected to be associated with genomic regions exhibiting large allele frequency differences between populations. Thus, genomic regions with large allele frequency differences between populations can be identified from genome‐wide polymorphism data and evaluated as candidates for large between‐population phenotypic differences. Here we use allele frequency data from ∼1.5 million SNPs in three human populations and present an algorithm that identifies genomic regions containing SNPs with extreme Fst. We demonstrate that our candidate regions have reduced heterozygosity in Europeans and Chinese relative to African‐Americans, and are likely enriched for genes that have experienced positive natural selection. We identify genes that are likely responsible for phenotypes known to differ dramatically between human populations and present several candidates worthy of future investigation. Our list of high‐Fst genomic regions is a first step in identifying the genetic variants that contribute to large phenotypic differences between populations, many of which have likely experienced positive natural selection. Our approach, based on between‐population differences, can complement traditional within‐population linkage and association studies to uncover novel genotype‐phenotype relationships.
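As a rough illustration of the kind of scan described above, the following Python sketch computes a per-SNP Fst from population allele frequencies and groups extreme-Fst SNPs into regions. It is not the algorithm presented in the paper, whose details are not given in this abstract. The estimator (Wright's Fst with equal population weights), the empirical quantile cutoff, the `max_gap` merging distance, and the function names `fst_biallelic` and `extreme_regions` are all assumptions made for this example.

```python
from typing import List, Sequence, Tuple


def fst_biallelic(freqs: Sequence[float]) -> float:
    """Fst for one biallelic SNP, computed as (H_T - H_S) / H_T.

    freqs holds the frequency of one allele in each population.
    Populations are weighted equally (an assumption of this sketch).
    """
    k = len(freqs)
    p_bar = sum(freqs) / k
    h_t = 2.0 * p_bar * (1.0 - p_bar)  # expected heterozygosity, pooled
    if h_t == 0.0:
        return 0.0  # monomorphic everywhere: no differentiation to measure
    h_s = sum(2.0 * p * (1.0 - p) for p in freqs) / k  # mean within-population
    return (h_t - h_s) / h_t


def extreme_regions(
    snps: Sequence[Tuple[int, float]],
    quantile: float = 0.99,
    max_gap: int = 100_000,
) -> List[Tuple[int, int]]:
    """Group extreme-Fst SNPs on one chromosome into (start, end) regions.

    snps is a sequence of (position, fst) pairs. A SNP is "extreme" if its
    Fst is at or above the empirical quantile cutoff. Extreme SNPs no more
    than max_gap base pairs apart are merged (both hypothetical parameters).
    """
    if not snps:
        return []
    ranked = sorted(f for _, f in snps)
    cutoff = ranked[min(len(ranked) - 1, int(quantile * len(ranked)))]
    hits = sorted(pos for pos, f in snps if f >= cutoff)

    regions: List[Tuple[int, int]] = []
    for pos in hits:
        if regions and pos - regions[-1][1] <= max_gap:
            regions[-1] = (regions[-1][0], pos)  # extend the current region
        else:
            regions.append((pos, pos))  # start a new region
    return regions
```

With per-SNP frequencies from three populations, one would call `fst_biallelic` on each frequency triple and pass the resulting `(position, fst)` pairs for a chromosome to `extreme_regions`. An empirical cutoff of this kind flags outliers relative to the genome-wide distribution; it does not by itself establish that a region has experienced selection.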
