Correlation between Genetic and Geographic Structure in Europe

Understanding the genetic structure of the European population is important, not only from a historical perspective, but also for the appropriate design and interpretation of genetic epidemiological studies. Previous population genetic analyses with autosomal markers in Europe either had a wide geographic but narrow genomic coverage [1, 2], or vice versa [3-6]. We therefore investigated Affymetrix GeneChip 500K genotype data from 2,514 individuals belonging to 23 different subpopulations, widely spread over Europe. Although we found only a low level of genetic differentiation between subpopulations, the existing differences were characterized by a strong continent-wide correlation between geographic and genetic distance. Furthermore, mean heterozygosity was larger, and mean linkage disequilibrium smaller, in southern as compared to northern Europe. Both parameters clearly showed a clinal distribution that provided evidence for a spatial continuity of genetic diversity in Europe. Our comprehensive genetic data are thus compatible with expectations based upon European population history, including the hypotheses of a south-north expansion and/or a larger effective population size in southern than in northern Europe. By including the widely used CEPH from Utah (CEU) samples into our analysis, we could show that these individuals represent northern and western Europeans reasonably well, thereby confirming their assumed regional ancestry.

[1]  J. Pritchard,et al.  A Map of Recent Positive Selection in the Human Genome , 2006, PLoS biology.

[2]  R J Mitchell,et al.  Y-chromosomal diversity in Europe is clinal and influenced primarily by geography, rather than by language. , 2000, American journal of human genetics.

[3]  Mark D Shriver,et al.  Measuring European population stratification with microarray genotype data. , 2007, American journal of human genetics.

[4]  R. Cann The history and geography of human genes , 1995, The Journal of Asian Studies.

[5]  Bernard W. Silverman,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[6]  Lounès Chikhi,et al.  Y genetic data support the Neolithic demic diffusion model , 2002, Proceedings of the National Academy of Sciences of the United States of America.

[7]  Ulrike Schmidt,et al.  Significant genetic differentiation between Poland and Germany follows present-day political borders, as revealed by Y-chromosome analysis , 2005, Human Genetics.

[8]  R. Ward,et al.  Informativeness of genetic markers for inference of ancestry. , 2003, American journal of human genetics.

[9]  L. Excoffier,et al.  Analysis of molecular variance inferred from metric distances among DNA haplotypes: application to human mitochondrial DNA restriction data. , 1992, Genetics.

[10]  A. Hofman,et al.  The Rotterdam Study: objectives and design update , 2007, European Journal of Epidemiology.

[11]  D. Schaid,et al.  Exact tests of Hardy-Weinberg equilibrium and homogeneity of disequilibrium across strata. , 2006, American journal of human genetics.

[12]  Clément Calenge,et al.  The package “adehabitat” for the R software: A tool for the analysis of space and habitat use by animals , 2006 .

[13]  R R Sokal,et al.  Spatial patterns of human gene frequencies in Europe. , 1989, American journal of physical anthropology.

[14]  Zachary A. Szpiech,et al.  Genotype, haplotype and copy-number variation in worldwide human populations , 2008, Nature.

[15]  G. Barbujani,et al.  Origins and evolution of the Europeans' genome: evidence from multiple microsatellite loci , 2006, Proceedings of the Royal Society B: Biological Sciences.

[16]  Pablo Villoslada,et al.  European Population Substructure: Clustering of Northern and Southern Populations , 2006, PLoS genetics.

[17]  B. Weir,et al.  ESTIMATING F‐STATISTICS FOR THE ANALYSIS OF POPULATION STRUCTURE , 1984, Evolution; international journal of organic evolution.

[18]  S. Pääbo,et al.  Paternal and maternal DNA lineages reveal a bottleneck in the founding of the Finnish population. , 1996, Proceedings of the National Academy of Sciences of the United States of America.

[19]  C. Tyler-Smith,et al.  Signature of recent historical events in the European Y-chromosomal STR haplotype distribution , 2005, Human Genetics.

[20]  Monique M. B. Breteler,et al.  The Rotterdam Study: 2016 objectives and design update , 2015, European Journal of Epidemiology.

[21]  R R Sokal,et al.  Genetic structure of human populations in the British Isles. , 1993, Annals of human biology.

[22]  A Coppa,et al.  A signal, from human mtDNA, of postglacial recolonization in Europe. , 2001, American journal of human genetics.

[23]  P. J. Green,et al.  Density Estimation for Statistics and Data Analysis , 1987 .

[24]  A. Tagliabracci,et al.  Y-chromosome genetic structure in sub-Apennine populations of Central Italy by SNP and STR analysis , 2007, International Journal of Legal Medicine.

[25]  A. Götherström,et al.  Y-chromosome diversity in Sweden – A long-time perspective , 2006, European Journal of Human Genetics.

[26]  C. Triantaphyllidis,et al.  Genetic studies in 5 Greek population samples using 12 highly polymorphic DNA loci. , 1999, Human biology.

[27]  David Reich,et al.  Discerning the Ancestry of European Americans in Genetic Association Studies , 2007, PLoS genetics.

[28]  A. Hofman,et al.  Determinants of disease and disability in the elderly: The Rotterdam elderly study , 1991, European Journal of Epidemiology.

[29]  H. Cann,et al.  Centre d'etude du polymorphisme humain (CEPH): collaborative genetic mapping of the human genome. , 1990, Genomics.

[30]  P. Elliott,et al.  Genome-wide scan identifies variation in MLXIPL associated with plasma triglycerides , 2008, Nature Genetics.

[31]  J. Bertranpetit,et al.  Geographic patterns of mtDNA diversity in Europe. , 2000, American journal of human genetics.

[32]  M. Jobling,et al.  Homogeneity and distinctiveness of Polish paternal lineages revealed by Y chromosome microsatellite haplotype analysis , 2002, Human Genetics.

[33]  E. Heyer,et al.  Geographic Patterns of (Genetic, Morphologic, Linguistic) Variation: How Barriers Can Be Detected by Using Monmonier's Algorithm , 2004, Human biology.

[34]  Chiara Sabatti,et al.  Homozygosity and linkage disequilibrium. , 2002, Genetics.

[35]  D. F. Roberts,et al.  The History and Geography of Human Genes , 1996 .

[36]  Michael Krawczak,et al.  PopGen: Population-Based Recruitment of Patients and Controls for the Analysis of Complex Genotype-Phenotype Relationships , 2006, Public Health Genomics.

[37]  D. Comas,et al.  Joining the Pillars of Hercules: mtDNA Sequences Show Multidirectional Gene Flow in the Western Mediterranean , 2003, Annals of human genetics.

[38]  D. Maraganore,et al.  Reliability of self-reported ancestry among siblings: implications for genetic association studies. , 2006, American journal of epidemiology.

[39]  D. Strachan,et al.  LDL-cholesterol concentrations: a genome-wide association study , 2008, The Lancet.

[40]  Johan T den Dunnen,et al.  Three genome-wide association studies and a linkage analysis identify HERC2 as a human iris color gene. , 2008, American journal of human genetics.

[41]  N. Mantel The detection of disease clustering and a generalized regression approach. , 1967, Cancer research.

[42]  C. Meisinger,et al.  The MONICA Augsburg surveys--basis for prospective cohort studies. , 2005, Gesundheitswesen (Bundesverband der Arzte des Offentlichen Gesundheitsdienstes (Germany)).

[43]  Pablo Villoslada,et al.  Analysis and Application of European Genetic Substructure Using 300 K SNP Information , 2008, PLoS genetics.

[44]  Clive E. Bowman,et al.  Genome-wide approaches to identify pharmacogenetic contributions to adverse drug reactions , 2009, The Pharmacogenomics Journal.

[45]  D. Reich,et al.  Population Structure and Eigenanalysis , 2006, PLoS genetics.