Genome-Wide Analysis of Single Nucleotide Polymorphisms Uncovers Population Structure in Northern Europe

Background Genome-wide data provide a powerful tool for inferring patterns of genetic variation and structure of human populations. Principal Findings In this study, we analysed almost 250,000 SNPs from a total of 945 samples from Eastern and Western Finland, Sweden, Northern Germany and Great Britain complemented with HapMap data. Small but statistically significant differences were observed between the European populations (FST = 0.0040, p<10−4), also between Eastern and Western Finland (FST = 0.0032, p<10−3). The latter indicated the existence of a relatively strong autosomal substructure within the country, similar to that observed earlier with smaller numbers of markers. The Germans and British were less differentiated than the Swedes, Western Finns and especially the Eastern Finns who also showed other signs of genetic drift. This is likely caused by the later founding of the northern populations, together with subsequent founder and bottleneck effects, and a smaller population size. Furthermore, our data suggest a small eastern contribution among the Finns, consistent with the historical and linguistic background of the population. Significance Our results warn against a priori assumptions of homogeneity among Finns and other seemingly isolated populations. Thus, in association studies in such populations, additional caution for population structure may be necessary. Our results illustrate that population history is often important for patterns of genetic variation, and that the analysis of hundreds of thousands of SNPs provides high resolution also for population genetics.

[1]  C. Lindgren,et al.  Population Structure in Contemporary Sweden—A Y‐Chromosomal and Mitochondrial DNA Analysis , 2009, Annals of human genetics.

[2]  Amit R. Indap,et al.  A role for clonal inactivation in T cell tolerance to Mls-1a , 2008, Nature.

[3]  Amit R. Indap,et al.  Genes mirror geography within Europe , 2008, Nature.

[4]  Christian Gieger,et al.  Correlation between Genetic and Geographic Structure in Europe , 2008, Current Biology.

[5]  Leena Peltonen,et al.  Isolated populations and complex disease gene identification , 2008, Genome Biology.

[6]  Gilles Guillot,et al.  Population substructure in Finland and Sweden revealed by the use of spatial coordinates and a small number of unlinked autosomal SNPs , 2008, BMC Genetics.

[7]  T. Lappalainen,et al.  Migration Waves to the Baltic Sea Region , 2008, Annals of human genetics.

[8]  M. Feldman,et al.  Worldwide Human Relationships Inferred from Genome-Wide Patterns of Variation , 2008 .

[9]  Zachary A. Szpiech,et al.  Genotype, haplotype and copy-number variation in worldwide human populations , 2008, Nature.

[10]  Pablo Villoslada,et al.  Analysis and Application of European Genetic Substructure Using 300 K SNP Information , 2008, PLoS genetics.

[11]  David Reich,et al.  Discerning the Ancestry of European Americans in Genetic Association Studies , 2007, PLoS genetics.

[12]  K. Mossman The Wellcome Trust Case Control Consortium, U.K. , 2008 .

[13]  Zhaohui S. Qin,et al.  A second generation human haplotype map of over 3.1 million SNPs , 2007, Nature.

[14]  D. Holmberg,et al.  The genetic population structure of northern Sweden and its implications for mapping genetic diseases. , 2007, Hereditas.

[15]  Anne-Béatrice Dufour,et al.  The ade4 Package: Implementing the Duality Diagram for Ecologists , 2007 .

[16]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[17]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[18]  Mark D Shriver,et al.  Measuring European population stratification with microarray genotype data. , 2007, American journal of human genetics.

[19]  Laurent Excoffier,et al.  Arlequin (version 3.0): An integrated software package for population genetics data analysis , 2005, Evolutionary bioinformatics online.

[20]  Thomas Meitinger,et al.  SNP-Based Analysis of Genetic Substructure in the German Population , 2006, Human Heredity.

[21]  Pablo Villoslada,et al.  European Population Substructure: Clustering of Northern and Southern Populations , 2006, PLoS genetics.

[22]  T. Lappalainen,et al.  Regional differences among the Finns: a Y-chromosomal perspective. , 2006, Gene.

[23]  A. Götherström,et al.  Y-chromosome diversity in Sweden – A long-time perspective , 2006, European Journal of Human Genetics.

[24]  Chiara Sabatti,et al.  Magnitude and distribution of linkage disequilibrium in population isolates and implications for genome-wide association studies , 2006, Nature Genetics.

[25]  Michael Krawczak,et al.  PopGen: Population-Based Recruitment of Patients and Controls for the Analysis of Complex Genotype-Phenotype Relationships , 2006, Public Health Genomics.

[26]  Carlos D Bustamante,et al.  Ascertainment bias in studies of human genome-wide polymorphism. , 2005, Genome research.

[27]  D. Clayton,et al.  Population structure, differential bias and genomic control in a large-scale, case-control association study , 2005, Nature Genetics.

[28]  Ulrike Schmidt,et al.  Significant genetic differentiation between Poland and Germany follows present-day political borders, as revealed by Y-chromosome analysis , 2005, Human Genetics.

[29]  P. Visscher,et al.  Genome-wide linkage disequilibrium from 100,000 SNPs in the East Finland founder population. , 2005, Twin research and human genetics : the official journal of the International Society for Twin Studies.

[30]  Birgir Hrafnkelsson,et al.  An Icelandic example of the impact of population structure on association studies , 2005, Nature Genetics.

[31]  A. Sajantila,et al.  Analysis of 16 Y STR loci in the Finnish population reveals a local reduction in the diversity of male lineages. , 2004, Forensic science international.

[32]  P. Donnelly,et al.  The effects of human population structure on large genetic association studies , 2004, Nature Genetics.

[33]  S. Gabriel,et al.  Assessing the impact of population stratification on genetic association studies , 2004, Nature Genetics.

[34]  A. Siiriäinen,et al.  The Stone and Bronze Ages , 2003 .

[35]  M. Stephens,et al.  Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. , 2003, Genetics.

[36]  R. Norio Finnish Disease Heritage II: population prehistory and genetic roots of Finns , 2003, Human Genetics.

[37]  M. Feldman,et al.  The application of molecular genetic approaches to the study of human evolution , 2003, Nature Genetics.

[38]  R. Norio Finnish Disease Heritage I: characteristics, causes, background. , 2003, Human genetics.

[39]  J. Olesen,et al.  The Cambridge history of Scandinavia , 2003 .

[40]  T. Paunio,et al.  The interval of linkage disequilibrium (LD) detected with microsatellite and SNP markers in chromosomes of Finnish populations with different histories. , 2003, Human molecular genetics.

[41]  S. Pääbo,et al.  Extensive linkage disequilibrium in small human populations in Eurasia. , 2002, American journal of human genetics.

[42]  J. Kere,et al.  Human population genetics: lessons from Finland. , 2001, Annual review of genomics and human genetics.

[43]  A. Sajantila,et al.  Y chromosomal polymorphisms reveal founding lineages in the Finns and the Saami , 1999, European Journal of Human Genetics.

[44]  F. Wright,et al.  Linkage disequilibrium mapping in isolated populations: the example of Finland revisited. , 1998, Proceedings of the National Academy of Sciences of the United States of America.

[45]  L. Peltonen,et al.  Dual origins of Finns revealed by Y chromosome haplotype variation. , 1998, American journal of human genetics.

[46]  L. Peltonen,et al.  The genetic relationship between the Finns and the Finnish Saami (Lapps): analysis of nuclear DNA and mtDNA. , 1996, American journal of human genetics.

[47]  R. Cann The history and geography of human genes , 1995, The Journal of Asian Studies.

[48]  A. Piazza History and Geography of Human Genes , 1994 .

[49]  P. Sistonen,et al.  A population genetic study in Finland: comparison of the Finnish- and Swedish-speaking populations. , 1991, Human heredity.

[50]  J. H. Mielke,et al.  The genetic structure of finland. , 1976, American journal of physical anthropology.

[51]  H. Nevanlinna The Finnish population structure. A genetic and genealogical study. , 2009, Hereditas.

[52]  R. Dickinson The regions of Germany , 1946 .

[53]  W. Fitzgerald The Regions of Germany , 1945, Nature.