Swedish Population Substructure Revealed by Genome-Wide Single Nucleotide Polymorphism Data

The use of genome-wide single nucleotide polymorphism (SNP) data has recently proven useful in the study of human population structure. We have studied the internal genetic structure of the Swedish population using more than 350,000 SNPs from 1525 Swedes from all over the country genotyped on the Illumina HumanHap550 array. We have also compared them to 3212 worldwide reference samples, including Finns, northern Germans, British and Russians, based on the more than 29,000 SNPs that overlap between the Illumina and Affymetrix 250K Sty arrays. The Swedes - especially southern Swedes - were genetically close to the Germans and British, while their genetic distance to Finns was substantially longer. The overall structure within Sweden appeared clinal, and the substructure in the southern and middle parts was subtle. In contrast, the northern part of Sweden, Norrland, exhibited pronounced genetic differences both within the area and relative to the rest of the country. These distinctive genetic features of Norrland probably result mainly from isolation by distance and genetic drift caused by low population density. The internal structure within Sweden (FST = 0.0005 between provinces) was stronger than that in many Central European populations, although smaller than what has been observed for instance in Finland; importantly, it is of the magnitude that may hamper association studies with a moderate number of markers if cases and controls are not properly matched geographically. Overall, our results underline the potential of genome-wide data in analyzing substructure in populations that might otherwise appear relatively homogeneous, such as the Swedes.

[1]  G. Kirov,et al.  Population structure and genome-wide patterns of variation in Ireland and Britain , 2010, European Journal of Human Genetics.

[2]  Michael Krawczak,et al.  PopGen: Population-Based Recruitment of Patients and Controls for the Analysis of Complex Genotype-Phenotype Relationships , 2006, Public Health Genomics.

[3]  Anushya Muruganujan,et al.  Applications for protein sequence–function evolution data: mRNA/protein expression analysis and coding SNP scoring tools , 2006, Nucleic Acids Res..

[4]  Laurent Excoffier,et al.  Arlequin (version 3.0): An integrated software package for population genetics data analysis , 2005, Evolutionary bioinformatics online.

[5]  Thomas Lindkvist Kings and provinces in Sweden , 2003 .

[6]  K. Mossman The Wellcome Trust Case Control Consortium, U.K. , 2008 .

[7]  Mark George Thomas,et al.  Ancient DNA Reveals Lack of Continuity between Neolithic Hunter-Gatherers and Contemporary Scandinavians , 2009, Current Biology.

[8]  J. Olesen,et al.  The Cambridge history of Scandinavia , 2003 .

[9]  J. Palmgren,et al.  Linkage Disequilibrium Mapping of CHEK2: Common Variation and Breast Cancer Risk , 2006, PLoS medicine.

[10]  H. Lindqvist A History of Sweden , 2006 .

[11]  Yurii S. Aulchenko,et al.  BIOINFORMATICS APPLICATIONS NOTE doi:10.1093/bioinformatics/btm108 Genetics and population analysis GenABEL: an R library for genome-wide association analysis , 2022 .

[12]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[13]  E. Heyer,et al.  Geographic Patterns of (Genetic, Morphologic, Linguistic) Variation: How Barriers Can Be Detected by Using Monmonier's Algorithm , 2004, Human biology.

[14]  Gilles Guillot,et al.  Population substructure in Finland and Sweden revealed by the use of spatial coordinates and a small number of unlinked autosomal SNPs , 2008, BMC Genetics.

[15]  G. Evanno,et al.  Detecting the number of clusters of individuals using the software structure: a simulation study , 2005, Molecular ecology.

[16]  C. Lindgren,et al.  Population Structure in Contemporary Sweden—A Y‐Chromosomal and Mitochondrial DNA Analysis , 2009, Annals of human genetics.

[17]  Michael C. Hout,et al.  Multidimensional Scaling , 2003, Encyclopedic Dictionary of Archaeology.

[18]  Stefan Schreiber,et al.  Genome-Wide Analysis of Single Nucleotide Polymorphisms Uncovers Population Structure in Northern Europe , 2008, PloS one.

[19]  M. Stephens,et al.  Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. , 2003, Genetics.

[20]  D. Yoerger,et al.  Iron Age , 2002 .

[21]  M. Feldman,et al.  Worldwide Human Relationships Inferred from Genome-Wide Patterns of Variation , 2008 .

[22]  Sharon R Grossman,et al.  Integrating common and rare genetic variation in diverse human populations , 2010, Nature.

[23]  A. Piazza History and Geography of Human Genes , 1994 .

[24]  D. Clayton,et al.  Population structure, differential bias and genomic control in a large-scale, case-control association study , 2005, Nature Genetics.

[25]  Andrew Collins,et al.  The genome-wide patterns of variation expose significant substructure in a founder population. , 2008, American journal of human genetics.

[26]  P. Visscher,et al.  Geographical structure and differential natural selection among North European populations. , 2009, Genome research.

[27]  R. Cann The history and geography of human genes , 1995, The Journal of Asian Studies.

[28]  R. Mägi,et al.  Genetic Structure of Europeans: A View from the North–East , 2009, PloS one.

[29]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[30]  M. Campbell,et al.  PANTHER: a library of protein families and subfamilies indexed by function. , 2003, Genome research.

[31]  Thomas Meitinger,et al.  SNP-Based Analysis of Genetic Substructure in the German Population , 2006, Human Heredity.

[32]  Amit R. Indap,et al.  Genes mirror geography within Europe , 2008, Nature.

[33]  F. Collins,et al.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits , 2009, Proceedings of the National Academy of Sciences.

[34]  T. Lappalainen,et al.  Migration Waves to the Baltic Sea Region , 2008, Annals of human genetics.

[35]  Amit R. Indap,et al.  A role for clonal inactivation in T cell tolerance to Mls-1a , 2008, Nature.

[36]  Philippa Marrack,et al.  A role for clonal inactivation in T cell tolerance to Mls-1a , 2008, Nature.

[37]  A. Götherström,et al.  Y-chromosome diversity in Sweden – A long-time perspective , 2006, European Journal of Human Genetics.

[38]  S. Heath,et al.  Investigation of the fine structure of European populations with applications to disease association studies , 2008, European Journal of Human Genetics.

[39]  Anne-Béatrice Dufour,et al.  The ade4 Package: Implementing the Duality Diagram for Ecologists , 2007 .

[40]  A. Siiriäinen,et al.  The Stone and Bronze Ages , 2003 .

[41]  P. Sistonen,et al.  A population genetic study in Finland: comparison of the Finnish- and Swedish-speaking populations. , 1991, Human heredity.

[42]  D. Holmberg,et al.  The genetic population structure of northern Sweden and its implications for mapping genetic diseases. , 2007, Hereditas.

[43]  Christian Gieger,et al.  Correlation between Genetic and Geographic Structure in Europe , 2008, Current Biology.