European Population Substructure: Clustering of Northern and Southern Populations

Using a genome-wide single nucleotide polymorphism (SNP) panel, we observed population structure in a diverse group of Europeans and European Americans. Under a variety of conditions and tests, there is a consistent and reproducible distinction between “northern” and “southern” European population groups: most individual participants with southern European ancestry (Italian, Spanish, Portuguese, and Greek) have >85% membership in the “southern” population; and most northern, western, eastern, and central Europeans have >90% in the “northern” population group. Ashkenazi Jewish as well as Sephardic Jewish origin also showed >85% membership in the “southern” population, consistent with a later Mediterranean origin of these ethnic groups. Based on this work, we have developed a core set of informative SNP markers that can control for this partition in European population structure in a variety of clinical and genetic studies.

[1]  J. Weber,et al.  Markers that discriminate between European and African ancestry show limited variation within Africa , 2002, Human Genetics.

[2]  N. Risch,et al.  Assessing genetic contributions to phenotypic differences among 'racial' and 'ethnic' groups , 2004, Nature Genetics.

[3]  Steven J. Schrodi,et al.  A missense single-nucleotide polymorphism in a gene encoding a protein tyrosine phosphatase (PTPN22) is associated with rheumatoid arthritis. , 2004, American journal of human genetics.

[4]  R. Ward,et al.  Informativeness of genetic markers for inference of ancestry. , 2003, American journal of human genetics.

[5]  Annette Lee,et al.  The PTPN22 R620W polymorphism associates with RF positive rheumatoid arthritis in a dose-dependent manner but not with HLA-SE status , 2005, Genes and Immunity.

[6]  Daniel Rabinowitz,et al.  A Unified Approach to Adjusting Association Tests for Population Admixture with Arbitrary Pedigree Structure and Arbitrary Missing Marker Information , 2000, Human Heredity.

[7]  L. Chikhi,et al.  DNAs from the European Neolithic , 2006, Heredity.

[8]  J. Belmont,et al.  Mexican American ancestry-informative markers: examination of population structure and marker characteristics in European Americans, Mexican Americans, Amerindians and Asians , 2004, Human Genetics.

[9]  Alberto Piazza,et al.  The History and Geography of Human Genes: Abridged paperback Edition , 1996 .

[10]  M. Shriver,et al.  Ancestral proportions and their association with skin pigmentation and bone mineral density in Puerto Rican women from New York city , 2004, Human Genetics.

[11]  S. Gabriel,et al.  The Structure of Haplotype Blocks in the Human Genome , 2002, Science.

[12]  J. Long,et al.  Information on ancestry from genetic markers , 2004, Genetic epidemiology.

[13]  Alice B. Kehoe Archaeology and Language: The Puzzle of Indo-European Origins , 1989, American Antiquity.

[14]  Guido Barbujani,et al.  Y chromosomal haplogroup J as a signature of the post-neolithic colonization of Europe , 2004, Human Genetics.

[15]  M. Feldman,et al.  Genetic Structure of Human Populations , 2002, Science.

[16]  Pardis C Sabeti,et al.  Linkage disequilibrium in the human genome , 2001, Nature.

[17]  L. Chikhi,et al.  Population genetics: DNAs from the European Neolithic. , 2006, Heredity.

[18]  Anne Cambon-Thomsen,et al.  Phylogeography of Y-chromosome haplogroup I reveals distinct domains of prehistoric gene flow in europe. , 2004, American journal of human genetics.

[19]  H. Bandelt,et al.  Paleolithic and neolithic lineages in the European mitochondrial gene pool. , 1996, American journal of human genetics.

[20]  J. Bach,et al.  The effect of infections on susceptibility to autoimmune and allergic diseases. , 2002, The New England journal of medicine.

[21]  B. Guinand Use of a multivariate model using allele frequency distributions to analyse patterns of genetic differentiation among populations , 1996 .

[22]  P. Donnelly,et al.  Inference of population structure using multilocus genotype data. , 2000, Genetics.

[23]  Mark D Shriver,et al.  Control of confounding of genetic associations in stratified populations. , 2003, American journal of human genetics.

[24]  P. Underhill,et al.  Origin, diffusion, and differentiation of Y-chromosome haplogroups E and J: inferences on the neolithization of Europe and later migratory events in the Mediterranean area. , 2004, American journal of human genetics.

[25]  G. Barbujani,et al.  Origins and evolution of the Europeans' genome: evidence from multiple microsatellite loci , 2006, Proceedings of the Royal Society B: Biological Sciences.

[26]  J. Cornuet,et al.  Analytical bayesian approach for assigning individuals to populations. , 2004, The Journal of heredity.

[27]  G A Satten,et al.  Accounting for unmeasured population substructure in case-control studies of genetic association using a novel latent-class model. , 2001, American journal of human genetics.

[28]  K. Roeder,et al.  Genomic Control for Association Studies , 1999, Biometrics.

[29]  R. Sokal,et al.  Historical Population Movements in Europe Influence Genetic Relationships in Modern Samples , 2012, Human biology.

[30]  Shuichi Matsumura,et al.  Ancient DNA from the First European Farmers in 7500-Year-Old Neolithic Sites , 1975, Science.

[31]  B. Sykes,et al.  Phylogeography of mitochondrial DNA in western Europe , 1998, Annals of human genetics.

[32]  Hongzhe Li,et al.  Examination of ancestry and ethnic affiliation using highly informative diallelic DNA markers: application to diverse and admixed populations and implications for clinical epidemiology and forensic medicine , 2005, Human Genetics.

[33]  J. Pritchard,et al.  A Map of Recent Positive Selection in the Human Genome , 2006, PLoS biology.

[34]  R J Mitchell,et al.  Y-chromosomal diversity in Europe is clinal and influenced primarily by geography, rather than by language. , 2000, American journal of human genetics.

[35]  B. Cunliffe The Oxford illustrated prehistory of Europe , 1994 .

[36]  M. Zeviani,et al.  The molecular dissection of mtDNA haplogroup H confirms that the Franco-Cantabrian glacial refuge was a major source for the European gene pool. , 2004, American journal of human genetics.

[37]  G. Bertorelle,et al.  Genetics and the population history of Europe. , 2001, Proceedings of the National Academy of Sciences of the United States of America.

[38]  M. Stephens,et al.  Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. , 2003, Genetics.

[39]  B. Rannala,et al.  Detecting immigration by using multilocus genotypes. , 1997, Proceedings of the National Academy of Sciences of the United States of America.

[40]  Robert R. Sokal,et al.  Genetic evidence for the spread of agriculture in Europe by demic diffusion , 1991, Nature.

[41]  Stephen B. Johnson,et al.  The New York cancer project: Rationale, organization, design, and baseline characteristics , 2004, Journal of Urban Health.

[42]  N. Risch,et al.  The importance of race and ethnic background in biomedical research and clinical practice. , 2003, The New England journal of medicine.

[43]  L. Cavalli-Sforza,et al.  Demic expansions and human evolution , 1993, Science.

[44]  D. F. Roberts,et al.  The History and Geography of Human Genes , 1996 .

[45]  J. Tuomilehto,et al.  Finnish case–control and family studies support PTPN22 R620W polymorphism as a risk factor in rheumatoid arthritis, but suggest only minimal or no effect in juvenile idiopathic arthritis , 2005, Genes and Immunity.

[46]  S. Milisauskas European Prehistory. A Survey , 1978 .

[47]  Camara P Jones,et al.  Invited commentary: "race," racism, and the practice of epidemiology. , 2001, American journal of epidemiology.

[48]  P. Gregersen,et al.  PTPN22 and rheumatoid arthritis: gratifying replication. , 2005, Arthritis and rheumatism.

[49]  C. Scarre Exploring Prehistoric Europe , 1998 .

[50]  J. Cornuet,et al.  GENECLASS2: a software for genetic assignment and first-generation migrant detection. , 2004, The Journal of heredity.

[51]  K. Konvička,et al.  Matching strategies for genetic association studies in structured populations. , 2004, American journal of human genetics.

[52]  P. Donnelly,et al.  Association mapping in structured populations. , 2000, American journal of human genetics.

[53]  D. Behar,et al.  High-resolution mtDNA evidence for the late-glacial resettlement of Europe from an Iberian refugium. , 2005, Genome research.

[54]  R. Cooper,et al.  Race and genomics. , 2003, The New England journal of medicine.

[55]  Elizabeth L. Ogburn,et al.  Demonstrating stratification in a European American population , 2005, Nature Genetics.

[56]  Pardis C Sabeti,et al.  Genetic signatures of strong recent positive selection at the lactase gene. , 2004, American journal of human genetics.

[57]  Li Jin,et al.  Skin pigmentation, biogeographical ancestry and admixture mapping , 2003, Human Genetics.

[58]  B. Weir,et al.  ESTIMATING F‐STATISTICS FOR THE ANALYSIS OF POPULATION STRUCTURE , 1984, Evolution; international journal of organic evolution.

[59]  K J Dawson,et al.  A Bayesian approach to the identification of panmictic populations and the assignment of individuals. , 2001, Genetical research.

[60]  P. Menozzi,et al.  Synthetic maps of human gene frequencies in Europeans. , 1978, Science.

[61]  S. Gabriel,et al.  Enhancing linkage analysis of complex disorders: an evaluation of high-density genotyping. , 2004, Human molecular genetics.

[62]  Xiangli Xiao,et al.  Screening the genome for rheumatoid arthritis susceptibility genes: a replication study and combined analysis of 512 multicase families. , 2003, Arthritis and rheumatism.