Ancestry informative marker sets for determining continental origin and admixture proportions in common populations in America

To provide a resource for assessing continental ancestry in a wide variety of genetic studies, we identified, validated, and characterized a set of 128 ancestry informative markers (AIMs). The markers were chosen for informativeness, genome‐wide distribution, and genotype reproducibility on two platforms (TaqMan® assays and Illumina arrays). We analyzed genotyping data from 825 subjects with diverse ancestry, including European, East Asian, Amerindian, African, South Asian, Mexican, and Puerto Rican. A comprehensive set of 128 AIMs and subsets as small as 24 AIMs are shown to be useful tools for ascertaining the origin of subjects from particular continents, and to correct for population stratification in admixed population sample sets. Our findings provide general guidelines for the application of specific AIM subsets as a resource for wide application. We conclude that investigators can use TaqMan assays for the selected AIMs as a simple and cost efficient tool to control for differences in continental ancestry when conducting association studies in ethnically diverse populations. Hum Mutat 0,1–10, 2008. © 2008 Wiley‐Liss, Inc.

[1]  P. Donnelly,et al.  The effects of human population structure on large genetic association studies , 2004, Nature Genetics.

[2]  Keith C. Cheng,et al.  SLC24A5, a Putative Cation Exchanger, Affects Pigmentation in Zebrafish and Humans , 2005, Science.

[3]  Annette Lee,et al.  A genomewide single-nucleotide-polymorphism panel for Mexican American admixture mapping. , 2007, American journal of human genetics.

[4]  Mark D Shriver,et al.  Measuring European population stratification with microarray genotype data. , 2007, American journal of human genetics.

[5]  Pardis C Sabeti,et al.  Genetic signatures of strong recent positive selection at the lactase gene. , 2004, American journal of human genetics.

[6]  G A Satten,et al.  Accounting for unmeasured population substructure in case-control studies of genetic association using a novel latent-class model. , 2001, American journal of human genetics.

[7]  S. Gabriel,et al.  Assessing the impact of population stratification on genetic association studies , 2004, Nature Genetics.

[8]  M. Olivier A haplotype map of the human genome , 2003, Nature.

[9]  M. Olivier A haplotype map of the human genome. , 2003, Nature.

[10]  M. Feldman,et al.  Genetic Structure of Human Populations , 2002, Science.

[11]  D. Ballinger,et al.  A genomewide single-nucleotide-polymorphism panel with high ancestry information for African American admixture mapping. , 2006, American journal of human genetics.

[12]  Scott M. Williams,et al.  A high-density admixture map for disease gene discovery in african americans. , 2004, American journal of human genetics.

[13]  Pablo Villoslada,et al.  Analysis and Application of European Genetic Substructure Using 300 K SNP Information , 2008, PLoS genetics.

[14]  M. Feldman,et al.  Clines, Clusters, and the Effect of Study Design on the Inference of Human Population Structure , 2005, PLoS genetics.

[15]  B. Weir,et al.  ESTIMATING F‐STATISTICS FOR THE ANALYSIS OF POPULATION STRUCTURE , 1984, Evolution; international journal of organic evolution.

[16]  R. Ward,et al.  Informativeness of genetic markers for inference of ancestry. , 2003, American journal of human genetics.

[17]  Hongzhe Li,et al.  Examination of ancestry and ethnic affiliation using highly informative diallelic DNA markers: application to diverse and admixed populations and implications for clinical epidemiology and forensic medicine , 2005, Human Genetics.

[18]  C. Hoggart,et al.  Design and analysis of admixture mapping studies. , 2004, American journal of human genetics.

[19]  Birgir Hrafnkelsson,et al.  An Icelandic example of the impact of population structure on association studies , 2005, Nature Genetics.

[20]  N. Risch,et al.  Genetic admixture and asthma‐related phenotypes in Mexican American and Puerto Rican asthmatics , 2005, Genetic epidemiology.

[21]  D. Clayton,et al.  Population structure, differential bias and genomic control in a large-scale, case-control association study , 2005, Nature Genetics.

[22]  D. Reich,et al.  Principal components analysis corrects for stratification in genome-wide association studies , 2006, Nature Genetics.

[23]  P. Donnelly,et al.  Association mapping in structured populations. , 2000, American journal of human genetics.

[24]  D. Gudbjartsson,et al.  A high-resolution recombination map of the human genome , 2002, Nature Genetics.

[25]  K. Roeder,et al.  Genomic Control for Association Studies , 1999, Biometrics.

[26]  P. Donnelly,et al.  Inference of population structure using multilocus genotype data. , 2000, Genetics.

[27]  Mark D Shriver,et al.  Control of confounding of genetic associations in stratified populations. , 2003, American journal of human genetics.

[28]  Kenneth K Kidd,et al.  Evidence of positive selection on a class I ADH locus. , 2007, American journal of human genetics.

[29]  A. Di Rienzo,et al.  Detection of the signature of natural selection in humans: evidence from the Duffy blood group locus. , 2000, American journal of human genetics.

[30]  Holly M. Mortensen,et al.  Convergent adaptation of human lactase persistence in Africa and Europe , 2007, Nature Genetics.

[31]  R. Kittles,et al.  Implications of correlations between skin color and genetic ancestry for biomedical research , 2004, Nature Genetics.

[32]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[33]  M. Stephens,et al.  Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. , 2003, Genetics.

[34]  Stephen B. Johnson,et al.  The New York cancer project: Rationale, organization, design, and baseline characteristics , 2004, Journal of Urban Health.

[35]  Pablo Villoslada,et al.  European Population Substructure: Clustering of Northern and Southern Populations , 2006, PLoS genetics.

[36]  Michael P Epstein,et al.  A simple and improved correction for population stratification in case-control studies. , 2007, American journal of human genetics.

[37]  David Reich,et al.  Discerning the Ancestry of European Americans in Genetic Association Studies , 2007, PLoS genetics.

[38]  D. Cox,et al.  A genomewide admixture map for Latino populations. , 2007, American journal of human genetics.

[39]  Elizabeth L. Ogburn,et al.  Demonstrating stratification in a European American population , 2005, Nature Genetics.

[40]  R. Mei,et al.  A genomewide admixture mapping panel for Hispanic/Latino populations. , 2007, American journal of human genetics.