Using ancestry-informative markers to define populations and detect population stratification

A serious problem with case-control studies is that population subdivision, recent admixture and sampling variance can lead to spurious associations between a phenotype and a marker locus, or indeed may mask true associations. This is also a concern in therapeutics since drug response may differ by ethnicity. Population stratification can occur if cases and controls have different frequencies of ethnic groups or in admixed populations, different fractions of ancestry, and when phenotypes of interest such as disease, drug response or drug metabolism, also differ between ethnic groups. Although most genetic variation is inter-individual, there is also significant inter-ethnic variation. The International HapMap Project has provided allele frequencies for approximately three million single nucleotide polymorphisms (SNPs) in Africans, Europeans and East Asians. SNP variation is greatest in Africans. Statistical methods for the detection and correction of population stratification, principally Structured Association and Genomic Control, have recently become freely available. These methods use marker loci spread throughout the genome that are unlinked to the candidate locus to estimate the ancestry of individuals within a sample, and to test for and adjust the ethnic matching of cases and controls. To date, few case-control association studies have incorporated testing for population stratification. This paper will focus on the debate about the quantity and methods for selection of highly informative marker loci required to characterize populations that vary in substructure or the degree of admixture, and will discuss how these theoretically desirable approaches can be effectively put into practice.

[1]  P. Donnelly,et al.  Case-control studies of association in structured or admixed populations. , 2001, Theoretical population biology.

[2]  M. Enoch,et al.  Pharmacogenomics of Alcohol Response and Addiction , 2003, American journal of pharmacogenomics : genomics-related research in drug development and clinical practice.

[3]  D. Allison,et al.  Estimating African American admixture proportions by use of population-specific alleles. , 1998, American journal of human genetics.

[4]  K. Kohda,et al.  Serotonin transporter-linked promoter region polymorphism and personality traits in a Japanese population , 2003, Neuroscience Letters.

[5]  M. Derenko,et al.  Mitochondrial DNA Variation in the Aboriginal Populations of the Altai‐Baikal Region: Implications for the Genetic History of North Asia and America , 2004, Annals of the New York Academy of Sciences.

[6]  K. Roeder,et al.  Genomic Control for Association Studies , 1999, Biometrics.

[7]  B. Pollock,et al.  Serotonin Transporter Promoter Polymorphism in African Americans , 2003 .

[8]  David Goldman,et al.  Genetic origins of anxiety in women: a role for a functional catechol-O-methyltransferase polymorphism , 2003, Psychiatric genetics.

[9]  C. Rotimi,et al.  Genetic structure in four West African population groups , 2005, BMC Genetics.

[10]  Geoffrey B. Nilsen,et al.  Whole-Genome Patterns of Common DNA Variation in Three Human Populations , 2005, Science.

[11]  J. Long,et al.  Autosomal, mitochondrial, and Y chromosome DNA variation in Finland: evidence for a male-specific bottleneck. , 1999, American journal of physical anthropology.

[12]  R. Ward,et al.  Informativeness of genetic markers for inference of ancestry. , 2003, American journal of human genetics.

[13]  David Goldman,et al.  Serotonin transporter promoter gain-of-function genotypes are linked to obsessive-compulsive disorder. , 2006, American journal of human genetics.

[14]  P. Underhill,et al.  Different genetic components in the Norwegian population revealed by the analysis of mtDNA and Y chromosome polymorphisms , 2002, European Journal of Human Genetics.

[15]  D. Goldman,et al.  Thr105Ile, a functional polymorphism of histamine N-methyltransferase, is associated with alcoholism in two independent populations. , 2005, Alcoholism, clinical and experimental research.

[16]  S. Kaasa,et al.  The Val158Met polymorphism of the human catechol-O-methyltransferase (COMT) gene may influence morphine requirements in cancer pain patients , 2005, Pain.

[17]  J. Witte,et al.  Genetic dissection of complex traits , 1996, Nature Genetics.

[18]  Hongzhe Li,et al.  Examination of ancestry and ethnic affiliation using highly informative diallelic DNA markers: application to diverse and admixed populations and implications for clinical epidemiology and forensic medicine , 2005, Human Genetics.

[19]  W J Ewens,et al.  The TDT and other family-based tests for linkage disequilibrium and association. , 1996, American journal of human genetics.

[20]  J. Belmont,et al.  Mexican American ancestry-informative markers: examination of population structure and marker characteristics in European Americans, Mexican Americans, Amerindians and Asians , 2004, Human Genetics.

[21]  Birgir Hrafnkelsson,et al.  An Icelandic example of the impact of population structure on association studies , 2005, Nature Genetics.

[22]  E. Lander,et al.  Meta-analysis of genetic association studies supports a contribution of common variants to susceptibility to common disease , 2003, Nature Genetics.

[23]  N. Risch,et al.  Genetic admixture and asthma‐related phenotypes in Mexican American and Puerto Rican asthmatics , 2005, Genetic epidemiology.

[24]  A. Llerena,et al.  Influence of genetic admixture on polymorphisms of drug‐metabolizing enzymes: Analyses of mutations on NAT2 and CYP2E1 genes in a mixed Hispanic population , 1998, Clinical pharmacology and therapeutics.

[25]  J. Stephens,et al.  Haplotype Variation and Linkage Disequilibrium in 313 Human Genes , 2001, Science.

[26]  P. Frossard,et al.  The geographic distribution of cystic fibrosis mutations gives clues about population origins , 2000, European Journal of Pediatrics.

[27]  Alicja Lerner,et al.  Serotonin transporter: gene, genetic disorders, and pharmacogenetics. , 2004, Molecular interventions.

[28]  Elizabeth L. Ogburn,et al.  Demonstrating stratification in a European American population , 2005, Nature Genetics.

[29]  M. Feldman,et al.  Genetic Structure of Human Populations , 2002, Science.

[30]  J. Pritchard,et al.  Use of unlinked genetic markers to detect population stratification in association studies. , 1999, American journal of human genetics.

[31]  Pui-Yan Kwok,et al.  Population structure, admixture, and aging-related phenotypes in African American adults: the Cardiovascular Health Study. , 2005, American journal of human genetics.

[32]  P. Donnelly,et al.  Inference of population structure using multilocus genotype data. , 2000, Genetics.

[33]  F. Salzano,et al.  Heterogeneity of the genome ancestry of individuals classified as White in the State of Rio Grande do Sul, Brazil , 2005, American journal of human biology : the official journal of the Human Biology Council.

[34]  John Kwagyan,et al.  CYP3A4-V and prostate cancer in African Americans: causal or confounding association because of population stratification? , 2002, Human Genetics.

[35]  H. Blumberg,et al.  CHRM2 gene predisposes to alcohol dependence, drug dependence and affective disorders: results from an extended case-control structured association study. , 2005, Human molecular genetics.

[36]  Yusuke Nakamura,et al.  Inference from the relationships between linkage disequilibrium and allele frequency distributions of 240 candidate SNPs in 109 drug-related genes in four Asian populations , 2004, Journal of Human Genetics.

[37]  M. Egan,et al.  Catechol-O-methyltransferase val108/158met genotype predicts working memory response to antipsychotic medications , 2004, Biological Psychiatry.

[38]  K. Konvička,et al.  Matching strategies for genetic association studies in structured populations. , 2004, American journal of human genetics.

[39]  P. Donnelly,et al.  Association mapping in structured populations. , 2000, American journal of human genetics.

[40]  J. Long,et al.  Individual estimates of European genetic admixture associated with lower body-mass index, plasma glucose, and prevalence of type 2 diabetes in Pima Indians. , 2000, American journal of human genetics.

[41]  J. Witte,et al.  Genetic dissection of complex traits. , 1994, Nature genetics.

[42]  A. B. Perkins,et al.  High-density single-nucleotide polymorphism maps of the human genome. , 2005, Genomics.

[43]  L. Wasserman,et al.  Genomic control, a new approach to genetic-based association studies. , 2001, Theoretical population biology.

[44]  Kenneth K Kidd,et al.  Global variation in the frequencies of functionally different catechol-O-methyltransferase alleles , 1999, Biological Psychiatry.

[45]  R. Williams,et al.  Gm3;5,13,14 and type 2 diabetes mellitus: an association in American Indians with genetic admixture. , 1988, American journal of human genetics.

[46]  J. Weber,et al.  Markers that discriminate between European and African ancestry show limited variation within Africa , 2002, Human Genetics.