Design and analysis of admixture mapping studies.

Admixture between populations originating on different continents can be exploited to detect disease susceptibility loci at which risk alleles are distributed differentially between these populations. We first examine the statistical power and mapping resolution of this approach in the limiting situation in which gamete admixture and locus ancestry are measured without uncertainty. We show that, for a rare disease, the most efficient design is to study affected individuals only. In a typical African American population (two-way admixture proportions 0.8/0.2, ancestry crossover rate 2 per 100 cM), a study of 800 affected individuals has 90% power to detect at P values <10(-5) a locus that generates a risk ratio of 2 between populations, with an expected mapping resolution (size of 95% confidence region for the position of the locus) of 4 cM. In practice, to infer locus ancestry from marker data requires Bayesian computationally intensive methods, as implemented in the program ADMIXMAP. Affected-only study designs require strong prior information on the frequencies of each allele given locus ancestry. We show how data from unadmixed and admixed populations can be combined to estimate these ancestry-specific allele frequencies within the admixed population under study, allowing for variation between allele frequencies in unadmixed and admixed populations. Using simulated data based on the genetic structure of the African American population, we show that 60% of information can be extracted in a test for linkage using markers with an ancestry information content of 36% at 3-cM spacing. As in classic linkage studies, the most efficient strategy is to use markers at a moderate density for an initial genome search and then to saturate regions of putative linkage with additional markers, to extract nearly all information about locus ancestry.

[1]  R. Ward,et al.  Informativeness of genetic markers for inference of ancestry. , 2003, American journal of human genetics.

[2]  Mikko J Sillanpää,et al.  Bayesian analysis of multilocus association in quantitative and qualitative traits , 2003, Genetic epidemiology.

[3]  M. Stephens,et al.  Inference of population structure using multilocus genotype data: linked loci and correlated allele frequencies. , 2003, Genetics.

[4]  L. Wasserman,et al.  Analysis of multilocus models of association , 2003, Genetic epidemiology.

[5]  Mark D Shriver,et al.  Control of confounding of genetic associations in stratified populations. , 2003, American journal of human genetics.

[6]  Li Jin,et al.  Skin pigmentation, biogeographical ancestry and admixture mapping , 2003, Human Genetics.

[7]  C. Hoggart,et al.  Relation of risk of systemic lupus erythematosus to west African admixture in a Caribbean population , 2003, Human Genetics.

[8]  Pardis C Sabeti,et al.  Detecting recent positive selection in the human genome from haplotype structure , 2002, Nature.

[9]  John Kwagyan,et al.  CYP3A4-V and prostate cancer in African Americans: causal or confounding association because of population stratification? , 2002, Human Genetics.

[10]  R. Kittles,et al.  Cyp17 promoter variant associated with prostate cancer aggressiveness in African Americans. , 2001, Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology.

[11]  B Devlin,et al.  A Bayesian hierarchical model for allele frequencies , 2001, Genetic epidemiology.

[12]  Joseph D. Terwilliger,et al.  Gene Mapping in the 20th and 21st Centuries: Statistical Methods, Data Analysis, and Experimental Design , 2009, Human biology.

[13]  A. C. Collins,et al.  A method for fine mapping quantitative trait loci in outbred animal stocks. , 2000, Proceedings of the National Academy of Sciences of the United States of America.

[14]  J. Carpenter,et al.  Estimation of admixture and detection of linkage in admixed populations by a Bayesian approach: application to African‐American populations , 2000, Annals of human genetics.

[15]  M. Sillanpää,et al.  Bayesian mapping of multiple quantitative trait loci from incomplete outbred offspring data. , 1999, Genetics.

[16]  D. Allison,et al.  Estimating African American admixture proportions by use of population-specific alleles. , 1998, American journal of human genetics.

[17]  K. Weiss,et al.  Linkage disequilibrium mapping of complex disease: fantasy or reality? , 1998, Current opinion in biotechnology.

[18]  P. McKeigue,et al.  Mapping genes that underlie ethnic differences in disease risk: methods for detecting linkage in admixed populations, by conditioning on parental admixture. , 1998, American journal of human genetics.

[19]  D. F. Roberts,et al.  The History and Geography of Human Genes , 1996 .

[20]  E. Lander,et al.  Complete multipoint sib-pair analysis of qualitative and quantitative traits. , 1995, American journal of human genetics.

[21]  R. Cann The history and geography of human genes , 1995, The Journal of Asian Studies.

[22]  L Kruglyak,et al.  High-resolution genetic mapping of complex traits. , 1995, American journal of human genetics.

[23]  J. Stephens,et al.  Mapping by admixture linkage disequilibrium in human populations: limits and guidelines. , 1994, American journal of human genetics.

[24]  K. Weiss,et al.  Admixture as a tool for finding linked genes and detecting that difference from allelic association between loci. , 1988, Proceedings of the National Academy of Sciences of the United States of America.

[25]  D. Rubin Bayesianly Justifiable and Relevant Frequency Calculations for the Applied Statistician , 1984 .

[26]  S WRIGHT,et al.  Genetical Structure of Populations , 1950, British medical journal.

[27]  A. Johnstone Recognition of the rare RH type RY in three generations. , 1950, Annals of eugenics.

[28]  H. Jeffreys The Theory of Probability , 1896 .