Population stratification in epidemiologic studies of common genetic variants and cancer: quantification of bias.

BACKGROUND Some critics argue that bias from population stratification (the mixture of individuals from heterogeneous genetic backgrounds) undermines the credibility of epidemiologic studies designed to estimate the association between a genotype and the risk of disease. We investigated the degree of bias likely from population stratification in U.S. studies of cancer among non-Hispanic Caucasians of European origin. METHODS An expression of the confounding risk ratio-the ratio of the effect of the genetic factor on risk of disease with and without adjustment for ethnicity-is used to measure the potential relative bias from population stratification. We first use empirical data on the frequency of the N-acetyltransferase (NAT2) slow acetylation genotype and incidence rates of male bladder cancer and female breast cancer in non-Hispanic U.S. Caucasians with ancestries from eight European countries to assess the bias in a hypothetical population-based U.S. study that does not take ethnicity into consideration. Then, we provide theoretical calculations of the bias over a large range of allele frequencies and disease rates. RESULTS Ignoring ethnicity leads to a bias of 1% or less in our empirical studies of NAT2. Furthermore, evaluation of a wide range of allele frequencies and representative ranges of cancer rates that exist across European populations shows that the risk ratio is biased by less than 10% in U.S. studies except under extreme conditions. We note that the bias decreases as the number of ethnic strata increases. CONCLUSIONS There will be only a small bias from population stratification in a well-designed case-control study of genetic factors that ignores ethnicity among non-Hispanic U.S. Caucasians of European origin. Further work is needed to estimate the effect of population stratification within other populations.

[1]  P. Simpson,et al.  Statistical methods in cancer research , 2001, Journal of surgical oncology.

[2]  N. Rothman,et al.  NAT2 slow acetylation and bladder cancer risk: a meta-analysis of 22 case-control studies conducted in the general population. , 2000, Pharmacogenetics.

[3]  N. Rothman,et al.  Case-control studies of common alleles and environmental factors. , 1999, Journal of the National Cancer Institute. Monographs.

[4]  J. Witte,et al.  Asymptotic bias and efficiency in case-control studies of candidate genes and gene-environment interactions: basic family designs. , 1999, American journal of epidemiology.

[5]  Freely associating , 1999, Nature Genetics.

[6]  D. Altshuler,et al.  Genetic polymorphisms and disease. , 1998, The New England journal of medicine.

[7]  R. Millikan,et al.  Cigarette smoking, N-acetyltransferases 1 and 2, and breast cancer risk. , 1998, Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology.

[8]  C. Smith,et al.  A simplified assay for the arylamine N-acetyltransferase 2 polymorphism validated by phenotyping with isoniazid. , 1997, Journal of medical genetics.

[9]  J. Witte,et al.  Genetic dissection of complex traits , 1996, Nature Genetics.

[10]  D. F. Roberts,et al.  The History and Geography of Human Genes , 1996 .

[11]  Rappold,et al.  Human Molecular Genetics , 1996, Nature Medicine.

[12]  W. Ewens,et al.  The transmission/disequilibrium test: history, subdivision, and admixture. , 1995, American journal of human genetics.

[13]  R. Cann The history and geography of human genes , 1995, The Journal of Asian Studies.

[14]  N Risch,et al.  The A1 allele at the D2 dopamine receptor gene and alcoholism. A reappraisal. , 1993, JAMA.

[15]  M. Barry,et al.  Pharmacogenetics and drug metabolism: An Irish prspective , 1991, Irish journal of medical science.

[16]  K. Blum,et al.  Allelic association of human dopamine D2 receptor gene in alcoholism. , 1990, JAMA.

[17]  R. Williams,et al.  Gm3;5,13,14 and type 2 diabetes mellitus: an association in American Indians with genetic admixture. , 1988, American journal of human genetics.

[18]  O. Axelson,et al.  Indirect methods of assessing the effects of tobacco use in occupational studies. , 1988, American journal of industrial medicine.

[19]  J. Pilch,et al.  N-acetyltransferase phenotype of patients with cancer of the larynx. , 1987, Neoplasma.

[20]  S Wacholder,et al.  Conditions for confounding of the risk ratio and of the odds ratio. , 1985, American journal of epidemiology.

[21]  D. Evans,et al.  The association of the slow acetylator phenotype with bladder cancer. , 1983, Journal of medical genetics.

[22]  O S Miettinen,et al.  Components of the crude risk ratio. , 1972, American journal of epidemiology.

[23]  J. Ferlay,et al.  Cancer Incidence in Five Continents , 1970, Union Internationale Contre Le Cancer / International Union against Cancer.

[24]  E. C. Hammond,et al.  Smoking and lung cancer: recent evidence and a discussion of some questions. , 1959, Journal of the National Cancer Institute.