Confounding from Cryptic Relatedness in Case-Control Association Studies

Case-control association studies are widely used in the search for genetic variants that contribute to human diseases. It has long been known that such studies may suffer from high rates of false positives if there is unrecognized population structure. It is perhaps less widely appreciated that so-called “cryptic relatedness” (i.e., kinship among the cases or controls that is not known to the investigator) might also potentially inflate the false positive rate. Until now there has been little work to assess how serious this problem is likely to be in practice. In this paper, we develop a formal model of cryptic relatedness, and study its impact on association studies. We provide simple expressions that predict the extent of confounding due to cryptic relatedness. Surprisingly, these expressions are functions of directly observable parameters. Our analytical results show that, for well-designed studies in outbred populations, the degree of confounding due to cryptic relatedness will usually be negligible. However, in contrast, studies where there is a sampling bias toward collecting relatives may indeed suffer from excessive rates of false positives. Furthermore, cryptic relatedness may be a serious concern in founder populations that have grown rapidly and recently from a small size. As an example, we analyze the impact of excess relatedness among cases for six phenotypes measured in the Hutterite population.

[1]  N. Risch Linkage strategies for genetically complex traits. I. Multilocus models. , 1990, American journal of human genetics.

[2]  N. Risch Searching for genetic determinants in the new millennium , 2000, Nature.

[3]  J. Witte,et al.  Genetic dissection of complex traits , 1996, Nature Genetics.

[4]  M S McPeek,et al.  Estimation of variance components of quantitative traits in inbred populations. , 2000, American journal of human genetics.

[5]  A. Tsalenko,et al.  A second-generation genomewide screen for asthma-susceptibility alleles in a founder population. , 2000, American journal of human genetics.

[6]  G A Satten,et al.  Accounting for unmeasured population substructure in case-control studies of genetic association using a novel latent-class model. , 2001, American journal of human genetics.

[7]  John S Witte,et al.  Point: population stratification: a problem for case-control studies of candidate-gene associations? , 2002, Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology.

[8]  F. Rousset,et al.  Inbreeding and relatedness coefficients: what do they measure? , 2002, Heredity.

[9]  J. Hey,et al.  A multi-dimensional coalescent process applied to multi-allelic selection models and migration models. , 1991, Theoretical population biology.

[10]  P Donnelly,et al.  The coalescent process with selfing. , 1997, Genetics.

[11]  M. McPeek,et al.  Quantitative-trait homozygosity and association mapping and empirical genomewide significance in large, complex pedigrees: fasting serum-insulin level in the Hutterites. , 2002, American journal of human genetics.

[12]  Nathaniel Rothman,et al.  Counterpoint: bias from population stratification is not a major threat to the validity of conclusions from epidemiological studies of common polymorphisms and cancer. , 2002, Cancer epidemiology, biomarkers & prevention : a publication of the American Association for Cancer Research, cosponsored by the American Society of Preventive Oncology.

[13]  K. Roeder,et al.  Genomic Control for Association Studies , 1999, Biometrics.

[14]  M. Lynch,et al.  Estimation of pairwise relatedness with molecular markers. , 1999, Genetics.

[15]  E A Thompson,et al.  The estimation of pairwise relationships , 1975, Annals of human genetics.

[16]  P. Donnelly,et al.  Association mapping in structured populations. , 2000, American journal of human genetics.

[17]  R. Williams,et al.  Gm3;5,13,14 and type 2 diabetes mellitus: an association in American Indians with genetic admixture. , 1988, American journal of human genetics.

[18]  P. Armitage Tests for Linear Trends in Proportions and Frequencies , 1955 .

[19]  D. Reich,et al.  Detecting association in a case‐control study while correcting for population stratification , 2001, Genetic epidemiology.

[20]  B. Milligan,et al.  Maximum-likelihood estimation of relatedness. , 2003, Genetics.

[21]  N J Cox,et al.  The importance of genealogy in determining genetic associations with complex traits. , 2001, American journal of human genetics.

[22]  L. Partridge,et al.  Oxford Surveys in Evolutionary Biology , 1991 .

[23]  K. Ritland Marker‐inferred relatedness as a tool for detecting heritability in nature , 2000, Molecular ecology.

[24]  Courtney A. Harper,et al.  A genomic screen of autism: evidence for a multilocus etiology. , 1999, American journal of human genetics.

[25]  M. McPeek,et al.  Broad and narrow heritabilities of quantitative traits in a founder population. , 2001, American journal of human genetics.

[26]  P. Sasieni From genotypes to genes: doubling the sample size. , 1997, Biometrics.

[27]  J. Wakeley,et al.  Nonequilibrium migration in human history. , 1999, Genetics.

[28]  M. Nei,et al.  Effective population size, genetic diversity, and coalescence time in subdivided populations , 1993, Journal of Molecular Evolution.

[29]  Igor Rudan,et al.  Inbreeding and the genetic complexity of human hypertension. , 2003, Genetics.

[30]  D J Schaid,et al.  Evaluation of candidate genes in case-control studies: a statistical method to account for related subjects. , 2001, American journal of human genetics.

[31]  K. Roeder,et al.  The power of genomic control. , 2000, American journal of human genetics.

[32]  Mary Sara McPeek,et al.  Novel case-control test in a founder population identifies P-selectin as an atopy-susceptibility locus. , 2003, American journal of human genetics.

[33]  J. Pritchard,et al.  Use of unlinked genetic markers to detect population stratification in association studies. , 1999, American journal of human genetics.