Population structure, differential bias and genomic control in a large-scale, case-control association study

The main problems in drawing causal inferences from epidemiological case-control studies are confounding by unmeasured extraneous factors, selection bias and differential misclassification of exposure. In genetics the first of these, in the form of population structure, has dominated recent debate. Population structure explained part of the significant +11.2% inflation of test statistics we observed in an analysis of 6,322 nonsynonymous SNPs in 816 cases of type 1 diabetes and 877 population-based controls from Great Britain. The remainder of the inflation resulted from differential bias in genotype scoring between case and control DNA samples, which originated from two laboratories, causing false-positive associations. To avoid excluding SNPs and losing valuable information, we extended the genomic control method by applying a variable downweighting to each SNP.

[1]  P. Armitage Tests for Linear Trends in Proportions and Frequencies , 1955 .

[2]  Nathan Mantel,et al.  Chi-square tests with one degree of freedom , 1963 .

[3]  P. McCullagh,et al.  Generalized Linear Models , 1972, Predictive Analytics.

[4]  M. Plummer,et al.  International agency for research on cancer. , 2020, Archives of pathology.

[5]  N. Breslow,et al.  Statistical methods in cancer research: volume 1- The analysis of case-control studies , 1980 .

[6]  N. Breslow,et al.  Statistical methods in cancer research. Vol. 1. The analysis of case-control studies. , 1981 .

[7]  K. Roeder,et al.  Genomic Control for Association Studies , 1999, Biometrics.

[8]  P. Simpson,et al.  Statistical methods in cancer research , 2001, Journal of surgical oncology.

[9]  Ronald W. Davis,et al.  Multiplexed genotyping with sequence-tagged molecular inversion probes , 2003, Nature Biotechnology.

[10]  Toshihiro Tanaka The International HapMap Project , 2003, Nature.

[11]  Luc J. Smink,et al.  Association of the T-cell regulatory gene CTLA4 with susceptibility to autoimmune disease , 2003, Nature.

[12]  P. Tam The International HapMap Consortium. The International HapMap Project (Co-PI of Hong Kong Centre which responsible for 2.5% of genome) , 2003 .

[13]  J Tuomilehto,et al.  Cost-effective analysis of candidate genes using htSNPs: a staged approach , 2004, Genes and Immunity.

[14]  S. Gabriel,et al.  Assessing the impact of population stratification on genetic association studies , 2004, Nature Genetics.

[15]  Kathryn Roeder,et al.  Genomic Control to the extreme , 2004, Nature Genetics.

[16]  P. Donnelly,et al.  The effects of human population structure on large genetic association studies , 2004, Nature Genetics.

[17]  Adrian Vella,et al.  Localization of a type 1 diabetes locus in the IL2RA/CD25 region by use of tag single-nucleotide polymorphisms. , 2005, American journal of human genetics.

[18]  D. Clayton,et al.  Genome-wide association studies: theoretical and practical concerns , 2005, Nature Reviews Genetics.

[19]  Fuli Yu,et al.  Highly multiplexed molecular inversion probe genotyping: over 10,000 targeted SNPs genotyped in a single tube assay. , 2005, Genome research.

[20]  James Ireland,et al.  Optimal genotype determination in highly multiplexed SNP data , 2006, European Journal of Human Genetics.