Testing for association in case-control genome-wide association studies with shared controls

The statistical analysis of genome-wide association studies (GWASs) with multiple diseases and shared controls (SCs) is discussed. The usual method for analyzing data from these studies is to compare each individual disease with either the SCs or the pooled controls which include other diseases. We observed that applying individual association tests can be problematic because these tests may suffer from power loss in detecting significant associations between diseases and single-nucleotide polymorphism or copy number variant. We propose here a two-stage procedure wherein we first apply an overall chi-square test for multiple diseases with SCs; if the overall test is rejected, then individual tests using the chi-square partition method will be applied to each disease against SCs. A real GWAS data set with SCs and a Monte Carlo simulation study are used to demonstrate that the proposed method is more effective and preferable than other existing methods for analyzing data from GWASs with multiple diseases and SCs.

[1]  F. Dudbridge,et al.  Estimation of significance thresholds for genomewide association scans , 2008, Genetic epidemiology.

[2]  Qingzhong Liu,et al.  A New Approach to Account for the Correlations among Single Nucleotide Polymorphisms in Genome-Wide Association Studies , 2011, Human Heredity.

[3]  Min-Jeong Kwak,et al.  Improving Power for Testing Genetic Association in Case–Control Studies by Reducing the Alternative Space , 2010, Biometrics.

[4]  Joseph L. Gastwirth,et al.  Comparison of robust tests for genetic association using case-control studies , 2006, math/0611179.

[5]  Joseph L. Gastwirth,et al.  Trend Tests for Case-Control Studies of Genetic Markers: Power, Sample Size and Robustness , 2002, Human Heredity.

[6]  R. Doerge,et al.  Empirical threshold values for quantitative trait mapping. , 1994, Genetics.

[7]  Hon Keung Tony Ng,et al.  Design and analysis of multiple diseases genome-wide association studies without controls. , 2012, Gene.

[8]  J. Li,et al.  Adjusting multiple testing in multilocus analyses using the eigenvalues of a correlation matrix , 2005, Heredity.

[9]  W. G. Cochran Some Methods for Strengthening the Common χ 2 Tests , 1954 .

[10]  J. Cheverud,et al.  A simple correction for multiple comparisons in interval mapping genome scans , 2001, Heredity.

[11]  Kai Wang,et al.  Statistical tests of genetic association for case-control study designs. , 2012, Biostatistics.

[12]  Joseph L. Gastwirth,et al.  The Use of Maximin Efficiency Robust Tests in Combining Contingency Tables and Survival Analysis , 1985 .

[13]  M. Daly,et al.  Estimation of the multiple testing burden for genomewide association studies of nearly all common variants , 2008, Genetic epidemiology.

[14]  P. Armitage Tests for Linear Trends in Proportions and Frequencies , 1955 .

[15]  Gang Zheng,et al.  Genetic model selection in two-phase analysis for case-control association studies. , 2008, Biostatistics.

[16]  Zhongxue Chen A new association test based on Chi‐square partition for case‐control GWA studies , 2011, Genetic epidemiology.

[17]  Hon Keung Tony Ng,et al.  A Robust Method for Testing Association in Genome-Wide Association Studies , 2011, Human Heredity.

[18]  V. Moskvina,et al.  On multiple‐testing correction in genome‐wide association studies , 2008, Genetic epidemiology.

[19]  Eden R Martin,et al.  A multiple testing correction method for genetic association studies using correlated single nucleotide polymorphisms , 2008, Genetic epidemiology.

[20]  Joseph L. Gastwirth,et al.  Choice of scores in trend tests for case-control studies of candidate-gene associations , 2003 .

[21]  M. Boehnke,et al.  So many correlated tests, so little time! Rapid adjustment of P values for multiple correlated tests. , 2007, American journal of human genetics.