Robust Mantel‐Haenszel test under genetic model uncertainty allowing for covariates in case‐control association studies

The trend test under the additive model is commonly used when a case‐control genetic association study is carried out. However, for many complex diseases, the underlying genetic models are unknown and a mis‐specification of the genetic model may result in a substantial loss of power. MAX3 has been proposed as an efficiency robust test against genetic model uncertainty which takes the maximum absolute value of the trend test statistics under the recessive, additive, and dominant models. Besides its popularity, little attention has been paid to the adjustment of covariates in this test and existing approaches all depend on the estimators of parameters of interest which may be seriously biased if the individuals are divided into a large number of partial tables stratified by covariates. In this article, we propose a modified MAX3 test based on the Mantel‐Haenszel test (MHT). This new test avoids estimating the nuisance parameters induced by the covariates; thus, it is valid under both large and small numbers of partial tables while still enjoys the property of efficiency robustness. The asymptotic distribution of the test under the null hypothesis of no association is also derived; thus the corresponding asymptotic P‐value of the statistic can be easily calculated. Besides, we prove that this new test can be equally derived through a conditional likelihood. As a result, the original MAX3 based on the trend tests or the matching trend tests can be treated as a special case and generally incorporated into the newly proposed test. Simulation results show that the modified MAX3 can keep the correct size under the null hypothesis and is more efficiency robustness than any single MHT optimal for a specified genetic model under the alternative hypothesis. Two real examples corresponding to the large and small number of partial tables scenarios, respectively, are analyzed using the proposed method. A type 2 diabetes mellitus data set is also analyzed to evaluate the performance of the proposed test under the GWAS criteria. Genet. Epidemiol. 2011. © 2011 Wiley Periodicals, Inc. 35:695‐705, 2011

[1]  D. Y. Lin,et al.  An efficient Monte Carlo approach to assessing statistical significance in genomic studies , 2005, Bioinform..

[2]  Estimation of P‐value of MAX test with double triangle diagram for 2 × 3 SNP case‐control tables , 2010, Genetic epidemiology.

[3]  N. Jewell,et al.  A geometric approach to assess bias due to omitted covariates in generalized linear models , 1993 .

[4]  T. Hudson,et al.  A genome-wide association study identifies novel risk loci for type 2 diabetes , 2007, Nature.

[5]  Chia-Ling Kuo,et al.  What's the best statistic for a simple test of genetic association in a case‐control study? , 2009, Genetic epidemiology.

[6]  Xavier Estivill,et al.  Maximizing association statistics over genetic models , 2008, Genetic epidemiology.

[7]  Gang Zheng,et al.  Asymptotic powers for matched trend tests and robust matched trend tests in case-control genetic association studies , 2010, Comput. Stat. Data Anal..

[8]  Joseph L. Gastwirth,et al.  Choice of scores in trend tests for case-control studies of candidate-gene associations , 2003 .

[9]  P. Armitage Tests for Linear Trends in Proportions and Frequencies , 1955 .

[10]  J. Neuhaus Estimation efficiency with omitted covariates in generalized linear models , 1998 .

[11]  Nathan Mantel,et al.  Chi-square tests with one degree of freedom , 1963 .

[12]  Qizhai Li,et al.  Efficient Approximation of P‐value of the Maximum of Correlated Tests, with Applications to Genome‐Wide Association Studies , 2008, Annals of human genetics.

[13]  R. Baughman Design of a case control etiologic study of sarcoidosis (ACCESS). ACCESS Research Group. , 1999, Journal of clinical epidemiology.

[14]  K. Lunetta,et al.  Testing for population subdivision and association in four case-control studies. , 2002, American journal of human genetics.

[15]  U. Vogel,et al.  Association of DNA repair gene XRCC1 and lung cancer susceptibility among nonsmoking Chinese women. , 2009, Cancer genetics and cytogenetics.

[16]  C. Xing,et al.  Adjusting for covariates in logistic regression models , 2010, Genetic epidemiology.

[17]  P. Sasieni From genotypes to genes: doubling the sample size. , 1997, Biometrics.

[18]  Hon-Cheong So,et al.  Robust Association Tests Under Different Genetic Models, Allowing for Binary or Quantitative Traits and Covariates , 2011, Behavior genetics.

[19]  Manal M. Hassan,et al.  Effect of Insulin-Like Growth Factor Gene Polymorphisms Alone or In Interaction with Diabetes on the Risk of Pancreatic Cancer , 2008, Cancer Epidemiology Biomarkers & Prevention.

[20]  Joseph L. Gastwirth,et al.  Trend Tests for Case-Control Studies of Genetic Markers: Power, Sample Size and Robustness , 2002, Human Heredity.

[21]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[22]  W. Fung,et al.  Simple algorithms to calculate asymptotic null distributions of robust tests in case-control genetic association studies in R , 2010 .

[23]  W. Haenszel,et al.  Statistical aspects of the analysis of data from retrospective studies of disease. , 1959, Journal of the National Cancer Institute.

[24]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[25]  Day Ne,et al.  Testing hypotheses in case-control studies--equivalence of Mantel-Haenszel statistics and logit score tests. , 1979 .

[26]  N. Day,et al.  Testing hypotheses in case-control studies--equivalence of Mantel-Haenszel statistics and logit score tests. , 1979, Biometrics.

[27]  G. Zheng,et al.  Robust trend tests for genetic association using matched case‐control design , 2006, Statistics in medicine.

[28]  Alan Agresti,et al.  Categorical Data Analysis , 2003 .

[29]  N Risch,et al.  The Future of Genetic Studies of Complex Human Diseases , 1996, Science.

[30]  Y. Okada,et al.  An optimal dose‐effect mode trend test for SNP genotype tables , 2009, Genetic epidemiology.