Comparison of statistics in association tests of genetic markers for survival outcomes

Computationally efficient statistical tests are needed in association testing of large scale genetic markers for survival outcomes. In this study, we explore several test statistics based on the Cox proportional hazards model for survival data. First, we consider the classical partial likelihood-based Wald and score tests. A revised way to compute the score statistics is explored to improve the computational efficiency. Next, we propose a Cox-Snell residual-based score test, which allows us to handle the controlling variables more conveniently. We also illustrated the incorporation of these three tests into a permutation procedure to adjust for the multiple testing. In addition, we examine a simulation-based approach proposed by Lin (2005) to adjust for multiple testing. We presented the comparison of these four statistics in terms of type I error, power, family-wise error rate, and computational efficiency under various scenarios via extensive simulation.

[1]  David B. Goldstein,et al.  Whole genome association studies in complex diseases: where do we stand? , 2010, Dialogues in clinical neuroscience.

[2]  John R Thompson,et al.  Biostatistical Aspects of Genome‐Wide Association Studies , 2008, Biometrical journal. Biometrische Zeitschrift.

[3]  M. Horowitz,et al.  Petersdorf Hematopoietic Cell Transplantation MHC-Resident Variation Affects Risks After Unrelated Donor , 2012 .

[4]  D. Cox,et al.  A General Definition of Residuals , 1968 .

[5]  D. Lin On rapid stimulation of P values in association studies. , 2005, American journal of human genetics.

[6]  John P. Klein,et al.  Effects of model misspecification in estimating covariate effects in survival analysis for small sample sizes , 1996 .

[7]  Frank Dudbridge A note on permutation tests in multistage association scans. , 2006, American journal of human genetics.

[8]  D. Y. Lin,et al.  An efficient Monte Carlo approach to assessing statistical significance in genomic studies , 2005, Bioinform..

[9]  Brian L. Browning,et al.  PRESTO: Rapid calculation of order statistic distributions and multiple-testing adjusted P-values via permutation for one and two-stage genetic association studies , 2008, BMC Bioinformatics.

[10]  M. Boehnke,et al.  So many correlated tests, so little time! Rapid adjustment of P values for multiple correlated tests. , 2007, American journal of human genetics.

[11]  B Müller-Myhsok,et al.  Rapid simulation of P values for product methods and multiple-testing adjustment in association studies. , 2005, American journal of human genetics.

[12]  Stephen W. Lagakos,et al.  The graphical evaluation of explanatory variables in proportional hazard regression models , 1981 .

[13]  R. Doerge,et al.  Empirical threshold values for quantitative trait mapping. , 1994, Genetics.

[14]  L. J. Wei,et al.  The Robust Inference for the Cox Proportional Hazards Model , 1989 .

[15]  S. Holm A Simple Sequentially Rejective Multiple Test Procedure , 1979 .

[16]  D. Schaid,et al.  Score tests for association between traits and haplotypes when linkage phase is ambiguous. , 2002, American journal of human genetics.

[17]  John P. Klein,et al.  Competing risks , 2010 .

[18]  C. Carlson,et al.  Mapping complex disease loci in whole-genome association studies , 2004, Nature.

[19]  D. Cox Regression Models and Life-Tables , 1972 .

[20]  Eleazar Eskin,et al.  Rapid and Accurate Multiple Testing Correction and Power Estimation for Millions of Correlated Markers , 2009, PLoS genetics.

[21]  Pierre Legendre,et al.  An empirical comparison of permutation methods for tests of partial regression coefficients in a linear model , 1999 .