Genetic model selection in two-phase analysis for case-control association studies.

The Cochran-Armitage trend test (CATT) is well suited for testing association between a marker and a disease in case-control studies. When the underlying genetic model for the disease is known, the CATT optimal for the genetic model is used. For complex diseases, however, the genetic models of the true disease loci are unknown. In this situation, robust tests are preferable. We propose a two-phase analysis with model selection for the case-control design. In the first phase, we use the difference of Hardy-Weinberg disequilibrium coefficients between the cases and the controls for model selection. Then, an optimal CATT corresponding to the selected model is used for testing association. The correlation of the statistics used for selection and the test for association is derived to adjust the two-phase analysis with control of the Type-I error rate. The simulation studies show that this new approach has greater efficiency robustness than the existing methods.

[1]  W. G. Cochran Some Methods for Strengthening the Common χ 2 Tests , 1954 .

[2]  Joseph L. Gastwirth,et al.  Comparison of robust tests for genetic association using case-control studies , 2006, math/0611179.

[3]  R. Elston,et al.  A powerful method of combining measures of association and Hardy–Weinberg disequilibrium for fine‐mapping in case‐control studies , 2006, Statistics in medicine.

[4]  Joseph L. Gastwirth,et al.  The Use of Maximin Efficiency Robust Tests in Combining Contingency Tables and Survival Analysis , 1985 .

[5]  D. Schaid,et al.  Case-Control Studies of Genetic Markers: Power and Sample Size Approximations for Armitage’s Test for Trend , 2001, Human Heredity.

[6]  Joseph L. Gastwirth,et al.  Trend Tests for Case-Control Studies of Genetic Markers: Power, Sample Size and Robustness , 2002, Human Heredity.

[7]  M J Podgor,et al.  Efficiency Robust Tests for Survival or Ordered Categorical Data , 1999, Biometrics.

[8]  K. Roeder,et al.  Genomic Control for Association Studies , 1999, Biometrics.

[9]  R. Davies Hypothesis testing when a nuisance parameter is present only under the alternative , 1977 .

[10]  Gang Zheng,et al.  Genomic Control for Association Studies under Various Genetic Models , 2005, Biometrics.

[11]  J. Pritchard,et al.  Use of unlinked genetic markers to detect population stratification in association studies. , 1999, American journal of human genetics.

[12]  B. Freidlin,et al.  On power and efficiency robust linkage tests for affected sibs , 2000, Annals of human genetics.

[13]  R. Hogg Adaptive Robust Procedures: A Partial Review and Some Suggestions for Future Applications and Theory , 1974 .

[14]  D. Zucker,et al.  Weighted log rank type statistics for comparing survival curves when there is a time lag in the effectiveness of treatment , 1990 .

[15]  E. Korn,et al.  Choice of column scores for testing independence in ordered 2 X K contingency tables. , 1987, Biometrics.

[16]  R. Elston,et al.  Tests for a Disease-susceptibility Locus allowing for an Inbreeding Coefficient (F) , 2003, Genetica.

[17]  Prakash Gorroochurn,et al.  Centralizing the non‐central chi‐square: a new method to correct for population stratification in genetic case‐control association studies , 2006, Genetic epidemiology.

[18]  J. Ott,et al.  Complement Factor H Polymorphism in Age-Related Macular Degeneration , 2005, Science.

[19]  A. Whittemore,et al.  Simple, robust linkage tests for affected sibs. , 1998, American journal of human genetics.

[20]  P. Armitage Tests for Linear Trends in Proportions and Frequencies , 1955 .

[21]  Joseph L. Gastwirth,et al.  Choice of scores in trend tests for case-control studies of candidate-gene associations , 2003 .

[22]  A. Whittemore,et al.  Allele-sharing among affected relatives: non-parametric methods for identifying genes , 2001, Statistical methods in medical research.

[23]  Bruce S. Weir,et al.  Genetic Data Analysis: Methods for Discrete Population Genetic Data. , 1991 .

[24]  Wei-Min Chen,et al.  QTL fine mapping by measuring and testing for Hardy-Weinberg and linkage disequilibrium at a series of linked marker loci in extreme samples of populations. , 2000, American journal of human genetics.

[25]  G A Satten,et al.  Accounting for unmeasured population substructure in case-control studies of genetic association using a novel latent-class model. , 2001, American journal of human genetics.

[26]  Jacqueline K. Wittke-Thompson,et al.  Rational inferences about departures from Hardy-Weinberg equilibrium. , 2005, American journal of human genetics.

[27]  M. Ehm,et al.  Detecting marker-disease association by testing for Hardy-Weinberg disequilibrium at a marker locus. , 1998, American journal of human genetics.

[28]  Richard Simon,et al.  Two-stage selection and testing designs for comparative clinical trials , 1988 .

[29]  P. Sasieni From genotypes to genes: doubling the sample size. , 1997, Biometrics.

[30]  J. Gastwirth ON ROBUST PROCEDURES , 1966 .

[31]  Wayne P. Maddison,et al.  Genetic Data Analysis: Methods for Discrete Population Genetic Data , 1991 .

[32]  Kai Wang,et al.  A constrained-likelihood approach to marker-trait association studies. , 2005, American journal of human genetics.