Robust Tests in Genome-Wide Scans under Incomplete Linkage Disequilibrium

Under complete linkage disequilibrium (LD), robust tests often have greater power than Pearson's chi-square test and trend tests for the analysis of case-control genetic association studies. Robust statistics have been used in candidate-gene and genome-wide association studies (GWAS) when the genetic model is unknown. We consider here a more general incomplete LD model and examine how the penetrances at the marker locus are affected when the genetic models are defined at the disease locus. Robust statistics are then reviewed, and their efficiency and robustness are compared through simulations of GWAS with 300,000 markers under the incomplete LD model. Applications of several robust tests to the Wellcome Trust Case-Control Consortium data [Nature 447 (2007) 661--678] are presented.
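
The two ideas in the abstract can be illustrated with a minimal Python sketch. It is not the authors' code, and every function and parameter name in it is hypothetical. The first function derives the penetrances seen at a marker from penetrances defined at the disease locus, assuming Hardy-Weinberg equilibrium and a single LD coefficient D between the disease allele and marker allele M. The second part computes the Cochran-Armitage trend statistic with scores (0, x, 1) and takes the maximum of its absolute value over x = 0, 1/2, 1 (recessive, additive, dominant). This maximum is commonly called MAX3 and is used here only as one representative robust statistic, since the abstract does not say which statistics the paper compares.

```python
import math


def marker_penetrances(f, p, q, D):
    """Penetrances at the marker implied by penetrances at the disease locus.

    f: (f0, f1, f2), penetrances for 0, 1, 2 copies of the disease allele.
    p, q: frequencies of the disease allele and of the marker allele M.
    D: LD coefficient, P(disease allele and M on one haplotype) - p*q.
    Assumption: Hardy-Weinberg equilibrium, so haplotypes pair at random.
    """
    a = p + D / q          # P(disease allele | haplotype carries M)
    b = p - D / (1 - q)    # P(disease allele | haplotype carries m)

    def mix(u, v):
        # Mean penetrance when the two haplotypes carry the disease
        # allele independently with probabilities u and v.
        return (f[0] * (1 - u) * (1 - v)
                + f[1] * (u * (1 - v) + v * (1 - u))
                + f[2] * u * v)

    return mix(b, b), mix(a, b), mix(a, a)   # marker genotypes mm, Mm, MM


def trend_z(cases, controls, x):
    """Cochran-Armitage trend statistic with genotype scores (0, x, 1).

    cases, controls: genotype counts ordered (mm, Mm, MM).
    The variance is zero (division error) if a scored genotype is absent.
    """
    scores = (0.0, x, 1.0)
    r, s = sum(cases), sum(controls)
    n = r + s
    totals = [c + t for c, t in zip(cases, controls)]
    u = sum(w * (s * c - r * t) for w, c, t in zip(scores, cases, controls))
    sum_w = sum(w * m for w, m in zip(scores, totals))
    sum_w2 = sum(w * w * m for w, m in zip(scores, totals))
    var = r * s / n * (n * sum_w2 - sum_w ** 2)
    return u / math.sqrt(var)


def max3(cases, controls):
    """Maximum absolute trend statistic over the three standard models."""
    return max(abs(trend_z(cases, controls, x)) for x in (0.0, 0.5, 1.0))


if __name__ == "__main__":
    # Recessive model at the disease locus, LD at half its maximum value.
    print(marker_penetrances((0.1, 0.1, 0.2), p=0.3, q=0.3, D=0.105))
    # Illustrative (made-up) genotype counts for cases and controls.
    print(max3((120, 230, 150), (160, 240, 100)))
```

With D at its maximum and p = q (complete LD), the marker penetrances equal the disease-locus penetrances. With D = 0 they collapse to a single value, the disease prevalence. In the recessive example above, the heterozygote penetrance at the marker rises above the baseline, so the model seen at the marker is no longer exactly recessive. This is the kind of distortion that motivates statistics that do not commit to a single genetic model.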
