A Likelihood Ratio Test for Genome‐Wide Association under Genetic Heterogeneity

Most existing association tests for genome‐wide association studies (GWASs) fail to account for genetic heterogeneity. Zhou and Pan proposed a binomial‐mixture‐model‐based association test to account for the possible genetic heterogeneity in case‐control studies. The idea is elegant, however, the proposed test requires an expectation‐maximization (EM)‐type iterative algorithm to identify the penalised maximum likelihood estimates and a permutation method to assess p‐values. The intensive computational burden induced by the EM‐algorithm and the permutation becomes prohibitive for direct applications to GWASs. This paper develops a likelihood ratio test (LRT) for GWASs under genetic heterogeneity based on a more general alternative mixture model. In particular, a closed‐form formula for the LRT statistic is derived to avoid the EM‐type iterative numerical evaluation. Moreover, an explicit asymptotic null distribution is also obtained, which avoids using the permutation to obtain p‐values. Thus, the proposed LRT is easy to implement for GWASs. Furthermore, numerical studies demonstrate that the LRT has power advantages over the commonly used Armitage trend test and other existing association tests under genetic heterogeneity. A breast cancer GWAS dataset is used to illustrate the newly proposed LRT.

[1]  Y. Shao,et al.  Asymptotics for likelihood ratio tests under loss of identifiability , 2003 .

[2]  W. Pan,et al.  Binomial Mixture Model‐based Association Tests under Genetic Heterogeneity , 2009, Annals of human genetics.

[3]  T. Hudson,et al.  A genome-wide association study identifies novel risk loci for type 2 diabetes , 2007, Nature.

[4]  Yan Li A comparison of tests for Hardy-Weinberg Equilibrium in national genetic household surveys , 2013, BMC Genetics.

[5]  M. Spence,et al.  Analysis of human genetic linkage , 1986 .

[6]  Joseph L. Gastwirth,et al.  Trend Tests for Case-Control Studies of Genetic Markers: Power, Sample Size and Robustness , 2002, Human Heredity.

[7]  Yuejiao Fu,et al.  Testing for homogeneity in genetic linkage analysis , 2006 .

[8]  Deborah Hughes,et al.  Genome-wide association study identifies five new breast cancer susceptibility loci , 2010, Nature Genetics.

[9]  S. Hodge,et al.  Quantification of type I error probabilities for heterogeneity LOD scores , 2002, Genetic epidemiology.

[10]  M. King,et al.  Linkage of early-onset familial breast cancer to chromosome 17q21. , 1990, Science.

[11]  Patrick Neven,et al.  11q13 is a susceptibility locus for hormone receptor positive breast cancer , 2012, Human mutation.

[12]  P. Bickel,et al.  Mathematical Statistics: Basic Ideas and Selected Topics , 1977 .

[13]  C. A. Smith,et al.  Testing for heterogeneity of recombination fraction values in Human Genetics , 1963, Annals of human genetics.

[14]  Pengfei Li,et al.  Non-finite Fisher information and homogeneity: an EM approach , 2009 .

[15]  J. Witte,et al.  Genetic dissection of complex traits. , 1994, Nature genetics.

[16]  T. H. Emigh,et al.  A comparison of tests for Hardy-Weinberg equilibrium. , 1980, Biometrics.

[17]  Eric S. Lander,et al.  Asymptotic distribution of the likelihood ratio test that a mixture of two binomials is a single binomial , 1995 .

[18]  W. Ruan,et al.  Genetic polymorphisms and breast cancer risk: evidence from meta-analyses, pooled analyses, and genome-wide association studies , 2011, Breast Cancer Research and Treatment.

[19]  A. Hattersley,et al.  Maturity‐onset diabetes of the young: clinical heterogeneity explained by genetic heterogeneity , 1998, Diabetic medicine : a journal of the British Diabetic Association.

[20]  J. Witte,et al.  Genetic dissection of complex traits , 1996, Nature Genetics.

[21]  P. Armitage Tests for Linear Trends in Proportions and Frequencies , 1955 .

[22]  S. Seal,et al.  Localization of a breast cancer susceptibility gene, BRCA2, to chromosome 13q12-13. , 1994, Science.

[23]  M. Chiano,et al.  Linkage detection under heterogeneity and the mixture problem , 1995, Annals of Human Genetics.

[24]  K. Taylor,et al.  Genome-Wide Association , 2007, Diabetes.