A Bayesian Hierarchical Model for Detecting Haplotype-Haplotype and Haplotype-Environment Interactions in Genetic Association Studies

Objective: Haplotype-based genetic association studies are a powerful tool for discovering and characterizing the genetic basis of complex human diseases. However, statistical methods for detecting haplotype-haplotype and haplotype-environment interactions are not yet fully developed, owing to two difficulties: the large number of potential haplotypes and the fact that haplotype pairs are not directly observed. Furthermore, methods for detecting associations between rare haplotypes and disease have not kept pace with those for common haplotypes.

Methods/Results: We propose an efficient and robust method that addresses these problems using a Bayesian hierarchical generalized linear model. The model simultaneously fits environmental effects, main effects of numerous common and rare haplotypes, and haplotype-haplotype and haplotype-environment interactions. The key to the approach is a continuous prior distribution on the coefficients that favors sparseness in the fitted model and facilitates computation. We develop a fast expectation-maximization algorithm that fits the model by estimating posterior modes of the coefficients, and we incorporate it into the iteratively weighted least squares procedure for classical generalized linear models as implemented in the R function glm. We evaluate the proposed method and compare its performance with existing methods on extensive simulated data.

Conclusion: In the simulations, the proposed method performed well in all scenarios considered and was more powerful than the existing approaches it was compared with.
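
To make the fitting idea concrete, the following is a minimal sketch of an EM algorithm wrapped around iteratively weighted least squares for a logistic model. It is not the authors' implementation. The abstract does not name the prior, so the sketch assumes a Student-t prior written as a scale mixture of normals (coefficient j has prior N(0, tau_j^2), with tau_j^2 following a scaled inverse chi-square distribution with hypothetical hyperparameters nu and s). The function names, the toy data, and the unpenalized intercept are likewise illustrative assumptions.

```python
import math

def solve(A, b):
    """Solve the linear system A x = b by Gaussian elimination with partial pivoting."""
    n = len(b)
    M = [row[:] + [b[i]] for i, row in enumerate(A)]
    for c in range(n):
        p = max(range(c, n), key=lambda r: abs(M[r][c]))
        M[c], M[p] = M[p], M[c]
        for r in range(c + 1, n):
            f = M[r][c] / M[c][c]
            for k in range(c, n + 1):
                M[r][k] -= f * M[c][k]
    x = [0.0] * n
    for r in range(n - 1, -1, -1):
        x[r] = (M[r][n] - sum(M[r][k] * x[k] for k in range(r + 1, n))) / M[r][r]
    return x

def sigmoid(t):
    """Numerically stable logistic function."""
    if t >= 0:
        return 1.0 / (1.0 + math.exp(-t))
    e = math.exp(t)
    return e / (1.0 + e)

def em_iwls_logistic(X, y, nu=1.0, s=0.5, n_iter=50):
    """Posterior-mode estimate for logistic regression under an assumed
    Student-t prior (scale mixture of normals). Column 0 of X is the
    intercept and is left unpenalized (an assumption of this sketch)."""
    p = len(X[0])
    beta = [0.0] * p
    for _ in range(n_iter):
        # E-step: conditional expectation of each prior precision 1/tau_j^2
        # given the current coefficient value.
        prec = [0.0] + [(nu + 1.0) / (nu * s * s + b * b) for b in beta[1:]]
        # M-step: one IWLS step on the working response, with the expected
        # precisions added as a ridge penalty to the normal equations.
        A = [[0.0] * p for _ in range(p)]
        rhs = [0.0] * p
        for xi, yi in zip(X, y):
            eta = sum(a * b for a, b in zip(xi, beta))
            mu = sigmoid(eta)
            w = max(mu * (1.0 - mu), 1e-8)   # IWLS weight
            z = eta + (yi - mu) / w          # working response
            for j in range(p):
                rhs[j] += w * xi[j] * z
                for k in range(p):
                    A[j][k] += w * xi[j] * xi[k]
        for j in range(p):
            A[j][j] += prec[j]
        beta = solve(A, rhs)
    return beta

# Toy data (hypothetical): x1 is an indicator associated with disease,
# x2 is an indicator with no association by construction.
X, y = [], []
for x1, cases in ((0, 2), (1, 8)):
    for x2 in (0, 1):
        for i in range(10):
            X.append([1.0, float(x1), float(x2)])
            y.append(1 if i < cases else 0)

beta = em_iwls_logistic(X, y)
print(beta)
```

In this toy design the coefficient of the unassociated indicator is pulled to zero, while the associated one is shrunk only mildly relative to its maximum likelihood estimate. This is the kind of sparseness-favoring behavior the abstract attributes to a continuous prior. Extending the design matrix with product columns would add interaction terms in the same framework.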
