Evaluation of logistic Bayesian LASSO for identifying association with rare haplotypes

It has been hypothesized that rare variants may hold the key to unraveling the genetic transmission mechanism of many common complex traits. Currently, there is a dearth of statistical methods that are powerful enough to detect association with rare haplotypes. One of the recently proposed methods is logistic Bayesian LASSO for case-control data. By penalizing the regression coefficients through appropriate priors, logistic Bayesian LASSO weeds out the unassociated haplotypes, making it possible for the associated rare haplotypes to be detected with higher powers. We used the Genetic Analysis Workshop 18 simulated data to evaluate the behavior of logistic Bayesian LASSO in terms of its power and type I error under a complex disease model. We obtained knowledge of the simulation model, including the locations of the functional variants, and we chose to focus on two genomic regions in the MAP4 gene on chromosome 3. The sample size was 142 individuals and there were 200 replicates.Despite the small sample size, logistic Bayesian LASSO showed high power to detect two haplotypes containing functional variants in these regions while maintaining low type I errors. At the same time, a commonly used approach for haplotype association implemented in the software hapassoc failed to converge because of the presence of rare haplotypes. Thus, we conclude that logistic Bayesian LASSO can play an important role in the search for rare haplotypes.

[1]  John S. Witte,et al.  Comprehensive Approach to Analyzing Rare Genetic Variants , 2010, PloS one.

[2]  S. RichardsonINSERM,et al.  Bayesian analysis of case-control studies with categorical covariates , 2001 .

[3]  Wei Pan,et al.  Comparison of statistical tests for disease association with rare variants , 2011, Genetic epidemiology.

[4]  G. Satten,et al.  Comparison of prospective and retrospective methods for haplotype inference in case‐control studies , 2004, Genetic epidemiology.

[5]  R. Pyke,et al.  Logistic disease incidence models and case-control studies , 1979 .

[6]  Wei Guo,et al.  Generalized linear modeling with regularization for detecting common disease rare haplotype association , 2009, Genetic epidemiology.

[7]  A. Clark,et al.  The role of haplotypes in candidate gene studies , 2004, Genetic epidemiology.

[8]  G. Satten,et al.  Inference on haplotype effects in case-control studies using unphased genotype data. , 2003, American journal of human genetics.

[9]  Kathryn Roeder,et al.  Testing for an Unusual Distribution of Rare Variants , 2011, PLoS genetics.

[10]  Xihong Lin,et al.  Rare-variant association testing for sequencing data with the sequence kernel association test. , 2011, American journal of human genetics.

[11]  Shili Lin,et al.  Logistic Bayesian LASSO for Identifying Association with Rare Haplotypes and Application to Age‐Related Macular Degeneration , 2012, Biometrics.

[12]  Jung-Ying Tzeng,et al.  Evaluating haplotype effects in case‐control studies via penalized‐likelihood approaches: prospective or retrospective analysis? , 2010, Genetic epidemiology.

[13]  Shili Lin,et al.  Detecting Rare Haplotype‐Environment Interaction With Logistic Bayesian LASSO , 2014, Genetic epidemiology.

[14]  Gaurav Bhatia,et al.  A Covering Method for Detecting Genetic Associations between Rare Variants and Common Phenotypes , 2010, PLoS Comput. Biol..

[15]  Nengjun Yi,et al.  A Bayesian Hierarchical Model for Detecting Haplotype-Haplotype and Haplotype-Environment Interactions in Genetic Association Studies , 2011, Human Heredity.

[16]  Xihong Lin,et al.  Rare Variant Association Testing for Sequencing Data Using the Sequence Kernel Association Test ( SKAT ) , 2011 .

[17]  Gene-based partial least-squares approaches for detecting rare variant associations with complex traits , 2011, BMC proceedings.

[18]  Jinko Graham,et al.  hapassoc: Software for Likelihood Inference of Trait Associations with SNP Haplotypes and Other Attributes , 2006 .

[19]  D. Zeng,et al.  Likelihood-Based Inference on Haplotype Effects in Genetic Association Studies , 2006 .