Bayesian Latent Variable Collapsing Model for Detecting Rare Variant Interaction Effect in Twin Study

By analyzing more next‐generation sequencing data, researchers have affirmed that rare genetic variants are widespread among populations and likely play an important role in complex phenotypes. Recently, a handful of statistical models have been developed to analyze rare variant (RV) association in different study designs. However, due to the scarce occurrence of minor alleles in data, appropriate statistical methods for detecting RV interaction effects are still difficult to develop. We propose a hierarchical Bayesian latent variable collapsing method (BLVCM), which circumvents the obstacles by parameterizing the signals of RVs with latent variables in a Bayesian framework and is parameterized for twin data. The BLVCM can tackle nonassociated variants, allow both protective and deleterious effects, capture SNP‐SNP synergistic effect, provide estimates for the gene level and individual SNP contributions, and can be applied to both independent and various twin designs. We assessed the statistical properties of the BLVCM using simulated data, and found that it achieved better performance in terms of power for interaction effect detection compared to the Granvil and the SKAT. As proof of practical application, the BLVCM was then applied to a twin study analysis of more than 20,000 gene regions to identify significant RVs associated with low‐density lipoprotein cholesterol level. The results show that some of the findings are consistent with previous studies, and we identified some novel gene regions with significant SNP–SNP synergistic effects.

[1]  M. Stephens,et al.  Bayesian statistical methods for genetic association studies , 2009, Nature Reviews Genetics.

[2]  John S. Witte,et al.  Comprehensive Approach to Analyzing Rare Genetic Variants , 2010, PloS one.

[3]  Wei Pan,et al.  Comparison of statistical tests for disease association with rare variants , 2011, Genetic epidemiology.

[4]  E. Zeggini,et al.  An Evaluation of Statistical Approaches to Rare Variant Analysis in Genetic Association Studies , 2009, Genetic epidemiology.

[5]  Dorret I. Boomsma,et al.  The continuing value of twin studies in the omics era , 2012, Nature Reviews Genetics.

[6]  Greg Gibson,et al.  Rare and common variants: twenty arguments , 2012, Nature Reviews Genetics.

[7]  G. McVean,et al.  Differential confounding of rare and common variants in spatially structured populations , 2011, Nature Genetics.

[8]  Antonio Ciampi,et al.  Adjusted Sequence Kernel Association Test for Rare Variants Controlling for Cryptic and Family Relatedness , 2013, Genetic epidemiology.

[9]  Y. Bossé,et al.  Genome-wide linkage scan reveals multiple susceptibility loci influencing lipid and lipoprotein levels in the Québec Family Studys⃞s⃞ The online version of this article (available at http://www.jlr.org) contains one additional table. Published, JLR Papers in Press, December 16, 2003. DOI 10.1194/jl , 2004, Journal of Lipid Research.

[10]  Jacob A. Tennessen,et al.  Evolution and Functional Impact of Rare Coding Variation from Deep Sequencing of Human Exomes , 2012, Science.

[11]  Jason H. Moore,et al.  Missing heritability and strategies for finding the underlying causes of complex disease , 2010, Nature Reviews Genetics.

[12]  Richard Robinson,et al.  Common Disease, Multiple Rare (and Distant) Variants , 2010, PLoS biology.

[13]  David V Conti,et al.  Incorporating model uncertainty in detecting rare variants: the Bayesian risk index , 2011, Genetic epidemiology.

[14]  Noah Kaplan,et al.  Practical Issues in Implementing and Understanding Bayesian Ideal Point Estimation , 2005, Political Analysis.

[15]  P. McCullagh,et al.  Generalized Linear Models , 1984 .

[16]  Robert C Elston,et al.  The genetic basis of complex traits: rare variants or "common gene, common disease"? , 2007, Methods in molecular biology.

[17]  J. O’Connell,et al.  A Null Mutation in Human APOC3 Confers a Favorable Plasma Lipid Profile and Apparent Cardioprotection , 2008, Science.

[18]  M. Alnaqeeb,et al.  Re-sequencing of the APOAI promoter region and the genetic association of the -75G > A polymorphism with increased cholesterol and low density lipoprotein levels among a sample of the Kuwaiti population , 2013, BMC Medical Genetics.

[19]  Nengjun Yi,et al.  Bayesian analysis of rare variants in genetic association studies , 2011, Genetic epidemiology.

[20]  Dolores Corella,et al.  Six new loci associated with blood low-density lipoprotein cholesterol, high-density lipoprotein cholesterol or triglycerides in humans , 2008, Nature Genetics.

[21]  Jennifer G. Robinson,et al.  Whole-exome sequencing identifies rare and low-frequency coding variants associated with LDL cholesterol. , 2014, American journal of human genetics.

[22]  C. Robert Simulation of truncated normal variables , 2009, 0907.4010.

[23]  Jeffrey N. Rouder,et al.  Bayes factor approaches for testing interval null hypotheses. , 2011, Psychological methods.

[24]  John P A Ioannidis,et al.  Effect of formal statistical significance on the credibility of observational associations. , 2008, American journal of epidemiology.

[25]  Maya R. Gupta,et al.  Introduction to the Dirichlet Distribution and Related Processes , 2010 .

[26]  K. Becker The common variants/multiple disease hypothesis of common complex genetic disorders. , 2004, Medical hypotheses.

[27]  Joseph Hilbe,et al.  Data Analysis Using Regression and Multilevel/Hierarchical Models , 2009 .

[28]  John Fox,et al.  OpenMx: An Open Source Extended Structural Equation Modeling Framework , 2011, Psychometrika.

[29]  Xihong Lin,et al.  Rare-variant association testing for sequencing data with the sequence kernel association test. , 2011, American journal of human genetics.

[30]  Lee-Jen Wei,et al.  Pooled Association Tests for Rare Variants in Exon-Resequencing Studies , 2010 .

[31]  Andrew Gelman,et al.  Data Analysis Using Regression and Multilevel/Hierarchical Models , 2006 .

[32]  R. Collins,et al.  Newly identified loci that influence lipid concentrations and risk of coronary artery disease , 2008, Nature Genetics.

[33]  Suzanne M. Leal,et al.  A Novel Adaptive Method for the Analysis of Next-Generation Sequencing Data to Detect Complex Trait Associations with Rare Variants Due to Gene Main Effects and Interactions , 2010, PLoS genetics.

[34]  B. Cohen,et al.  Genetic Interactions Between Transcription Factors Cause Natural Variation in Yeast , 2009, Science.

[35]  Yurii S. Aulchenko,et al.  The Empirical Power of Rare Variant Association Methods: Results from Sanger Sequencing in 1,998 Individuals , 2012, PLoS genetics.

[36]  Shizhong Xu,et al.  Significance Test and Genome Selection in Bayesian Shrinkage Analysis , 2010, International journal of plant genomics.

[37]  K. Frazer,et al.  Human genetic variation and its contribution to complex traits , 2009, Nature Reviews Genetics.

[38]  Wei Pan,et al.  A Data-Adaptive Sum Test for Disease Association with Multiple Common or Rare Variants , 2010, Human Heredity.

[39]  S. Browning,et al.  A Groupwise Association Test for Rare Mutations Using a Weighted Sum Statistic , 2009, PLoS genetics.

[40]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[41]  A. Zwinderman,et al.  Frequent Mutation in the ABCC6 Gene (R1141X) Is Associated With a Strong Increase in the Prevalence of Coronary Artery Disease , 2002, Circulation.

[42]  I. Good The Bayes/Non-Bayes Compromise: A Brief Review , 1992 .

[43]  S. Kardia,et al.  SNP-SNP interactions dominate the genetic architecture of candidate genes associated with left ventricular mass in african-americans of the GENOA study , 2010, BMC Medical Genetics.

[44]  Jinzhen Wu,et al.  Association of the GALNT2 gene polymorphisms and several environmental factors with serum lipid levels in the Mulao and Han populations , 2011, Lipids in Health and Disease.

[45]  Kathryn Roeder,et al.  Testing for an Unusual Distribution of Rare Variants , 2011, PLoS genetics.

[46]  P. Donnelly,et al.  A Flexible and Accurate Genotype Imputation Method for the Next Generation of Genome-Wide Association Studies , 2009, PLoS genetics.

[47]  M. McCarthy,et al.  Genome-wide association studies for complex traits: consensus, uncertainty and challenges , 2008, Nature Reviews Genetics.

[48]  S. Leal,et al.  Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. , 2008, American journal of human genetics.