Leveraging Prior Information to Detect Causal Variants via Multi-Variant Regression

Although many methods are available to test sequence variants for association with complex diseases and traits, methods that specifically seek to identify causal variants are less developed. Here we develop and evaluate a Bayesian hierarchical regression method that incorporates prior information on the likelihood of variant causality through weighting of variant effects. By simulation studies using both simulated and real sequence variants, we compared a standard single variant test for analyzing variant-disease association with the proposed method using different weighting schemes. We found that by leveraging linkage disequilibrium of variants with known GWAS signals and sequence conservation (phastCons), the proposed method provides a powerful approach for detecting causal variants while controlling false positives.

[1]  Qianqian Zhu,et al.  Prioritizing genetic variants for causality on the basis of preferential linkage disequilibrium. , 2012, American journal of human genetics.

[2]  Kathryn Roeder,et al.  Testing for an Unusual Distribution of Rare Variants , 2011, PLoS genetics.

[3]  David B. Goldstein,et al.  Rare Variants Create Synthetic Genome-Wide Associations , 2010, PLoS biology.

[4]  Judy H. Cho,et al.  Finding the missing heritability of complex diseases , 2009, Nature.

[5]  M. Goddard,et al.  Prediction of total genetic value using genome-wide dense marker maps. , 2001, Genetics.

[6]  Shamil R Sunyaev,et al.  Most rare missense alleles are deleterious in humans: implications for complex disease and association studies. , 2007, American journal of human genetics.

[7]  David V Conti,et al.  Incorporating model uncertainty in detecting rare variants: the Bayesian risk index , 2011, Genetic epidemiology.

[8]  C. M. Mutshinda,et al.  Bayesian shrinkage analysis of QTLs under shape-adaptive shrinkage priors, and accurate re-estimation of genetic effects , 2011, Heredity.

[9]  Eden R. Martin,et al.  Reconsidering Association Testing Methods Using Single-Variant Test Statistics as Alternatives to Pooling Tests for Sequence Data with Rare Variants , 2012, PloS one.

[10]  David B Dunson,et al.  Bayesian Semiparametric Multiple Shrinkage , 2010, Biometrics.

[11]  J. Shendure,et al.  Needles in stacks of needles: finding disease-causal variants in a wealth of genomic data , 2011, Nature Reviews Genetics.

[12]  Christian E Elger,et al.  15q13.3 microdeletions increase risk of idiopathic generalized epilepsy , 2009, Nature Genetics.

[13]  Mourad Sahbatou,et al.  Association of NOD2 leucine-rich repeat variants with susceptibility to Crohn's disease , 2001, Nature.

[14]  Qianqian Zhu,et al.  A genome-wide comparison of the functional properties of rare and common genetic variants in humans. , 2011, American journal of human genetics.

[15]  Jon Wakefield,et al.  Commentary: Genome-wide significance thresholds via Bayes factors. , 2012, International journal of epidemiology.

[16]  K. Shianna,et al.  Inosine triphosphate protects against ribavirin-induced adenosine triphosphate loss by adenylosuccinate synthase function. , 2011, Gastroenterology.

[17]  Gonçalo R. Abecasis,et al.  GENOME: a rapid coalescent-based whole genome simulator , 2007, Bioinform..

[18]  Thomas E. Nichols,et al.  Nonparametric permutation tests for functional neuroimaging: A primer with examples , 2002, Human brain mapping.

[19]  Ethan M. Lange,et al.  Prioritized Subset Analysis: Improving Power in Genome-wide Association Studies , 2007, Human Heredity.

[20]  Eleazar Eskin,et al.  An Optimal Weighted Aggregated Association Test for Identification of Rare Variants Involved in Common Diseases , 2011, Genetics.

[21]  P. Bork,et al.  Human non-synonymous SNPs: server and survey. , 2002, Nucleic acids research.

[22]  N. Yi,et al.  Bayesian LASSO for Quantitative Trait Loci Mapping , 2008, Genetics.

[23]  Michael P. Epstein,et al.  A permutation procedure to correct for confounders in case-control studies, including tests of rare variation. , 2012, American journal of human genetics.

[24]  Iuliana Ionita-Laza,et al.  A New Testing Strategy to Identify Rare Variants with Either Risk or Protective Effect on Disease , 2011, PLoS genetics.

[25]  P. Sham,et al.  A Knowledge-Based Weighting Framework to Boost the Power of Genome-Wide Association Studies , 2010, PloS one.

[26]  S. Leal,et al.  Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. , 2008, American journal of human genetics.

[27]  Adam Kiezun,et al.  Computational and statistical approaches to analyzing variants identified by exome sequencing , 2011, Genome Biology.

[28]  Lee-Jen Wei,et al.  Pooled Association Tests for Rare Variants in Exon-Resequencing Studies , 2010 .

[29]  Jay Shendure,et al.  Single-nucleotide evolutionary constraint scores highlight disease-causing mutations , 2010, Nature Methods.

[30]  Judy H. Cho,et al.  [Letters to Nature] , 1975, Nature.

[31]  Wei Pan,et al.  Adaptive tests for association analysis of rare variants , 2011, Genetic epidemiology.

[32]  R. Doerge,et al.  Empirical threshold values for quantitative trait mapping. , 1994, Genetics.

[33]  S. Browning,et al.  A Groupwise Association Test for Rare Mutations Using a Weighted Sum Statistic , 2009, PLoS genetics.

[34]  Yun Li,et al.  To identify associations with rare variants, just WHaIT: Weighted haplotype and imputation-based tests. , 2010, American journal of human genetics.

[35]  Dan M Roden,et al.  A rare variant in MYH6 is associated with high risk of sick sinus syndrome , 2011, Nature Genetics.

[36]  J. Todd,et al.  Rare Variants of IFIH1, a Gene Implicated in Antiviral Responses, Protect Against Type 1 Diabetes , 2009, Science.

[37]  Jacques Fellay,et al.  ITPA gene variants protect against anaemia in patients treated for chronic hepatitis C , 2010, Nature.

[38]  K. Mossman The Wellcome Trust Case Control Consortium, U.K. , 2008 .

[39]  Xihong Lin,et al.  Rare-variant association testing for sequencing data with the sequence kernel association test. , 2011, American journal of human genetics.

[40]  Bruce Winney,et al.  Multiple rare variants in different genes account for multifactorial inherited susceptibility to colorectal adenomas. , 2004, Proceedings of the National Academy of Sciences of the United States of America.

[41]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[42]  John S. Witte,et al.  Comprehensive Approach to Analyzing Rare Genetic Variants , 2010, PloS one.

[43]  Eleazar Eskin,et al.  Incorporating prior information into association studies , 2012, Bioinform..