Incorporating model uncertainty in detecting rare variants: the Bayesian risk index

We are interested in investigating the involvement of multiple rare variants within a given region by conducting analyses of individual regions with two goals: (1) to determine if regional rare variation in aggregate is associated with risk; and (2) conditional upon the region being associated, to identify specific genetic variants within the region that are driving the association. In particular, we seek a formal integrated analysis that achieves both of our goals. For rare variants with low minor allele frequencies, there is very little power to statistically test the null hypothesis of equal allele or genotype counts for each variant. Thus, genetic association studies are often limited to detecting association within a subset of the common genetic markers. However, it is very likely that associations exist for the rare variants that may not be captured by the set of common markers. Our framework aims at constructing a risk index based on multiple rare variants within a region. Our analytical strategy is novel in that we use a Bayesian approach to incorporate model uncertainty in the selection of variants to include in the index as well as the direction of the associated effects. Additionally, the approach allows for inference at both the group and variant‐specific levels. Using a set of simulations, we show that our methodology has added power over other popular rare variant methods to detect global associations. In addition, we apply the approach to sequence data from the WECARE Study of second primary breast cancers. Genet. Epidemiol. 2011. © 2011 Wiley Periodicals, Inc. 35:638‐649, 2011

[1]  C. D. Litton,et al.  Theory of Probability (3rd Edition) , 1984 .

[2]  Nengjun Yi,et al.  Bayesian analysis of rare variants in genetic association studies , 2011, Genetic epidemiology.

[3]  C. Garner A statistical method for scanning the genome for regions with rare disease alleles , 2010, Genetic epidemiology.

[4]  A. Børresen-Dale,et al.  Characterization of BRCA1 and BRCA2 deleterious mutations and variants of unknown clinical significance in unilateral and bilateral breast cancer: the WECARE study , 2010, Human mutation.

[5]  F. Collins,et al.  Potential etiologic and functional implications of genome-wide association loci for human diseases and traits , 2009, Proceedings of the National Academy of Sciences.

[6]  Eleftheria Zeggini,et al.  Rare variant association analysis methods for complex traits. , 2010, Annual review of genetics.

[7]  D. Conti,et al.  SNPs, haplotypes, and model selection in a candidate gene region: The SIMPle analysis for multilocus data , 2004, Genetic epidemiology.

[8]  Hoda Anton-Culver,et al.  Population-based study of the risk of second primary contralateral breast cancer associated with carrying a mutation in BRCA1 or BRCA2. , 2010, Journal of clinical oncology : official journal of the American Society of Clinical Oncology.

[9]  John S. Witte,et al.  Comprehensive Approach to Analyzing Rare Genetic Variants , 2010, PloS one.

[10]  C. Begg,et al.  Population-Based Study of the Risk of Second Primary Contralateral Breast Cancer Associated With Carrying a Mutation in BRCA 1 or BRCA 2 , 2010 .

[11]  S. Leal,et al.  Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. , 2008, American journal of human genetics.

[12]  E. Zeggini,et al.  An Evaluation of Statistical Approaches to Rare Variant Analysis in Genetic Association Studies , 2009, Genetic epidemiology.

[13]  Scott C Schmidler,et al.  BAYESIAN MODEL SEARCH AND MULTILEVEL INFERENCE FOR SNP ASSOCIATION STUDIES. , 2009, The annals of applied statistics.

[14]  Duncan C Thomas,et al.  The use of hierarchical models for estimating relative risks of individual genetic variants: An application to a study of melanoma , 2008, Statistics in medicine.

[15]  Paul J. Rathouz,et al.  An Evolutionary Framework for Association Testing in Resequencing Studies , 2010, PLoS genetics.

[16]  W. Thilly,et al.  A strategy to discover genes that carry multi-allelic or mono-allelic risk for common diseases: a cohort allelic sums test (CAST). , 2007, Mutation research.

[17]  C. Begg,et al.  Variation of breast cancer risk among BRCA1/2 carriers. , 2008, JAMA.

[18]  Wei Pan,et al.  A Data-Adaptive Sum Test for Disease Association with Multiple Common or Rare Variants , 2010, Human Heredity.

[19]  S. Browning,et al.  A Groupwise Association Test for Rare Mutations Using a Weighted Sum Statistic , 2009, PLoS genetics.

[20]  J. Pritchard Are rare variants responsible for susceptibility to complex diseases? , 2001, American journal of human genetics.

[21]  Shamil R Sunyaev,et al.  Pooled association tests for rare variants in exon-resequencing studies. , 2010, American journal of human genetics.

[22]  Gaurav Bhatia,et al.  A Covering Method for Detecting Genetic Associations between Rare Variants and Common Phenotypes , 2010, PLoS Comput. Biol..

[23]  C. Begg,et al.  Assessment of rare BRCA1 and BRCA2 variants of unknown significance using hierarchical modeling , 2011, Genetic epidemiology.

[24]  Bryan Langholz,et al.  Study design: Evaluating gene–environment interactions in the etiology of breast cancer – the WECARE study , 2004, Breast Cancer Research.

[25]  J. Pritchard,et al.  The allelic architecture of human disease genes: common disease-common variant...or not? , 2002, Human molecular genetics.

[26]  V. Bansal,et al.  Statistical analysis strategies for association studies involving rare variants , 2010, Nature Reviews Genetics.

[27]  Kathryn Roeder,et al.  Testing for an Unusual Distribution of Rare Variants , 2011, PLoS genetics.

[28]  Colin B Begg,et al.  Hierarchical Modeling for Estimating Relative Risks of Rare Genetic Variants: Properties of the Pseudo‐Likelihood Method , 2011, Biometrics.