Collapsing Approaches for the Association Analysis of Rare Variants

In testing for associations with rare variants, alternative methods to those used for common SNPs are required due to their lack of power at lower frequency. Collapsing approaches overcome this power loss by testing for an association with an aggregate of rare variants. These tests pool information across the rare variants such that a single test is performed on the summary statistic, and are powerful tools, provided that certain conditions are satisfied. In this chapter, the general framework of collapsing methods is explored, including optimal conditions for attaining high power. Comparisons are made between specific collapsing methods, as well as data-adaptive versions that have been developed to recover much of the power loss from nonideal settings.

[1]  P. Bork,et al.  A method and server for predicting damaging missense mutations , 2010, Nature Methods.

[2]  J. Pritchard Are rare variants responsible for susceptibility to complex diseases? , 2001, American journal of human genetics.

[3]  Claudia Hemmelmann,et al.  Statistical analysis of rare sequence variants: an overview of collapsing methods , 2011, Genetic epidemiology.

[4]  V. Bansal,et al.  Statistical analysis strategies for association studies involving rare variants , 2010, Nature Reviews Genetics.

[5]  Fiona Cunningham,et al.  A Combined Functional Annotation Score for Non-Synonymous Variants , 2012, Human Heredity.

[6]  P. Stanier,et al.  A functional haplotype variant in the TBX22 promoter is associated with cleft palate and ankyloglossia , 2009, Journal of Medical Genetics.

[7]  Eleftheria Zeggini,et al.  Rare variant association analysis methods for complex traits. , 2010, Annual review of genetics.

[8]  Lee-Jen Wei,et al.  Pooled Association Tests for Rare Variants in Exon-Resequencing Studies , 2010 .

[9]  Yun Li,et al.  Extending rare-variant testing strategies: analysis of noncoding sequence and imputed genotypes. , 2010, American journal of human genetics.

[10]  K. Van Steen,et al.  Sequencing of DISC1 Pathway Genes Reveals Increased Burden of Rare Missense Variants in Schizophrenia Patients from a Northern Swedish Population , 2011, PloS one.

[11]  Eleftheria Zeggini,et al.  ARIEL and AMELIA: Testing for an Accumulation of Rare Variants Using Next-Generation Sequencing Data , 2012, Human Heredity.

[12]  Dara G Torgerson,et al.  Sequencing the IL4 locus in African Americans implicates rare noncoding variants in asthma susceptibility. , 2009, The Journal of allergy and clinical immunology.

[13]  D. Altshuler,et al.  A map of human genome variation from population-scale sequencing , 2010, Nature.

[14]  K. Mossman The Wellcome Trust Case Control Consortium, U.K. , 2008 .

[15]  S. Leal,et al.  Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. , 2008, American journal of human genetics.

[16]  Yurii S. Aulchenko,et al.  The Empirical Power of Rare Variant Association Methods: Results from Sanger Sequencing in 1,998 Individuals , 2012, PLoS genetics.

[17]  R. Mägi,et al.  Genome-Wide Association Analysis of Imputed Rare Variants: Application to Seven Common Complex Diseases , 2012, Genetic epidemiology.

[18]  Simon C. Potter,et al.  Genome-wide association study of 14,000 cases of seven common diseases and 3,000 shared controls , 2007, Nature.

[19]  W. Thilly,et al.  A strategy to discover genes that carry multi-allelic or mono-allelic risk for common diseases: a cohort allelic sums test (CAST). , 2007, Mutation research.

[20]  Wei Pan,et al.  A Data-Adaptive Sum Test for Disease Association with Multiple Common or Rare Variants , 2010, Human Heredity.

[21]  E. Zeggini,et al.  Functional annotation of non-coding sequence variants , 2014, Nature Methods.

[22]  S. Browning,et al.  A Groupwise Association Test for Rare Mutations Using a Weighted Sum Statistic , 2009, PLoS genetics.

[23]  Gang Shi,et al.  Optimum designs for next‐generation sequencing to discover rare variants for common complex disease , 2011, Genetic epidemiology.

[24]  Andrew D. Johnson,et al.  Whole Genome Sequence-Based Analysis of a Model Complex Trait, High Density Lipoprotein Cholesterol , 2013, Nature Genetics.

[25]  E. Boerwinkle,et al.  Mining gold dust under the genome wide significance level: a two‐stage approach to analysis of GWAS , 2011, Genetic epidemiology.

[26]  Claudio J. Verzilli,et al.  An Abundance of Rare Functional Variants in 202 Drug Target Genes Sequenced in 14,002 People , 2012, Science.

[27]  Steven Henikoff,et al.  SIFT: predicting amino acid changes that affect protein function , 2003, Nucleic Acids Res..

[28]  V. Salomaa,et al.  Excess of rare variants in genes identified by genome-wide association study of hypertriglyceridemia , 2010, Nature Genetics.

[29]  Shuanglin Zhang,et al.  Two Adaptive Weighting Methods to Test for Rare Variant Associations in Family‐Based Designs , 2012, Genetic epidemiology.

[30]  E. Grice,et al.  A common sex-dependent mutation in a RET enhancer underlies Hirschsprung disease risk , 2005, Nature.

[31]  E. Zeggini,et al.  An Evaluation of Statistical Approaches to Rare Variant Analysis in Genetic Association Studies , 2009, Genetic epidemiology.

[32]  P. Bork,et al.  Human non-synonymous SNPs: server and survey. , 2002, Nucleic acids research.