A Data-Adaptive Sum Test for Disease Association with Multiple Common or Rare Variants

Because associations between complex diseases and common variants are typically weak, and approaches to genotyping rare variants (e.g., next-generation resequencing) are multiplying, there is an urgent need for powerful association tests that can detect disease associations with both common and rare variants. In this article we present such a test. It is based on data-adaptive modifications to the so-called Sum test, originally proposed for common variants, which aims to balance the use of information from multiple markers in linkage disequilibrium against the cost of large degrees of freedom or of multiple-testing adjustment. When applied to multiple common or rare variants in a candidate region, the proposed test is easy to use: it has 1 degree of freedom and requires no multiple-testing adjustment. We show that the proposed test has high power across a wide range of scenarios with common variants, rare variants, or both. In particular, in some situations it outperforms several commonly used methods.
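
To make the 1-degree-of-freedom idea concrete, the following is a minimal sketch of a plain Sum-type score test, not the data-adaptive modification proposed in the article. It assumes additive 0/1/2 genotype coding, a binary case-control outcome, and a logistic model in which all variants share a single coefficient, so that only the per-subject sum of genotype codes enters the test. The function name `sum_score_test` and its arguments are hypothetical and chosen for illustration only.

```python
import math


def sum_score_test(genotypes, status):
    """Score test for association between disease status and the per-subject
    sum of genotype codes (illustrative sketch; names are hypothetical).

    genotypes: list of rows, one per subject, each a list of 0/1/2 codes.
    status:    list of 0/1 disease indicators, one per subject.
    Returns (statistic, p_value); the statistic is referred to a chi-square
    distribution with 1 degree of freedom.
    """
    # Collapse all variants into one covariate: the sum of genotype codes.
    sums = [float(sum(row)) for row in genotypes]
    n = len(sums)
    y_bar = sum(status) / n
    s_bar = sum(sums) / n

    # Score for the shared coefficient, evaluated under the null of no association.
    score = sum((y - y_bar) * s for y, s in zip(status, sums))
    # Null variance of the score under the logistic model with an intercept.
    variance = y_bar * (1.0 - y_bar) * sum((s - s_bar) ** 2 for s in sums)

    if variance == 0.0:
        # No variation in the outcome or in the summed genotypes: nothing to test.
        return 0.0, 1.0

    statistic = score * score / variance
    # Upper tail of chi-square with 1 df: P(Z^2 > t) = erfc(sqrt(t / 2)).
    p_value = math.erfc(math.sqrt(statistic / 2.0))
    return statistic, p_value


if __name__ == "__main__":
    # Toy data for illustration only: 4 subjects, 2 variants.
    toy_genotypes = [[0, 0], [0, 1], [1, 1], [2, 1]]
    toy_status = [0, 0, 1, 1]
    print(sum_score_test(toy_genotypes, toy_status))
```

Because every variant is folded into one summed covariate, the test has a single degree of freedom and needs no adjustment across variants, however many variants the region contains. The sketch uses a fixed coding for every variant and an asymptotic chi-square reference; it does not attempt to reproduce the data-adaptive step described in the article.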
