Association tests using kernel‐based measures of multi‐locus genotype similarity between individuals

In a genetic association study, it is often desirable to perform an overall test of whether any or all single‐nucleotide polymorphisms (SNPs) in a gene are associated with a phenotype. Several such tests exist, but most of them are powerful only under very specific assumptions about the genetic effects of the individual SNPs. In addition, some of the existing tests assume that the direction of the effect of each SNP is known, which is a highly unlikely scenario. Here, we propose a new kernel‐based association test of joint association of several SNPs. Our test is non‐parametric and robust, and does not make any assumption about the directions of individual SNP effects. It can be used to test multiple correlated SNPs within a gene and can also be used to test independent SNPs or genes in a biological pathway. Our test uses an analysis of variance paradigm to compare variation between cases and controls to the variation within the groups. The variation is measured using kernel functions for each marker, and then a composite statistic is constructed to combine the markers into a single test. We present simulation results comparing our statistic to the U‐statistic‐based method by Schaid et al. ([2005] Am. J. Hum. Genet. 76:780–793) and another statistic by Wessel and Schork ([2006] Am. J. Hum. Genet. 79:792–806). We consider a variety of different disease models and assumptions about how many SNPs within the gene are actually associated with disease. Our results indicate that our statistic has higher power than other statistics under most realistic conditions. Genet. Epidemiol. 34: 213–221, 2010. © 2009 Wiley‐Liss, Inc.

[1]  Qiuying Sha,et al.  A new association test using haplotype similarity , 2007, Genetic epidemiology.

[2]  Qiuying Sha,et al.  Tests of Association Between Quantitative Traits and Haplotypes In A Reduced‐Dimensional Space , 2005, Annals of human genetics.

[3]  Kai Wang,et al.  A principal components regression approach to multilocus genetic association studies , 2008, Genetic epidemiology.

[4]  Daniel J Schaid,et al.  Nonparametric tests of association of multiple genes with human disease. , 2005, American journal of human genetics.

[5]  Brian H. McArdle,et al.  FITTING MULTIVARIATE MODELS TO COMMUNITY DATA: A COMMENT ON DISTANCE‐BASED REDUNDANCY ANALYSIS , 2001 .

[6]  J. Chang-Claude,et al.  Haplotype Sharing Analysis Using Mantel Statistics , 2005, Human Heredity.

[7]  Jennifer Wessel,et al.  DNA sequence-based phenotypic association analysis. , 2008, Advances in genetics.

[8]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[9]  Jurg Ott,et al.  Sum statistics for the joint detection of multiple disease loci in case‐control association studies with SNP markers , 2003, Genetic epidemiology.

[10]  David V Conti,et al.  Testing association between disease and multiple SNPs in a candidate gene , 2007, Genetic epidemiology.

[11]  L. Wasserman,et al.  On the identification of disease mutations by the analysis of haplotype similarity and goodness of fit. , 2003, American journal of human genetics.

[12]  Xihong Lin,et al.  A powerful and flexible multilocus association test for quantitative traits. , 2008, American journal of human genetics.

[13]  P. Donnelly,et al.  A new multipoint method for genome-wide association studies by imputation of genotypes , 2007, Nature Genetics.

[14]  J. Ott,et al.  Trimming, weighting, and grouping SNPs in human case-control association studies. , 2001, Genome research.

[15]  John Whittaker,et al.  Analysis of multiple SNPs in a candidate gene or region , 2008, Genetic epidemiology.

[16]  Tao Wang,et al.  Improved power by use of a weighted score test for linkage disequilibrium mapping. , 2007, American journal of human genetics.

[17]  Daniel J Schaid,et al.  Power comparisons between similarity‐based multilocus association methods, logistic regression, and score tests for haplotypes , 2009, Genetic epidemiology.

[18]  N. Schork,et al.  Generalized genomic distance-based regression methodology for multilocus association analysis. , 2006, American journal of human genetics.

[19]  Kathryn Roeder,et al.  Analysis of single‐locus tests to detect gene/disease associations , 2005, Genetic epidemiology.

[20]  Ruzong Fan,et al.  Genome association studies of complex diseases by case-control designs. , 2003, American journal of human genetics.