Approximate score‐based testing with application to multivariate trait association analysis

For genome‐wide association studies and DNA sequencing studies, several powerful score‐based tests, such as kernel machine regression and sum of powered score tests, have been proposed in the last few years. However, extensions of these score‐based tests to more complex models, such as mixed‐effects models for analysis of multiple and correlated traits, have been hindered by the unavailability of the score vector, due to either no output from statistical software or no closed‐form solution at all. We propose a simple and general method to asymptotically approximate the score vector based on an asymptotically normal and consistent estimate of a parameter vector to be tested and its (consistent) covariance matrix. The proposed method is applicable to both maximum‐likelihood estimation and estimating function‐based approaches. We use the derived approximate score vector to extend several score‐based tests to mixed‐effects models. We demonstrate the feasibility and possible power gains of these tests in association analysis of multiple and correlated quantitative or binary traits with both real and simulated data. The proposed method is easy to implement with a wide applicability.

[1]  Wei Pan,et al.  Relationship between genomic distance‐based regression and kernel machine regression for multi‐marker association testing , 2011, Genetic epidemiology.

[2]  Arnab Maity,et al.  Multivariate Phenotype Association Analysis by Marker‐Set Kernel Machine Regression , 2012, Genetic epidemiology.

[3]  C. Jack,et al.  Alzheimer's Disease Neuroimaging Initiative , 2008 .

[4]  S. Vansteelandt,et al.  A doubly robust test for gene-environment interaction in family-based studies of affected offspring. , 2010, Biostatistics.

[5]  P. Diggle Analysis of Longitudinal Data , 1995 .

[6]  Manuel A. R. Ferreira,et al.  PLINK: a tool set for whole-genome association and population-based linkage analyses. , 2007, American journal of human genetics.

[7]  J. Kent Robust properties of likelihood ratio tests , 1982 .

[8]  Bjarni J. Vilhjálmsson,et al.  A mixed-model approach for genome-wide association studies of correlated traits in structured populations , 2012, Nature Genetics.

[9]  N. Breslow,et al.  Approximate inference in generalized linear mixed models , 1993 .

[10]  P. Grambsch,et al.  Penalized Survival Models and Frailty , 2003 .

[11]  M. Stephens,et al.  Genome-wide Efficient Mixed Model Analysis for Association Studies , 2012, Nature Genetics.

[12]  Wei Pan,et al.  Asymptotic tests of association with multiple SNPs in linkage disequilibrium , 2009, Genetic epidemiology.

[13]  Gary K Grunwald,et al.  Testing gene-environment interactions in family-based association studies using trait-based ascertained samples. , 2014, Statistics in medicine.

[14]  S. Zeger,et al.  Longitudinal data analysis using generalized linear models , 1986 .

[15]  Dan-Yu Lin,et al.  A general framework for detecting disease associations with rare variants in sequencing studies. , 2011, American journal of human genetics.

[16]  Xihong Lin,et al.  GEE‐Based SNP Set Association Test for Continuous and Discrete Traits in Family‐Based Association Studies , 2013, Genetic epidemiology.

[17]  Xiaotong Shen,et al.  A Powerful and Adaptive Association Test for Rare Variants , 2014, Genetics.

[18]  Michael Weiner,et al.  Whole genome association study of brain-wide imaging phenotypes for identifying quantitative trait loci in MCI and AD: A study of the ADNI cohort , 2010, NeuroImage.

[19]  Wei Pan,et al.  Longitudinal Analysis Is More Powerful than Cross-Sectional Analysis in Detecting Genetic Association with Neuroimaging Phenotypes , 2014, PloS one.

[20]  Deanne M. Taylor,et al.  Powerful SNP-set analysis for case-control genome-wide association studies. , 2010, American journal of human genetics.

[21]  J. Meigs,et al.  Sequence Kernel Association Test for Quantitative Traits in Family Samples , 2013, Genetic epidemiology.

[22]  S. Leal,et al.  Methods for detecting associations with rare variants for common diseases: application to analysis of sequence data. , 2008, American journal of human genetics.

[23]  Qiong Yang,et al.  Analyze multivariate phenotypes in genetic association studies by combining univariate association tests , 2010, Genetic epidemiology.

[24]  Martin Styner,et al.  Projection Regression Models for Multivariate Imaging Phenotype , 2012, Genetic epidemiology.

[25]  Yingye Zheng,et al.  A Unified Mixed‐Effects Model for Rare‐Variant Association in Sequencing Studies , 2013, Genetic epidemiology.

[26]  Min A. Jhun,et al.  SNP Set Association Analysis for Familial Data , 2012, Genetic epidemiology.

[27]  M. McMullen,et al.  A unified mixed-model method for association mapping that accounts for multiple levels of relatedness , 2006, Nature Genetics.

[28]  Wei Pan,et al.  Testing for association with multiple traits in generalized estimation equations, with application to neuroimaging data , 2014, NeuroImage.

[29]  D. Boos On Generalized Score Tests , 1992 .

[30]  Qiong Yang,et al.  Methods for Analyzing Multivariate Phenotypes in Genetic Association Studies. , 2012, Journal of probability and statistics.

[31]  M. Epstein,et al.  Flexible and Robust Methods for Rare‐Variant Testing of Quantitative Traits in Trios and Nuclear Families , 2014, Genetic epidemiology.

[32]  Zhiwu Zhang,et al.  Mixed linear model approach adapted for genome-wide association studies , 2010, Nature Genetics.

[33]  D. Bates,et al.  Linear Mixed-Effects Models using 'Eigen' and S4 , 2015 .

[34]  Terry M. Therneau,et al.  Mixed Effects Cox Models , 2015 .

[35]  N. Jewell,et al.  Hypothesis testing of regression parameters in semiparametric generalized linear models for cluster correlated data , 1990 .

[36]  Jung-Ying Tzeng,et al.  Studying gene and gene-environment effects of uncommon and common variants on continuous traits: a marker-set approach using gene-trait similarity regression. , 2011, American journal of human genetics.

[37]  Kathryn Roeder,et al.  Pleiotropy and principal components of heritability combine to increase power for association analysis , 2008, Genetic epidemiology.

[38]  Xihong Lin,et al.  Rare-variant association testing for sequencing data with the sequence kernel association test. , 2011, American journal of human genetics.

[39]  Mary Sara McPeek,et al.  Robust Rare Variant Association Testing for Quantitative Traits in Samples With Related Individuals , 2014, Genetic epidemiology.

[40]  Xihong Lin,et al.  A powerful and flexible multilocus association test for quantitative traits. , 2008, American journal of human genetics.