Testing against a high-dimensional alternative in the generalized linear model: asymptotic type I error control

Testing a low-dimensional null hypothesis against a high-dimensional alternative in a generalized linear model may lead to a test statistic that is a quadratic form in the residuals under the null model. Using asymptotic arguments, we show that the distribution of such a test statistic can be approximated by a ratio of quadratic forms in normal variables, for which algorithms are readily available. For generalized linear models, the asymptotic distribution shows good control of type I error for moderate to small samples, even when the number of covariates in the model far exceeds the sample size. Copyright 2011, Oxford University Press.

[1]  A. Davison Treatment effect heterogeneity in paired data , 1992 .

[2]  H C van Houwelingen,et al.  Testing the fit of a regression model via score tests in random effects models. , 1995, Biometrics.

[3]  Bart Mertens,et al.  Case-Control Breast Cancer Study of MALDI-TOF Proteomic Mass Spectrometry Data on Serum Samples , 2008, Statistical applications in genetics and molecular biology.

[4]  Sara van de Geer,et al.  Testing against a high dimensional alternative , 2006 .

[5]  Geert Molenberghs,et al.  The Use of Score Tests for Inference on Variance Components , 2003, Biometrics.

[6]  Wei Pan,et al.  Asymptotic tests of association with multiple SNPs in linkage disequilibrium , 2009, Genetic epidemiology.

[7]  Enno Mammen,et al.  Testing Parametric Versus Semiparametric Modelling in Generalized Linear Models , 1996 .

[8]  Xihong Lin,et al.  Semiparametric Regression of Multidimensional Genetic Pathway Data: Least‐Squares Kernel Machines and Linear Mixed Models , 2007, Biometrics.

[9]  D. Commenges,et al.  Tests of Homogeneity for Generalized Linear Models , 1995 .

[10]  S. le Cessie,et al.  Testing the fit of a regression model via score tests in random effects models. , 1995 .

[11]  F R Rosendaal,et al.  Testing familial aggregation. , 1995, Biometrics.

[12]  Adrian Bowman,et al.  On the Use of Nonparametric Regression for Checking Linear Relationships , 1993 .

[13]  H. Robbins,et al.  Application of the Method of Mixtures to Quadratic Forms in Normal Variates , 1949 .

[14]  Jelle J. Goeman,et al.  A global test for groups of genes: testing association with a clinical outcome , 2004, Bioinform..

[15]  David R. Cox,et al.  Some remarks on overdispersion , 1983 .

[16]  A. V. D. Vaart,et al.  Asymptotic Statistics: Frontmatter , 1998 .

[17]  J. P. BfflOF Computing the distribution of quadratic forms in normal variables , 2005 .

[18]  Van,et al.  A gene-expression signature as a predictor of survival in breast cancer. , 2002, The New England journal of medicine.

[19]  J. Mesirov,et al.  Molecular classification of cancer: class discovery and class prediction by gene expression monitoring. , 1999, Science.

[20]  D. Kuonen Saddlepoint approximations for distributions of quadratic forms in normal variables , 1999 .

[21]  Daniel Commenges,et al.  Generalized Score Test of Homogeneity Based on Correlated Random Effects Models , 1997 .

[22]  Friedrich Götze,et al.  Asymptotic Distribution of Quadratic Forms and Applications , 2002 .