Robust and accurate inference for generalized linear models

In the framework of generalized linear models, the nonrobustness of classical estimators and tests for the parameters is a well known problem, and alternative methods have been proposed in the literature. These methods are robust and can cope with deviations from the assumed distribution. However, they are based on first order asymptotic theory, and their accuracy in moderate to small samples is still an open question. In this paper, we propose a test statistic which combines robustness and good accuracy for moderate to small sample sizes. We combine results from Cantoni and Ronchetti [E. Cantoni, E. Ronchetti, Robust inference for generalized linear models, Journal of the American Statistical Association 96 (2001) 1022-1030] and Robinson, Ronchetti and Young [J. Robinson, E. Ronchetti, G.A. Young, Saddlepoint approximations and tests based on multivariate M-estimators, The Annals of Statistics 31 (2003) 1154-1169] to obtain a robust test statistic for hypothesis testing and variable selection, which is asymptotically @g^2-distributed as the three classical tests but with a relative error of order O(n^-^1). This leads to reliable inference in the presence of small deviations from the assumed model distribution, and to accurate testing and variable selection, even in moderate to small samples.

[1]  D. Ruppert,et al.  Optimally bounded score functions for generalized linear models with applications to logistic regression , 1986 .

[2]  G. A. Young,et al.  Saddlepoint approximation for the studentized mean, with an application to the bootstrap , 1991 .

[3]  P. McCullagh,et al.  Generalized Linear Models , 1992 .

[4]  Elvezio Ronchetti,et al.  Saddlepoint approximations and tests based on multivariate M-estimates , 2003 .

[5]  Andreas Ruckstuhl,et al.  ROBUST FITTING OF THE BINOMIAL MODEL , 2001 .

[6]  Saddlepoint Approximations for Marginal and Conditional Probabilities of Transformed Variables , 1994 .

[7]  Christophe Croux,et al.  Implementing the Bianco and Yohai estimator for logistic regression , 2003, Comput. Stat. Data Anal..

[8]  Chris Field,et al.  Small sample confidence intervals , 1990 .

[9]  Chris Field,et al.  Approximations for marginal densities of M‐estimators , 1995 .

[10]  Thomas J. DiCiccio,et al.  Approximations of marginal tail probabilities and inference for scalar parameters , 1990 .

[11]  Elvezio Ronchetti,et al.  Small Sample Asymptotics , 1990 .

[12]  Ana M. Bianco,et al.  Robust Estimation in the Logistic Regression Model , 1996 .

[13]  D. Pregibon Resistant fits for some commonly used logistic models with medical application. , 1982, Biometrics.

[14]  Maria-Pia Victoria-Feser,et al.  Robust inference with binary data , 2001 .

[15]  S. Morgenthaler Least-Absolute-Deviations Fits for Generalized Linear Models , 1992 .

[16]  E. Ronchetti,et al.  General Saddlepoint Approximations of Marginal Densities and Tail Probabilities , 1996 .

[17]  R. Carroll,et al.  Conditionally Unbiased Bounded-Influence Estimation in General Regression Models, with Applications to Generalized Linear Models , 1989 .

[18]  S. Hosseinian,et al.  Robust inference for generalized linear models , 2009 .

[19]  B. Ripley,et al.  Robust Statistics , 2018, Encyclopedia of Mathematical Geosciences.

[20]  Elvezio Ronchetti,et al.  Empirical Saddlepoint Approximations for Multivariate M-estimators , 1994 .

[21]  Ronald W. Butler,et al.  Saddlepoint Approximations with Applications: Preface , 2007 .

[22]  Accurate and efficient construction of bootstrap likelihoods , 1995 .

[23]  Saddlepoint approximations in conditional inference , 1993 .

[24]  P. McCullagh,et al.  Generalized Linear Models , 1984 .