Small area estimation of the mean using non-parametric M-quantile regression: a comparison when a linear mixed model does not hold

The demand for reliable statistics in subpopulations, when only reduced sample sizes are available, has promoted the development of small area estimation methods. In particular, an approach that is now widely used is based on the seminal work by Battese et al. [An error-components model for prediction of county crop areas using survey and satellite data, J. Am. Statist. Assoc. 83 (1988), pp. 28–36] that uses linear mixed models (MM). We investigate alternatives when a linear MM does not hold because, on one side, linearity may not be assumed and/or, on the other, normality of the random effects may not be assumed. In particular, Opsomer et al. [Nonparametric small area estimation using penalized spline regression, J. R. Statist. Soc. Ser. B 70 (2008), pp. 265–283] propose an estimator that extends the linear MM approach to the case in which a linear relationship may not be assumed using penalized splines regression. From a very different perspective, Chambers and Tzavidis [M-quantile models for small area estimation, Biometrika 93 (2006), pp. 255–268] have recently proposed an approach for small-area estimation that is based on M-quantile (MQ) regression. This allows for models robust to outliers and to distributional assumptions on the errors and the area effects. However, when the functional form of the relationship between the qth MQ and the covariates is not linear, it can lead to biased estimates of the small area parameters. Pratesi et al. [Semiparametric M-quantile regression for estimating the proportion of acidic lakes in 8-digit HUCs of the Northeastern US, Environmetrics 19(7) (2008), pp. 687–701] apply an extended version of this approach for the estimation of the small area distribution function using a non-parametric specification of the conditional MQ of the response variable given the covariates [M. Pratesi, M.G. Ranalli, and N. Salvati, Nonparametric m-quantile regression using penalized splines, J. Nonparametric Stat. 21 (2009), pp. 287–304]. We will derive the small area estimator of the mean under this model, together with its mean-squared error estimator and compare its performance to the other estimators via simulations on both real and simulated data.

[1]  Peter Craven,et al.  Smoothing noisy data with spline functions , 1978 .

[2]  N. Tzavidis,et al.  M-quantile models for small area estimation , 2006 .

[3]  R. Chambers,et al.  Empirical Best Linear Unbiased Prediction for Out of Sample Areas , 2005 .

[4]  Ali S. Hadi,et al.  Finding Groups in Data: An Introduction to Chster Analysis , 1991 .

[5]  B. Ripley,et al.  Semiparametric Regression: Preface , 2003 .

[6]  R. Munn,et al.  The Design of Air Quality Monitoring Networks , 1981 .

[7]  Monica Pratesi,et al.  Semiparametric M‐quantile regression for estimating the proportion of acidic lakes in 8‐digit HUCs of the Northeastern US , 2008 .

[8]  F. Breidt,et al.  Nonparametric Small Area Estimation Using Penalized Spline Regression , 2005 .

[9]  Rachel M. Harter,et al.  An Error-Components Model for Prediction of County Crop Areas Using Survey and Satellite Data , 1988 .

[10]  Monica Pratesi,et al.  Nonparametric M-quantile regression using penalised splines , 2009 .

[11]  Paul H. C. Eilers,et al.  Flexible smoothing with B-splines and penalties , 1996 .

[12]  G. Wahba Smoothing noisy data with spline functions , 1975 .

[13]  R. Chambers,et al.  Estimating distribution functions from survey data , 1986 .

[14]  Peter J. Huber,et al.  Robust Statistics , 2005, Wiley Series in Probability and Statistics.

[15]  Richard M. Royall,et al.  Variance Estimation in Finite Population Sampling , 1978 .

[16]  M. Pratesi,et al.  Robust prediction of small area means and distributions , 2007 .

[17]  F. Breidt,et al.  Non‐parametric small area estimation using penalized spline regression , 2008 .

[18]  J. Rao,et al.  The estimation of the mean squared error of small-area estimators , 1990 .

[19]  B. Ripley,et al.  Robust Statistics , 2018, Encyclopedia of Mathematical Geosciences.