Robust estimation for finite populations based on a working model

A common scenario in finite population inference is that it is possible to find a working superpopulation model which explains the main features of the population but which may not capture all the fine details. In addition, there are often outliers in the population which do not follow the assumed superpopulation model. In situations like these, it is still advantageous to make use of the working model to estimate finite population quantities, provided that we do it in a robust manner. The approach that we suggest is first to fit the working model to the sample and then to fine-tune for departures from the model assumed by estimating the conditional distribution of the residuals as a function of the auxiliary variable. This is a more direct approach to handling outliers and model misspecification than the Huber approach that is currently being used. Two simple methods, stratification and nearest neighbour smoothing, are used to estimate the conditional distributions of the residuals, which result in two modifications to the standard model-based estimator of the population distribution function. The estimators suggested perform very well in simulation studies involving two types of model departure and have small variances due to their model-based construction as well as acceptable bias. The potential advantage of the proposed robustified model-based approach over direct nonparametric regression is also demonstrated.

[1]  Jaroslav Hájek,et al.  Theory of rank tests , 1969 .

[2]  Rodney C. Wolff,et al.  Methods for estimating a conditional distribution function , 1999 .

[3]  J. Rao,et al.  On estimating distribution functions and quantiles from survey data using auxiliary information , 1990 .

[4]  R. Chambers Outlier Robust Finite Population Estimation , 1986 .

[5]  R. Chambers,et al.  Estimating distribution functions from survey data , 1986 .

[6]  M. C. Jones,et al.  Local Linear Quantile Regression , 1998 .

[7]  Anthony Y. C. Kuk,et al.  Estimation of distribution functions and medians under sampling with unequal probabilities , 1988 .

[8]  Alan H. Dorfman,et al.  A COMPARISON OF DESIGN-BASED AND MODEL-BASED ESTIMATORS OF THE FINITE POPULATION DISTRIBUTION FUNCTION , 1993 .

[9]  Alan H. Dorfman,et al.  Bias Robust Estimation in Finite Populations Using Nonparametric Calibration , 1993 .

[10]  E. Ronchetti,et al.  Bias‐calibrated estimation from sample surveys containing outliers , 1998 .

[11]  M. H. Hansen An Evaluation of Model-Dependent and Probability-Sampling Inferences in Sample Surveys , 1983 .

[12]  P. Sen,et al.  Theory of rank tests , 1969 .

[13]  D. Pfeffermann The Role of Sampling Weights when Modeling Survey Data , 1993 .

[14]  Chris J. Skinner,et al.  Estimating distribution functions with auxiliary information using poststratification , 1995 .

[15]  Anthony Y. C. Kuk,et al.  Median Estimation in the Presence of Auxiliary Information , 1989 .

[16]  David Firth,et al.  Robust models in probability sampling , 1998 .