Non-parametric kernel and nearest neighbour estimates represent flexible alternatives to parametric modelling. For non-constant densities of the explanatory variables, kernel estimates are usually biased for finite samples, even in the case of linear regression functions. This can be seen by looking at the asymptotic expression of the bias of kernel regression estimates derived under certain mixing conditions. In this paper bias reduction techniques using asymmetric kernel functions are suggested. In contrast to well-designed experiments, in environmental data analysis the positions of the design points are not under control, and therefore the measurements of the explanatory variables are arbitrarily scattered over the factor space in an unbalanced way. In this case the asymmetric kernel techniques outperform the usual symmetric kernel methods as demonstrated by an application of both methods to the relationship between the oxygen concentration and the temperature of river water. Furthermore the new methods lead to better predictions in an autoregressive time series model for air pollution measurements such as nitrogen dioxide and sulphur dioxide concentrations.
[1]
M. Rosenblatt.
Remarks on Some Nonparametric Estimates of a Density Function
,
1956
.
[2]
E. Nadaraya.
On Estimating Regression
,
1964
.
[3]
G. S. Watson,et al.
Smooth regression analysis
,
1964
.
[4]
C. B. Tilanus,et al.
Applied Economic Forecasting
,
1966
.
[5]
Theo Gasser,et al.
Optimal convergence properties of kernel estimates of derivatives of a density function
,
1979
.
[6]
Y. Mack,et al.
Local Properties of k-NN Regression Estimates
,
1981
.
[7]
Gérard Collomb,et al.
From Non Parametric Regression to Non Parametric Prediction: Survey of the Mean Square Error and Original Results on the Predictogram
,
1983
.
[8]
P. Robinson.
NONPARAMETRIC ESTIMATORS FOR TIME SERIES
,
1983
.
[9]
Ricardo Fraiman,et al.
Asymptotic Distribution of Robust Estimators for Nonparametric Models from Mixing Processes
,
1990
.
[10]
Paul Michels.
Nichtparametrische Analyse und Prognose von Zeitreihen
,
1992
.