Spatial outlier detection in the PM10 monitoring network of Normandy (France)

Abstract We consider hourly PM 10 measurements from 22 monitoring stations located in Basse–Normandie and Haute–Normandie regions (France) and also in the neighboring regions. All considered monitoring stations are either urban background stations or rural ones. The paper focuses on the statistical detection of outliers of the hourly PM 10 concentrations from a spatial point of view. The general strategy uses a jackknife type approach and is based on the comparison of the actual measurement with some robust spatial prediction. Two spatial predictions are considered: the first one is based on the median of the concentrations of the closest neighboring stations which directly consider weighted concentrations while the second one is based on kriging increments, instead of more traditional pseudo–innovations. The two methods are applied to the PM 10 monitoring network in Normandy and are fully implemented by Air Normand (the official association for air quality monitoring in Haute–Normandie) in the Measurements Quality Control process. Some numerical results are provided on recent data from January 1, 2013 to May 31, 2013 to illustrate and compare the two methods.

[1]  Jean-Michel Poggi,et al.  Variable selection using random forests , 2010, Pattern Recognit. Lett..

[2]  D. Grancher,et al.  Estimation de champs de pollution par adaptation statistique locale et approche non stationnaire , 2005 .

[3]  Marco Riani,et al.  The Ordering of Spatial Data and the Detection of Multiple Outliers , 1999 .

[4]  Noel A Cressie,et al.  Statistics for Spatial Data, Revised Edition. , 1994 .

[5]  Peter Filzmoser,et al.  Noname manuscript No. (will be inserted by the editor) Identification of local multivariate outliers , 2022 .

[6]  B. Efron The jackknife, the bootstrap, and other resampling plans , 1987 .

[7]  Chang-Tien Lu,et al.  Spatial Weighted Outlier Detection , 2006, SDM.

[8]  Alma Hodzic,et al.  Long-term urban aerosol simulation versus routine particulate matter observations , 2005 .

[9]  Noel A Cressie,et al.  Statistics for Spatial Data. , 1992 .

[10]  Shashi Shekhar,et al.  A Unified Approach to Detecting Spatial Outliers , 2003, GeoInformatica.

[11]  Geostatistical analysis of the temporal variability of ozone concentrations. Comparison between CHIMERE model and surface observations , 2011 .

[12]  Sirin Nitinawarat,et al.  Universal outlier detection , 2013, 2013 Information Theory and Applications Workshop (ITA).

[13]  VARUN CHANDOLA,et al.  Anomaly detection: A survey , 2009, CSUR.

[14]  F. Meleux,et al.  Predictability of European air quality: Assessment of 3 years of operational forecasts and analyses by the PREV'AIR system , 2008 .

[15]  B. Efron,et al.  The Jackknife: The Bootstrap and Other Resampling Plans. , 1983 .

[16]  Edzer J. Pebesma,et al.  Multivariable geostatistics in S: the gstat package , 2004, Comput. Geosci..

[17]  Peter J. Rousseeuw,et al.  Robust regression and outlier detection , 1987 .

[18]  Viviane Planchon,et al.  Traitement des valeurs aberrantes : concepts actuels et tendances générales , 2005 .

[19]  Christine Thomas-Agnan,et al.  GeoXp: an R package for exploratory spatial data analysis , 2012 .

[20]  Vic Barnett,et al.  Environmental Statistics: Methods and Applications , 2004 .

[21]  Chang-Tien Lu,et al.  Algorithms for spatial outlier detection , 2003, Third IEEE International Conference on Data Mining.

[22]  Graham J. Wills,et al.  Dynamic Graphics for Exploring Spatial Data with Application to Locating Global and Local Anomalies , 1991 .

[23]  Peter J. Rousseeuw,et al.  Robust Regression and Outlier Detection , 2005, Wiley Series in Probability and Statistics.