lvlodel Validation and Spatial Interpolation by Combining Observations \ vith Outputs from Numerical lvlodels via Bayesian lvlelding 1

Constructing maps of pollution levels is vital for air quality management, and presents statistical problems typical of many environmental and spatial applications. Ideally, such maps would be based on a dense network of monitoring stations, but this does not exist. Instead, there are two main sources of information in the U.S.: one is pollution measurements at a sparse set of about 50 monitoring stations called CASTNet, and the other is pollution emissions data. The pollution emissions data do not give direct information about pollution levels, but instead are combined with numerical models of weather and the emissions process and information about land use and cover (collectively called Models-3), to produce maps. Here we develop a formal method for combining these two sources of information. vVe specify a simple model for both the Models-3 output and the CASTNet observations in terms of the unobserved ground truth, and estimate the model in a Bayesian way. This yields solutions to the spatial prediction, model validation and bias removal problems simultaneously. It provides improved spatial prediction via the posterior distribution of the ground truth, allows us to validate Models-3 via the posterior predictive distribution of the CASTNet observations, and enables us to remove the bias in the Models-3 output by estimating additive and multiplicative bias parameters in the model. vVe apply our methods to data on 802 concentrations. Key tlJords: air pollution, Ba~yesian inference, change of support, likelihood approaches, Matern covariance, nonstationary process, spatial-temporal statistics.

[1]  Montserrat Fuentes,et al.  A high frequency kriging approach for non‐stationary environmental processes , 2001 .

[2]  Peter Guttorp,et al.  20 Methods for estimating heterogeneous spatial covariance functions with environmental applications , 1994, Environmental Statistics.

[3]  N. L. Johnson,et al.  Multivariate Analysis , 1958, Nature.

[4]  A. Raftery Analysis of a Simple Debugging Model. , 1988 .

[5]  M. Fuentes Spectral methods for nonstationary spatial processes , 2002 .

[6]  A. Raftery,et al.  Inference for Deterministic Simulation Models: The Bayesian Melding Approach , 2000 .

[7]  M. Stein Predicting random fields with increasing dense observations , 1999 .

[8]  J. F. Clarke,et al.  Dry deposition calculations for the clean air status and trends network , 1997 .

[9]  Robert Haining,et al.  Statistics for spatial data: by Noel Cressie, 1991, John Wiley & Sons, New York, 900 p., ISBN 0-471-84336-9, US $89.95 , 1993 .

[10]  A. Yaglom,et al.  An Introduction to the Theory of Stationary Random Functions , 1963 .

[11]  Montserrat Fuentes,et al.  A New Class of Nonstationary Spatial Models , 2001 .

[12]  P. Guttorp,et al.  Nonparametric Estimation of Nonstationary Spatial Covariance Structure , 1992 .

[13]  Daewon W. Byun,et al.  The next generation of integrated air quality modeling: EPA's models-3 , 1996 .

[14]  Noel A Cressie,et al.  Statistics for Spatial Data, Revised Edition. , 1994 .

[15]  P. Guttorp,et al.  A space-time analysis of ground-level ozone data , 1994 .

[16]  P D Sampson,et al.  Operational evaluation of air quality models. , 1999, Novartis Foundation symposium.

[17]  D. Rubin Bayesianly Justifiable and Relevant Frequency Calculations for the Applied Statistician , 1984 .

[18]  T. C. Haas,et al.  Local Prediction of a Spatio-Temporal Process with an Application to Wet Sulfate Deposition , 1995 .

[19]  Richard J. Smith,et al.  Estimating Nonstationary Spatial Correlations , 1996 .