Multi‐objective calibration of forecast ensembles using Bayesian model averaging

[1] Bayesian Model Averaging (BMA) has recently been proposed as a method for statistical postprocessing of forecast ensembles from numerical weather prediction models. The BMA predictive probability density function (PDF) of any weather quantity of interest is a weighted average of PDFs centered on the bias-corrected forecasts from a set of different models. However, current applications of BMA calibrate the forecast specific PDFs by optimizing a single measure of predictive skill. Here we propose a multi-criteria formulation for postprocessing of forecast ensembles. Our multi-criteria framework implements different diagnostic measures to reflect different but complementary metrics of forecast skill, and uses a numerical algorithm to solve for the Pareto set of parameters that have consistently good performance across multiple performance metrics. Two illustrative case studies using 48-hour ensemble data of surface temperature and sea level pressure, and multi-model seasonal forecasts of temperature, show that a multi-criteria formulation provides a more appealing basis for selecting the appropriate BMA model.

[1]  H. Hersbach Decomposition of the Continuous Ranked Probability Score for Ensemble Prediction Systems , 2000 .

[2]  G. Casella,et al.  Statistical Inference , 2003, Encyclopedia of Social Network Analysis and Mining.

[3]  Andrew P. Morse,et al.  DEVELOPMENT OF A EUROPEAN MULTIMODEL ENSEMBLE SYSTEM FOR SEASONAL-TO-INTERANNUAL PREDICTION (DEMETER) , 2004 .

[4]  T. A. Brown Admissible Scoring Systems for Continuous Distributions. , 1974 .

[5]  J. M. Sloughter,et al.  Probabilistic Quantitative Precipitation Forecasting Using Bayesian Model Averaging , 2007 .

[6]  A. Raftery,et al.  Using Bayesian Model Averaging to Calibrate Forecast Ensembles , 2005 .

[7]  M. Claussen,et al.  The atmospheric general circulation model ECHAM-4: Model description and simulation of present-day climate , 1996 .

[8]  T. Palmer,et al.  Development of a European Multi-Model Ensemble System for Seasonal to Inter-Annual Prediction (DEMETER) , 2004 .

[9]  A. Hense,et al.  A Bayesian approach to climate model evaluation and multi‐model averaging with an application to global mean surface temperatures from IPCC AR4 coupled climate models , 2006 .

[10]  Gerry Leversha,et al.  Statistical inference (2nd edn), by Paul H. Garthwaite, Ian T. Jolliffe and Byron Jones. Pp.328. £40 (hbk). 2002. ISBN 0 19 857226 3 (Oxford University Press). , 2003, The Mathematical Gazette.

[11]  M. Déqué,et al.  Seasonal predictability of tropical rainfall: probabilistic formulation and validation , 2001 .

[12]  Mojib Latif,et al.  The Max-Planck-Institute global ocean/sea ice model with orthogonal curvilinear coordinates , 2003 .

[13]  Bruce A. Robinson,et al.  Treatment of uncertainty using ensemble methods: Comparison of sequential data assimilation and Bayesian model averaging , 2007 .

[14]  V. Pope,et al.  The impact of new physical parametrizations in the Hadley Centre climate model: HadAM3 , 2000 .

[15]  R. L. Winkler,et al.  Scoring Rules for Continuous Probability Distributions , 1976 .

[16]  E. Grimit,et al.  Initial Results of a Mesoscale Short-Range Ensemble Forecasting System over the Pacific Northwest , 2002 .

[17]  Padraig Cunningham,et al.  Calibrating Probability Density Forecasts with Multi-Objective Search , 2006, ECAI.

[18]  A. H. Murphy,et al.  Diagnostic verification of probability forecasts , 1992 .