Similarity‐based semilocal estimation of post‐processing models

Weather forecasts are typically given in the form of forecast ensembles obtained from multiple runs of numerical weather prediction models with varying initial conditions and physics parameterizations. Such ensemble predictions tend to be biased and underdispersive and thus require statistical post‐processing. In the ensemble model output statistics approach, a probabilistic forecast is given by a single parametric distribution with parameters depending on the ensemble members. The paper proposes two semilocal methods for estimating the ensemble model output statistics coefficients where the training data for a specific observation station are augmented with corresponding forecast cases from stations with similar characteristics. Similarities between stations are determined by using either distance functions or clustering based on various features of the climatology, forecast errors and locations of the observation stations. In a case‐study on wind speed over Europe with forecasts from the ‘Grand limited area model ensemble prediction system’, the similarity‐based semilocal models proposed show significant improvement in predictive performance compared with standard regional and local estimation methods. They further allow for estimating complex models without numerical stability issues and are computationally more efficient than local parameter estimation.

[1]  J. A. Hartigan,et al.  A k-means clustering algorithm , 1979 .

[2]  Maurice K. Wong,et al.  Algorithm AS136: A k-means clustering algorithm. , 1979 .

[3]  S. Planton,et al.  A Simple Parameterization of Land Surface Processes for Meteorological Models , 1989 .

[4]  J. Kain,et al.  A One-Dimensional Entraining/Detraining Plume Model and Its Application in Convective Parameterization , 1990 .

[5]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[6]  Léon Bottou,et al.  Local Learning Algorithms , 1992, Neural Computation.

[7]  F. Diebold,et al.  Comparing Predictive Accuracy , 1994, Business Cycles.

[8]  Thomas M. Hamill,et al.  Verification of Eta–RSM Short-Range Ensemble Forecasts , 1997 .

[9]  Adrian E. Raftery,et al.  How Many Clusters? Which Clustering Method? Answers Via Model-Based Cluster Analysis , 1998, Comput. J..

[10]  Bent Hansen Sass A research version of the STRACO cloud scheme , 2002 .

[11]  D. Ruppert The Elements of Statistical Learning: Data Mining, Inference, and Prediction , 2004 .

[12]  Ecmwf Newsletter,et al.  EUROPEAN CENTRE FOR MEDIUM-RANGE WEATHER FORECASTS , 2004 .

[13]  A. Raftery,et al.  Using Bayesian Model Averaging to Calibrate Forecast Ensembles , 2005 .

[14]  Anton H. Westveld,et al.  Calibrated Probabilistic Forecasting Using Ensemble Model Output Statistics and Minimum CRPS Estimation , 2005 .

[15]  Adrian E. Raftery,et al.  Weather Forecasting with Ensemble Methods , 2005, Science.

[16]  A. Raftery,et al.  Probabilistic forecasts, calibration and sharpness , 2007 .

[17]  A. Raftery,et al.  Strictly Proper Scoring Rules, Prediction, and Estimation , 2007 .

[18]  P. Pinson,et al.  Trading Wind Generation From Short-Term Probabilistic Forecasts of Wind Power , 2007, IEEE Transactions on Power Systems.

[19]  Renate Hagedorn,et al.  Probabilistic Forecast Calibration Using ECMWF and GFS Ensemble Reforecasts. Part II: Precipitation , 2008 .

[20]  Tim N. Palmer,et al.  Ensemble forecasting , 2008, J. Comput. Phys..

[21]  T. Gneiting Making and Evaluating Point Forecasts , 2009, 0912.0902.

[22]  Richard Swinbank,et al.  Medium‐range multimodel ensemble combination and calibration , 2009 .

[23]  Robert Tibshirani,et al.  The Elements of Statistical Learning: Data Mining, Inference, and Prediction, 2nd Edition , 2001, Springer Series in Statistics.

[24]  Tilmann Gneiting,et al.  Probabilistic forecasts of wind speed: ensemble model output statistics by using heteroscedastic censored regression , 2010 .

[25]  M. Schmeits,et al.  A Comparison between Raw Ensemble Output, (Modified) Bayesian Model Averaging, and Extended Logistic Regression Using ECMWF Ensemble Precipitation Reforecasts , 2010 .

[26]  A. Raftery,et al.  Calibrating Multimodel Forecast Ensembles with Exchangeable and Missing Members Using Bayesian Model Averaging , 2010 .

[27]  Adrian E. Raftery,et al.  Locally Calibrated Probabilistic Temperature Forecasting Using Geostatistical Model Averaging and Local Bayesian Model Averaging , 2011 .

[28]  T. Iversen,et al.  Evaluation of ‘GLAMEPS’—a proposed multimodel EPS for short range forecasting , 2011 .

[29]  Adrian E. Raftery,et al.  Probabilistic Weather Forecasting in R , 2011 .

[30]  Renate Hagedorn,et al.  Comparing TIGGE multimodel forecasts with reforecast‐calibrated ECMWF ensemble forecasts , 2012 .

[31]  T. Gneiting,et al.  Ensemble Model Output Statistics for Wind Vectors , 2012, 1201.2612.

[32]  J. Ruiz,et al.  How sensitive are probabilistic precipitation forecasts to the choice of calibration algorithms and the ensemble generation method? Part I: sensitivity to calibration methods , 2012 .

[33]  Pierre Pinson,et al.  Verification of the ECMWF ensemble forecasts of wind speed against analyses and observations , 2012 .

[34]  J. M. Sloughter,et al.  Probabilistic Wind Speed Forecasting Using Ensembles and Bayesian Model Averaging , 2010 .

[35]  Piet Termonia,et al.  Evaluating the performance of SURFEXv5 as a new land surface scheme for the ALADINcy36 and ALARO-0 models , 2013 .

[36]  T. Thorarinsdottir,et al.  Comparison of non-homogeneous regression models for probabilistic wind speed forecasting , 2013, 1305.2026.

[37]  Hadley Wickham,et al.  ggmap: Spatial Visualization with ggplot2 , 2013, R J..

[38]  M. Scheuerer Probabilistic quantitative precipitation forecasting using Ensemble Model Output Statistics , 2013, 1302.0893.

[39]  Pierre Pinson,et al.  Wind Energy: Forecasting Challenges for Its Operational Management , 2013, 1312.6471.

[40]  Z. B. Bouallègue,et al.  Enhancing COSMO-DE ensemble forecasts by inexpensive techniques , 2013 .

[41]  Sándor Baran,et al.  Probabilistic wind speed forecasting using Bayesian model averaging with truncated normal components , 2013, Comput. Stat. Data Anal..

[42]  S. Baran,et al.  Log‐normal distribution based Ensemble Model Output Statistics models for probabilistic wind‐speed forecasting , 2014, 1407.3252.

[43]  M. Scheuerer,et al.  Spatially adaptive post‐processing of ensemble forecasts for temperature , 2013, 1302.0883.

[44]  T. Gneiting 719 Calibration of Medium-Range Weather Forecasts , 2014 .

[45]  Christopher A. T. Ferro,et al.  A comparison of ensemble post‐processing methods for extreme events , 2014 .

[46]  F. Pappenberger,et al.  Trends in the predictive performance of raw ensemble weather forecasts , 2014 .

[47]  Trevor Hastie,et al.  CUSTOMIZED TRAINING WITH AN APPLICATION TO MASS SPECTROMETRIC IMAGING OF CANCER TISSUE. , 2015, The annals of applied statistics.

[48]  Constantin Junk,et al.  Analog-Based Ensemble Model Output Statistics , 2015 .

[49]  Alain Joly,et al.  PEARP, the Météo‐France short‐range ensemble prediction system , 2015 .

[50]  Michael Scheuerer,et al.  Probabilistic wind speed forecasting on a grid based on ensemble model output statistics , 2015, 1511.02001.

[51]  T. Thorarinsdottir,et al.  Spatial Postprocessing of Ensemble Forecasts for Temperature Using Nonhomogeneous Gaussian Regression , 2014, 1407.0058.

[52]  A. Lenkoski,et al.  Spatially adaptive, Bayesian estimation for probabilistic temperature forecasts , 2015, 1507.05066.

[53]  Roman Schefzik,et al.  A Similarity-Based Implementation of the Schaake Shuffle , 2015, 1507.02079.

[54]  S. Baran,et al.  Mixture EMOS model for calibrating ensemble forecasts of wind speed , 2015, Environmetrics.

[55]  Florian Pappenberger,et al.  The TIGGE Project and Its Achievements , 2016 .