On the perception of morphodynamic model skill

Abstract The quality of morphodynamic predictions is generally expressed by an overall grid-point based skill score, which measures the relative accuracy of a morphological prediction over a prediction of zero morphological change, using the mean-squared error (MSE) as the accuracy measure. Through a generic ranking for morphodynamic model predictions, this MSE based skill score (MSESS) aims at making model performance comparable across different prediction situations (geographical locations, forcing conditions, time periods, internal dynamics). The implicit assumptions underlying this approach are that the MSE is an appropriate measure of correspondence for morphological predictions and that the accuracy of the initial bed as the reference correctly reflects the inherent difficulty or ease of prediction situations. This paper presents a thorough analysis of the perception of model skill through the MSE skill score. Using synthetic examples, an example from literature and a long-yearly Delft3D model simulation, we demonstrate that unexpected skill may be reported due to a violation of either of the above assumptions. It is shown that the accuracy of the reference fails to reflect the relative difficulty of prediction situations with a different morphological development prior to the evaluation time (for instance trend, cyclic/seasonal, episodic, speed of the development). We further demonstrate that the MSESS tends to favor model results that underestimate the variance of cumulative bed changes, a feature inherited from the MSE. As a consequence of these limitations, the MSESS may report a relative ranking of predictions not matching the intuitive judgment of experts. Guidelines are suggested for how to adjust calibration and validation procedures to be more in line with a morphologist's expert judgment.

[1]  A. H. Murphy,et al.  Skill Scores and Correlation Coefficients in Model Verification , 1989 .

[2]  A. H. Murphy,et al.  Skill Scores Based on the Mean Square Error and Their Relationships to the Correlation Coefficient , 1988 .

[3]  J. Roelvink,et al.  Process‐based, morphodynamic hindcast of decadal deposition patterns in San Pablo Bay, California, 1856–1887 , 2011 .

[4]  W. Briggs Statistical Methods in the Atmospheric Sciences , 2007 .

[5]  G. Dodet,et al.  Morphological evolution of an ephemeral tidal inlet from opening to closure: The Albufeira inlet, Portugal , 2014 .

[6]  R. Anthes Regional Models of the Atmosphere in Middle Latitudes , 1983 .

[7]  Adrián Pedrozo-Acuña,et al.  On the cross-shore profile change of gravel beaches , 2006 .

[8]  R. McCall,et al.  Modelling gravel barrier profile response to combined waves and tides using XBeach: Laboratory and field results , 2012 .

[9]  Richard Soulsby,et al.  Evaluating the performance of morphological models , 2004 .

[10]  Kenneth J. Westrick,et al.  Does Increasing Horizontal Resolution Produce More Skillful Forecasts , 2002 .

[11]  G. Brier VERIFICATION OF FORECASTS EXPRESSED IN TERMS OF PROBABILITY , 1950 .

[12]  M. van der Wegen,et al.  Reproduction of estuarine bathymetry by means of a process-based model: Western Scheldt case study, the Netherlands , 2012 .

[13]  A. H. Murphy,et al.  General Decompositions of MSE-Based Skill Scores: Measures of Some Basic Aspects of Forecast Quality , 1996 .

[14]  Robert E. Livezey,et al.  Verification of official monthly mean 700-hPa height forecasts: an update , 1995 .

[15]  D. Walstra,et al.  Seasonal-scale nearshore morphological evolution: Field observations and numerical modeling , 2009 .

[16]  G. Stelling,et al.  Development and validation of a three-dimensional morphological model , 2004 .

[17]  C. Willmott Some Comments on the Evaluation of Model Performance , 1982 .

[18]  F. M. Achete Morphodynamics of the Ameland Bornrif: An analogue for the Sand Engine , 2011 .

[19]  A. H. Murphy,et al.  Scoring rules and the evaluation of probabilities , 1996 .

[20]  N. Plant,et al.  Two-dimensional time dependent hurricane overwash and erosion modeling at Santa Rosa Island , 2010 .

[21]  K. Taylor Summarizing multiple aspects of model performance in a single diagram , 2001 .

[22]  SCALE-SELECTIVE VALIDATION OF MORPHODYNAMIC MODELS , 2014 .

[23]  Dirk-Jan R. Walstra,et al.  On bar growth and decay during interannual net offshore migration , 2012 .

[24]  Ian L. Turner,et al.  Forecasting seasonal to multi-year shoreline change , 2010 .

[25]  B. G. Ruessink,et al.  Numerical predictability experiments of cross‐shore sandbar migration , 2008 .

[26]  D. Roelvink,et al.  Modelling storm impacts on beaches, dunes and barrier islands , 2009 .

[27]  A. H. Murphy,et al.  What Is a Good Forecast? An Essay on the Nature of Goodness in Weather Forecasting , 1993 .

[28]  R. L. Winkler Evaluating probabilities: asymmetric scoring rules , 1994 .

[29]  R. Ranasinghe,et al.  A New Alternative to Saving Our Beaches from Sea-Level Rise: The Sand Engine , 2013 .

[30]  A. Lorenc,et al.  The response of numerical weather prediction systems to fgge level iib data. Part II: Forecast verifications and implications for predictability , 2007 .

[31]  A. H. Murphy Climatology, Persistence, and Their Linear Combination as Standards of Reference in Skill Scores , 1992 .

[32]  Herman Gerritsen,et al.  Composite modelling of interactions between beaches and structures , 2011 .

[33]  Steve Elgar,et al.  Observations of sand bar evolution on a natural beach , 1998 .

[34]  N. Ganju,et al.  Discontinuous hindcast simulations of estuarine bathymetric change: A case study from Suisun Bay, California , 2011 .

[35]  Richard Taylor Interpretation of the Correlation Coefficient: A Basic Review , 1990 .

[36]  P. A. Newberger,et al.  Nearshore sandbar migration predicted by an eddy-diffusive boundary layer model , 2004 .

[37]  Jamie MacMahan,et al.  Megacusps on rip channel bathymetry: Observations and modeling , 2011 .

[38]  A. Reniers,et al.  Displacement-based error metrics for morphodynamic models , 2014 .

[39]  Mick van der Wegen,et al.  LONG-TERM PERFORMANCE OF PROCESS-BASED MODELS IN ESTUARIES , 2013 .

[40]  Hoshin Vijai Gupta,et al.  Decomposition of the mean squared error and NSE performance criteria: Implications for improving hydrological modelling , 2009 .

[41]  Shunqi Pan,et al.  The predictability of cross-shore bed evolution of sandy beaches at the time scale of storms and seasons using process-based Profile models , 2003 .

[42]  Tania Ruth Scott,et al.  Data assimilation for a coastal area morphodynamic model: Morecambe Bay , 2007 .

[43]  Glenn W. Brier,et al.  Verification of Weather Forecasts , 1951 .

[44]  T. Koh,et al.  A diagnostic suite to assess NWP performance , 2012 .

[45]  Ad Reniers,et al.  Modeling cross-shore sandbar behavior on the timescale of weeks , 2007 .

[46]  F.A.J. Minneboo Jaarlijkse kustmetingen: Richtlijnen voor de inwinning, bewerking, en opslag van gegevens van jaarlijkse kustmetingen , 1995 .

[47]  A. H. Murphy The Finley Affair: A Signal Event in the History of Forecast Verification , 1996 .