Observation Influence Diagnostic of a Data Assimilation System

The influence matrix is used in ordinary least-squares applications for monitoring statistical multiple-regression analyses. Concepts related to the influence matrix provide diagnostics on the influence of individual data on the analysis, the analysis change that would occur by leaving one observation out, and the effective information content (degrees of freedom for signal) in any sub-set of the analysed data. In this paper, the corresponding concepts are derived in the context of linear statistical data assimilation in Numerical Weather Prediction. An approximate method to compute the diagonal elements of the influence matrix (the self-sensitivities) has been developed for a large-dimension variational data assimilation system (the 4D-Var system of the European Centre for Medium-Range Weather Forecasts). Results show that, in the ECMWF operational system, 18 % of the global influence is due to the assimilated observations, and the complementary 82 % is the influence of the prior (background) information, a short-range forecast containing information from earlier assimilated observations. About 20 % of the observational information is currently provided by surface-based observing systems, and 80 % by satellite systems.A toy-model is developed to illustrate how the observation influence depends on the data assimilation covariance matrices. In particular, the role of high-correlated observation error and high-correlated background error with respect to uncorrelated ones is presented. Low-influence data points usually occur in data-rich areas, while high-influence data points are in data-sparse areas or in dynamically active regions. Background error correlations also play an important role: high correlation diminishes the observation influence and amplifies the importance of the surrounding real and pseudo observations (prior information in observation space). To increase the observation influence in presence of high correlated background error is necessary to introduce the observation error correlation but also observation and background error variances must be of similar size. Incorrect specifications of background and observation error covariance matrices can be identified, interpreted and better understood by the use of influence matrix diagnostics for the variety of observation types and observed variables used in the data assimilation system.

[1]  C. Cardinali Monitoring the observation impact on the short‐range forecast , 2009 .

[2]  Rolf H. Langland,et al.  Estimation of observation impact using the NRL atmospheric variational data assimilation adjoint system , 2004 .

[3]  F. Mosteller,et al.  Understanding robust and exploratory data analysis , 1985 .

[4]  Philippe Courtier,et al.  Dynamical structure functions in a four‐dimensional variational assimilation: A case study , 1996 .

[5]  Jean-Noël Thépaut,et al.  Assimilation experiments with CHAMP GPS radio occultation measurements , 2006 .

[6]  Philippe Courtier,et al.  Interactions of Dynamics and Observations in a Four-Dimensional Variational Assimilation , 1993 .

[7]  Roberto Buizza,et al.  Impact of singular‐vector‐based satellite data thinning on NWP , 2011 .

[8]  J. Tukey Data analysis, computation and mathematics , 1972 .

[9]  Takemasa Miyoshi,et al.  Analysis sensitivity calculation in an ensemble Kalman filter , 2009 .

[10]  Rosemary Munro,et al.  Diagnosis of background errors for radiances and other observable quantities in a variational data assimilation scheme, and the explanation of a case of poor convergence , 2000 .

[11]  Jean-Paul Chilès,et al.  Wiley Series in Probability and Statistics , 2012 .

[12]  Peter Bauer,et al.  Observation errors in all‐sky data assimilation , 2011 .

[13]  Olivier Talagrand,et al.  Assimilation of Observations, an Introduction (gtSpecial IssueltData Assimilation in Meteology and Oceanography: Theory and Practice) , 1997 .

[14]  Carla Cardinali,et al.  Performance measurement with advanced diagnostic tools of all‐sky microwave imager radiances in 4D‐Var , 2011 .

[15]  Roy E. Welsch,et al.  Efficient Computing of Regression Diagnostics , 1981 .

[16]  Xiaotong Shen,et al.  Nonparametric Hypothesis Testing for a Spatial Signal , 2002 .

[17]  Florence Rabier,et al.  Channel selection methods for Infrared Atmospheric Sounding Interferometer radiances , 2002 .

[18]  Philippe Courtier,et al.  Four‐Dimensional Assimilation In the Presence of Baroclinic Instability , 1992 .

[19]  Jean-Noël Thépaut,et al.  The Spatial Structure of Observation Errors in Atmospheric Motion Vectors from Geostationary Satellite Data , 2003 .

[20]  G. Wahba Spline models for observational data , 1990 .

[21]  A. Simmons,et al.  The ECMWF operational implementation of four‐dimensional variational assimilation. I: Experimental results with simplified physics , 2007 .

[22]  Feng Gao,et al.  Adaptive Tuning of Numerical Weather Prediction Models: Randomized GCV in Three- and Four-Dimensional Data Assimilation , 1995 .

[23]  R. Welsch,et al.  The Hat Matrix in Regression and ANOVA , 1978 .

[24]  Peter Craven,et al.  Smoothing noisy data with spline functions , 1978 .

[25]  R. Gelaro,et al.  Observation Sensitivity Calculations Using the Adjoint of the Gridpoint Statistical Interpolation (GSI) Analysis System , 2008 .

[26]  Erik Andersson,et al.  Influence‐matrix diagnostic of a data assimilation system , 2004 .

[27]  Jianming Ye On Measuring and Correcting the Effects of Data Mining and Model Selection , 1998 .

[28]  Andrew C. Lorenc,et al.  Analysis methods for numerical weather prediction , 1986 .