Influence‐matrix diagnostic of a data assimilation system

The influence matrix is used in ordinary least-squares applications to monitor statistical multiple-regression analyses. Concepts related to the influence matrix provide diagnostics on the influence of individual data on the analysis, on the analysis change that would occur by leaving one observation out, and on the effective information content (degrees of freedom for signal) in any subset of the analysed data. In this paper, the corresponding concepts are derived in the context of linear statistical data assimilation in numerical weather prediction. An approximate method to compute the diagonal elements of the influence matrix (the self-sensitivities) has been developed for a large-dimension variational data assimilation system (the four-dimensional variational system of the European Centre for Medium-Range Weather Forecasts). Results show that, in the boreal spring 2003 operational system, 15% of the global influence is due to the assimilated observations in any one analysis, and the complementary 85% is the influence of the prior (background) information, a short-range forecast containing information from earlier assimilated observations. About 25% of the observational information is currently provided by surface-based observing systems, and 75% by satellite systems. Low-influence data points usually occur in data-rich areas, while high-influence data points occur in data-sparse areas or in dynamically active regions. Background-error correlations also play an important role: high correlation diminishes the observation influence and amplifies the importance of the surrounding real and pseudo observations (prior information in observation space). Incorrect specifications of the background-error and observation-error covariance matrices can be identified, interpreted and better understood by applying influence-matrix diagnostics to the variety of observation types and observed variables used in the data assimilation system. Copyright © 2004 Royal Meteorological Society
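As a rough illustration of the quantities named in the abstract, the sketch below builds the influence matrix for a toy two-observation problem. It assumes the standard linear analysis x_a = x_b + K(y - H x_b) with gain K = B H^T (H B H^T + R)^{-1}, so that the influence matrix is S = (H B H^T + R)^{-1} H B H^T. Its diagonal holds the self-sensitivities, its trace is the degrees of freedom for signal, and the trace divided by the number of observations is the global average observation influence. The choice H = I, the unit variances, and all function names are assumptions made for this example. The matrix is computed exactly here, which is not the approximate method developed for the operational system.

```python
# Toy 2x2 influence-matrix example in pure Python (no external libraries).
# Assumption: H = I, i.e. each observation measures one state variable directly.

def matmul2(a, b):
    """Product of two 2x2 matrices given as nested lists."""
    return [[sum(a[i][k] * b[k][j] for k in range(2)) for j in range(2)]
            for i in range(2)]

def inv2(m):
    """Inverse of a 2x2 matrix using the closed-form formula."""
    (a, b), (c, d) = m
    det = a * d - b * c
    return [[d / det, -b / det], [-c / det, a / det]]

def influence_matrix(B, R):
    """S = (H B H^T + R)^{-1} H B H^T with H = I, so S = (B + R)^{-1} B."""
    innovation_cov = [[B[i][j] + R[i][j] for j in range(2)] for i in range(2)]
    return matmul2(inv2(innovation_cov), B)

def summarize(rho, sigma_b2=1.0, sigma_o2=1.0):
    """Return (self-sensitivities, DFS, global average observation influence).

    rho is the background-error correlation between the two state variables;
    sigma_b2 and sigma_o2 are the background and observation error variances.
    """
    B = [[sigma_b2, rho * sigma_b2], [rho * sigma_b2, sigma_b2]]
    R = [[sigma_o2, 0.0], [0.0, sigma_o2]]  # uncorrelated observation errors
    S = influence_matrix(B, R)
    self_sensitivities = [S[0][0], S[1][1]]
    dfs = sum(self_sensitivities)           # trace of S
    return self_sensitivities, dfs, dfs / 2

if __name__ == "__main__":
    for rho in (0.0, 0.5, 0.9):
        sens, dfs, mean_influence = summarize(rho)
        print(f"rho={rho}: self-sensitivities={sens}, DFS={dfs:.3f}, "
              f"mean observation influence={mean_influence:.3f}")
```

With equal background and observation error variances and no background correlation, each observation and its background counterpart share the influence equally. Raising the correlation lowers the self-sensitivity, in line with the abstract's remark that high background-error correlation diminishes the observation influence. The complementary background influence in observation space is I - S under the same assumptions.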
