Evaluation of methods for the combination of phenological time series and outlier detection.

There are several applications of combined phenological time series; e.g., trend analysis with long continuous time series, obtaining a compound and representative time series around weather stations for model fitting, data gap filling and outlier detection. Various methods to combine phenological time series have been proposed. We show that all of these methods can be analyzed within the theory of linear models. This has the advantage that the underlying assumptions for each model become transparent providing a theoretical basis for selecting a model for a particular situation. Moreover, the common theoretical background provides a means of comparing methods by Monte-Carlo simulation and with real data. Additionally, we explored the influences of two outlier detection methods. We show that the error called the month-mistake, whose origin is known and which is one of the few mistakes that can be detected in phenological data because of its large deviation, is best detected by the distribution-free 30-day residual rule in combination with a robust estimation procedure based on the minimization of the sum of absolute residuals (L1-norm).

[1]  Joseph W. McKean,et al.  Coefficients of determination for least absolute deviation analysis , 1987 .

[2]  P. Hari,et al.  Predicting spring phenology and frost damage risk of Betula spp. under climatic warming: a comparison of two models. , 2000, Tree physiology.

[3]  D. Sengupta Linear models , 2003 .

[4]  P. Rousseeuw Least Median of Squares Regression , 1984 .

[5]  P. Hari,et al.  Methods for combining phenological time series: application to bud burst in birch (Betula pendula) in Central Finland for the period 1896-1955. , 1995, Tree physiology.

[6]  H. Hartley,et al.  Computing Maximum Likelihood Estimates for the Mixed A.O.V. Model Using the W Transformation , 1973 .

[7]  T. Linkosalo Analyses of the spring phenology of boreal trees and its response to climate change , 2000 .

[8]  I. Barrodale,et al.  An Improved Algorithm for Discrete $l_1 $ Linear Approximation , 1973 .

[9]  李幼升,et al.  Ph , 1989 .

[10]  H. D. Patterson,et al.  Recovery of inter-block information when block sizes are unequal , 1971 .

[11]  S. R. Searle,et al.  Restricted Maximum Likelihood (REML) Estimation of Variance Components in the Mixed Model , 1976 .

[12]  Richard K. Burdick Linear Models in Statistics , 2001, Technometrics.

[13]  T. A. Black,et al.  Effects of climatic variability on the annual carbon sequestration by a boreal aspen forest , 1999 .

[14]  Dallas E. Johnson,et al.  Analysis of messy data , 1992 .

[15]  M. Hubert The breakdown value of the L1 estimator in contingency tables , 1997 .

[16]  Peter J. Rousseeuw,et al.  Robust regression and outlier detection , 1987 .

[17]  W. R. Buckland,et al.  Outliers in Statistical Data , 1979 .

[18]  W. Steiger,et al.  Least Absolute Deviations: Theory, Applications and Algorithms , 1984 .

[19]  S. Running,et al.  The impact of growing-season length variability on carbon assimilation and evapotranspiration over 88 years in the eastern US deciduous forest , 1999, International journal of biometeorology.

[20]  Y. Dodge on Statistical data analysis based on the L1-norm and related methods , 1987 .

[21]  C. D. Keeling,et al.  Increased activity of northern vegetation inferred from atmospheric CO2 measurements , 1996, Nature.

[22]  C. Tucker,et al.  Increased plant growth in the northern high latitudes from 1981 to 1991 , 1997, Nature.

[23]  H. Hartley,et al.  Maximum-likelihood estimation for the mixed analysis of variance model. , 1967, Biometrika.

[24]  Josef Schmee,et al.  Analysis of Messy Data, Volume I: Designed Experiments , 1985 .

[25]  Ian Barrodale,et al.  Algorithm 478: Solution of an Overdetermined System of Equations in the l1 Norm [F4] , 1974, Commun. ACM.

[26]  K. Kramer,et al.  Modelling comparison to evaluate the importance of phenology and spring frost damage for the effects of climate change on growth of mixed temperate-zone deciduous forests , 1996 .

[27]  Robin Thompson Iterative Estimation of Variance Components for Non-Orthogonal Data , 1969 .

[28]  I. Leinonen,et al.  The importance of phenology for the evaluation of impact of climate change on growth of boreal, temperate and Mediterranean forests ecosystems: an overview , 2000, International journal of biometeorology.

[29]  D. E. Johnson,et al.  Analysis of Messy Data Volume I: Designed Experiments , 1985 .

[30]  Mia Hubert,et al.  Robust regression with both continuous and binary regressors , 1997 .

[31]  E. King On Some Procedures for the Rejection of Suspected Data , 1953 .

[32]  H. O. Hartley,et al.  A Simple 'Synthesis'-Based Method of Variance Component Estimation , 1978 .

[33]  J. William Munger,et al.  Exchange of Carbon Dioxide by a Deciduous Forest: Response to Interannual Climate Variability , 1996, Science.

[34]  P. Hari,et al.  Improving the reliability of a combined phenological time series by analyzing observation quality. , 1996, Tree physiology.

[35]  W. J. Dixon,et al.  Analysis of Extreme Values , 1950 .