Information-statistical approach for temporal-spatial data with application

Abstract A treatment for temporal-spatial data such as atmospheric temperature using an information-statistical approach is proposed. Conditioning on specific spatial nature of the data, the temporal aspect of the data is first modeled parametrically as Gaussian, and Schwarz information criterion is then applied to detect multiple mean change points—thus the Gaussian statistical models—to account for changes of the population mean over time. To examine the spatial characteristics of the data, successive mean change points are qualified by finite categorical values. The distribution of the finite categorical values is then used to estimate a non-parametric probability model through a non-linear SVD-based optimization approach; where the optimization criterion is Shannon expected entropy. This optimal probability model accounts for the spatial characteristics of the data and is then used to derive spatial association patterns subject to chi-square statistic hypothesis test. The proposed approach is applied to examine the weather data set obtained from NOAA. Selected temperature data are studied. These data cover different geographical localities in the United States, with some spanning over 200 years. Preliminary results are reported.

[1]  Thomas C. Peterson,et al.  On the Development and Use of Homogenized Climate Datasets , 1996 .

[2]  R. Fisher 034: The Conditions Under Which x2 Measures the Discrepancy Between Observation and Hypothesis. , 1924 .

[3]  Thomas C. Peterson,et al.  Creation of homogeneous composite climatological reference series , 1994 .

[4]  Detecting changes of mean in multidimensional normal sequences with applications to literature and geology , 1996 .

[5]  S. Woodruff,et al.  Changes in Global Surface Temperature From 1880 to 1977 Derived From Historical Records of Sea Surface Temperature , 1981 .

[6]  A. K. Shahani,et al.  Statistical Quality Control. , 1974 .

[7]  L. A. Goodman The Analysis of Cross-Classified Data: Independence, Quasi-Independence, and Interactions in Contingency Tables with or without Missing Entries , 1968 .

[8]  Carl August Zehnder,et al.  Numerical methods of mathematical optimization , 1971 .

[9]  Arjun K. Gupta,et al.  Testing and Locating Variance Changepoints with Application to Stock Prices , 1997 .

[10]  Narendra Karmarkar,et al.  A new polynomial-time algorithm for linear programming , 1984, Comb..

[11]  K. Borgwardt The Simplex Method: A Probabilistic Analysis , 1986 .

[12]  U. Grenander Elements of Pattern Theory , 1996 .

[13]  Yang Wang,et al.  High-Order Pattern Discovery from Discrete-Valued Data , 1997, IEEE Trans. Knowl. Data Eng..

[14]  T. Barnett Estimating Variability of Surface Air Temperature in the Northern Hemisphere , 1978 .

[15]  R. A. Leibler,et al.  On Information and Sufficiency , 1951 .

[16]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[17]  C. E. SHANNON,et al.  A mathematical theory of communication , 1948, MOCO.