Functional principal component analysis of spatially correlated data

This paper focuses on the analysis of spatially correlated functional data. We propose a parametric model for spatial correlation and the between-curve correlation is modeled by correlating functional principal component scores of the functional data. Additionally, in the sparse observation framework, we propose a novel approach of spatial principal analysis by conditional expectation to explicitly estimate spatial correlations and reconstruct individual curves. Assuming spatial stationarity, empirical spatial correlations are calculated as the ratio of eigenvalues of the smoothed covariance surface Cov$$(X_i(s),X_i(t))$$(Xi(s),Xi(t)) and cross-covariance surface Cov$$(X_i(s), X_j(t))$$(Xi(s),Xj(t)) at locations indexed by i and j. Then a anisotropy Matérn spatial correlation model is fitted to empirical correlations. Finally, principal component scores are estimated to reconstruct the sparsely observed curves. This framework can naturally accommodate arbitrary covariance structures, but there is an enormous reduction in computation if one can assume the separability of temporal and spatial components. We demonstrate the consistency of our estimates and propose hypothesis tests to examine the separability as well as the isotropy effect of spatial correlation. Using simulation studies, we show that these methods have some clear advantages over existing methods of curve reconstruction and estimation of model parameters.

[1]  K. Haskard,et al.  An anisotropic Matern spatial covariance model: REML estimation and properties. , 2007 .

[2]  Catherine A. Sugar,et al.  Clustering for Sparsely Sampled Functional Data , 2003 .

[3]  Bernard W. Silverman,et al.  Functional Data Analysis , 1997 .

[4]  John A. D. Aston,et al.  Tests for separability in nonparametric covariance operators of random surfaces , 2015, 1505.02023.

[5]  Jie Peng,et al.  A Geometric Approach to Maximum Likelihood Estimation of the Functional Principal Components From Sparse Longitudinal Data , 2007, 0710.5343.

[6]  Surajit Ray,et al.  Functional factor analysis for periodic remote sensing data , 2012, 1206.6962.

[7]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[8]  Thorsten Gerber,et al.  Handbook Of Mathematical Functions , 2016 .

[9]  H. Müller,et al.  Shrinkage Estimation for Functional Principal Component Scores with Application to the Population Kinetics of Plasma Folate , 2003, Biometrics.

[10]  Peter J. Diggle,et al.  An Introduction to Model-Based Geostatistics , 2003 .

[11]  R. Fletcher,et al.  A New Approach to Variable Metric Algorithms , 1970, Comput. J..

[12]  Robin Thompson,et al.  Estimation of genetic parameters. , 2005 .

[13]  A. Rao,et al.  Estimation of Genetic Parameters: principles , 2003 .

[14]  T. Hsing,et al.  Uniform convergence rates for nonparametric regression and principal component analysis in functional/longitudinal data , 2010, 1211.2137.

[15]  Robert Haining,et al.  Statistics for spatial data: by Noel Cressie, 1991, John Wiley & Sons, New York, 900 p., ISBN 0-471-84336-9, US $89.95 , 1993 .

[16]  B. Silverman,et al.  Estimating the mean and covariance structure nonparametrically when the data are curves , 1991 .

[17]  F. Yao,et al.  Penalized spline models for functional principal component analysis , 2006 .

[18]  D. Goldfarb A family of variable-metric methods derived by variational means , 1970 .

[19]  B. Mallick,et al.  Bayesian Hierarchical Spatially Correlated Functional Data Analysis with Application to Colon Carcinogenesis , 2008, Biometrics.

[20]  Ana-Maria Staicu,et al.  Modeling Functional Data with Spatially Heterogeneous Shape Characteristics , 2012, Biometrics.

[21]  Clifford M. Hurvich,et al.  Smoothing parameter selection in nonparametric regression using an improved Akaike information criterion , 1998 .

[22]  Jie Peng,et al.  Principal components analysis for sparsely observed correlated functional data using a kernel smoothing approach , 2008, 0807.1106.

[23]  Marina Fruehauf,et al.  Nonlinear Programming Analysis And Methods , 2016 .

[24]  Ana-Maria Staicu,et al.  Fast methods for spatially correlated multilevel functional data. , 2010, Biostatistics.

[25]  Jeng-Min Chiou,et al.  Identifying cluster number for subspace projected functional data clustering , 2011, Comput. Stat. Data Anal..

[26]  Arnab Maity,et al.  Reduced Rank Mixed Effects Models for Spatially Correlated Hierarchical Functional Data , 2010, Journal of the American Statistical Association.

[27]  Piotr Kokoszka,et al.  Estimation and testing for spatially indexed curves with application to ionospheric and magnetic field trends , 2012, 1206.6655.

[28]  Sudipto Banerjee,et al.  Coregionalized Single‐ and Multiresolution Spatially Varying Growth Curve Modeling with Application to Weed Growth , 2006, Biometrics.

[29]  C. G. Broyden The Convergence of a Class of Double-rank Minimization Algorithms 1. General Considerations , 1970 .

[30]  H. Müller,et al.  Functional Data Analysis for Sparse Longitudinal Data , 2005 .

[31]  P. Hall,et al.  Properties of principal component methods for functional and longitudinal data analysis , 2006, math/0608022.

[32]  Alessandra Menafoglio,et al.  Kriging for Hilbert-space valued random fields: The operatorial point of view , 2016, J. Multivar. Anal..

[33]  R. Carroll,et al.  Nonparametric estimation of correlation functions in longitudinal and spatial data, with application to colon carcinogenesis experiments , 2007, 0710.3638.

[34]  D. Shanno Conditioning of Quasi-Newton Methods for Function Minimization , 1970 .