Robust functional principal component analysis via a functional pairwise spatial sign operator

Functional principal component analysis (FPCA) has been widely used to capture major modes of variation and reduce dimensions in functional data analysis. However, standard FPCA based on the sample covariance estimator does not work well if the data exhibits heavy‐tailedness or outliers. To address this challenge, a new robust FPCA approach based on a functional pairwise spatial sign (PASS) operator, termed PASS FPCA, is introduced. We propose robust estimation procedures for eigenfunctions and eigenvalues. Theoretical properties of the PASS operator are established, showing that it adopts the same eigenfunctions as the standard covariance operator and also allows recovering ratios between eigenvalues. We also extend the proposed procedure to handle functional data measured with noise. Compared to existing robust FPCA approaches, the proposed PASS FPCA requires weaker distributional assumptions to conserve the eigenspace of the covariance function. Specifically, existing work are often built upon a class of functional elliptical distributions, which requires inherently symmetry. In contrast, we introduce a class of distributions called the weakly functional coordinate symmetry (weakly FCS), which allows for severe asymmetry and is much more flexible than the functional elliptical distribution family. The robustness of the PASS FPCA is demonstrated via extensive simulation studies, especially its advantages in scenarios with nonelliptical distributions. The proposed method was motivated by and applied to analysis of accelerometry data from the Objective Physical Activity and Cardiovascular Health Study, a large‐scale epidemiological study to investigate the relationship between objectively measured physical activity and cardiovascular health among older women.

[1]  Guangxing Wang,et al.  Robust Functional Principal Component Analysis via Functional Pairwise Spatial Signs , 2021, 2101.06415.

[2]  C. Jentsch,et al.  Asymptotic theory , 2018, Statistics.

[3]  Jacob Bien,et al.  Valid Inference Corrected for Outlier Removal , 2017, Journal of Computational and Graphical Statistics.

[4]  A. LaCroix,et al.  The Objective Physical Activity and Cardiovascular Disease Health in Older Women (OPACH) Study , 2017, BMC Public Health.

[5]  D. Vogel,et al.  The spatial sign covariance matrix and its application for robust correlation estimation , 2016, 1606.02274.

[6]  Hans-Georg Müller,et al.  Functional Data Analysis , 2016 .

[7]  David E. Tyler,et al.  On the eigenvalues of the spatial sign covariance matrix in more than two dimensions , 2015, 1512.02863.

[8]  Jane-Ling Wang,et al.  Review of Functional Data Analysis , 2015, 1507.05135.

[9]  G. Boente,et al.  S-Estimators for Functional Principal Component Analysis , 2015 .

[10]  D. Vogel,et al.  Robust Correlation Estimation and Testing Based on Spatial Signs , 2015 .

[11]  Han Liu,et al.  ECA: High-Dimensional Elliptical Component Analysis in Non-Gaussian Distributions , 2013, 1310.3561.

[12]  Dietrich Stoyan,et al.  Deviation test construction and power comparison for marked spatial point patterns , 2013, 1306.1028.

[13]  Victor M. Panaretos,et al.  Dispersion operators and resistant second-order functional data analysis , 2012 .

[14]  Runze Li,et al.  MULTIVARIATE VARYING COEFFICIENT MODEL FOR FUNCTIONAL RESPONSES. , 2012, Annals of statistics.

[15]  I. Koch,et al.  Robustifying principal component analysis with spatial sign vectors , 2012 .

[16]  David E. Tyler,et al.  Robust functional principal components: A projection-pursuit approach , 2011, 1203.2027.

[17]  G. Boente,et al.  Principal points and elliptical distributions from the multivariate setting to the functional case , 2009, 2006.04188.

[18]  D. Gervini Robust functional estimation using the median and spherical principal components , 2008 .

[19]  P. Hall,et al.  On properties of functional principal components analysis , 2006 .

[20]  Jane-ling Wang Nonparametric Regression Analysis of Longitudinal Data , 2005 .

[21]  H. Müller,et al.  Functional Data Analysis for Sparse Longitudinal Data , 2005 .

[22]  H. Cardot Nonparametric estimation of smoothed principal components analysis of sampled noisy functions , 2000 .

[23]  J. Marden Some robust estimates of principal components , 1999 .

[24]  A. M. Aguilera,et al.  Robust principal component analysis for functional data , 1999 .

[25]  J. O. Ramsay,et al.  Functional Data Analysis (Springer Series in Statistics) , 1997 .

[26]  B. Silverman,et al.  Smoothed functional principal components analysis by choice of norm , 1996 .

[27]  B. Silverman,et al.  Estimating the mean and covariance structure nonparametrically when the data are curves , 1991 .

[28]  J. Ramsay,et al.  Some Tools for Functional Data Analysis , 1991 .

[29]  E. A. Sylvestre,et al.  Principal modes of variation for processes with continuous sample curves , 1986 .

[30]  J. Dauxois,et al.  Asymptotic theory for the principal component analysis of a vector random function: Some applications to statistical inference , 1982 .

[31]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .