Principal component analysis for functional data on Riemannian manifolds and spheres

Functional data analysis on nonlinear manifolds has drawn recent interest. Sphere-valued functional data, which are encountered for example as movement trajectories on the surface of the earth, are an important special case. We consider an intrinsic principal component analysis for smooth Riemannian manifold-valued functional data and study its asymptotic properties. Riemannian functional principal component analysis (RFPCA) is carried out by first mapping the manifold-valued data through Riemannian logarithm maps to tangent spaces around the time-varying Frechet mean function, and then performing a classical multivariate functional principal component analysis on the linear tangent spaces. Representations of the Riemannian manifold-valued functions and the eigenfunctions on the original manifold are then obtained with exponential maps. The tangent-space approximation through functional principal component analysis is shown to be well-behaved in terms of controlling the residual variation if the Riemannian manifold has nonnegative curvature. Specifically, we derive a central limit theorem for the mean function, as well as root-$n$ uniform convergence rates for other model components, including the covariance function, eigenfunctions, and functional principal component scores. Our applications include a novel framework for the analysis of longitudinal compositional data, achieved by mapping longitudinal compositional data to trajectories on the sphere, illustrated with longitudinal fruit fly behavior patterns. RFPCA is shown to be superior in terms of trajectory recovery in comparison to an unrestricted functional principal component analysis in applications and simulations and is also found to produce principal component scores that are better predictors for classification compared to traditional functional functional principal component scores.

[1]  J. Marron,et al.  Analysis of principal nested spheres. , 2012, Biometrika.

[2]  Nicholas I. Fisher,et al.  Statistical Analysis of Spherical Data. , 1987 .

[3]  Yu Zheng,et al.  Trajectory Data Mining , 2015, ACM Trans. Intell. Syst. Technol..

[4]  I. Chavel Riemannian Geometry: Subject Index , 2006 .

[5]  G. S. Watson,et al.  Smooth regression analysis , 1964 .

[6]  E. Nadaraya On Estimating Regression , 1964 .

[7]  Dong Chen,et al.  Nonlinear manifold representations for functional data , 2012, 1205.6040.

[8]  Stephan F. Huckemann,et al.  Backward nested descriptors asymptotics with inference on stem cell differentiation , 2016, The Annals of Statistics.

[9]  Michael B. Marcus,et al.  Central limit theorems for C(S)-valued random variables , 1975 .

[10]  H. Muller,et al.  Functional data analysis for density functions by transformation to a Hilbert space , 2016, 1601.02869.

[11]  F. Yao,et al.  Functional regression on the manifold with contamination , 2017 .

[12]  P. Thomas Fletcher,et al.  Principal geodesic analysis for the study of nonlinear statistics of shape , 2004, IEEE Transactions on Medical Imaging.

[13]  E. A. Sylvestre,et al.  Principal modes of variation for processes with continuous sample curves , 1986 .

[14]  T. K. Carne,et al.  Shape and Shape Theory , 1999 .

[15]  T. Hsing,et al.  Theoretical foundations of functional data analysis, with an introduction to linear operators , 2015 .

[16]  B. Afsari Riemannian Lp center of mass: existence, uniqueness, and convexity , 2011 .

[17]  Geert Molenberghs,et al.  European Surveillance of Antimicrobial Consumption (ESAC): outpatient antibiotic use in Europe (1997-2009). , 2011, The Journal of antimicrobial chemotherapy.

[18]  K. J. Utikal,et al.  Inference for Density Families Using Functional Principal Component Analysis , 2001 .

[19]  P. Jupp,et al.  Fitting Smooth Paths to Spherical Data , 1987 .

[20]  Leif Ellingson,et al.  Nonparametric Statistics on Manifolds and Their Applications to Object Data Analysis , 2015 .

[21]  S. R. Jammalamadaka,et al.  Directional Statistics, I , 2011 .

[22]  David B. Dunson,et al.  Extrinsic Local Regression on Manifold-Valued Data , 2015, Journal of the American Statistical Association.

[23]  Jeng-Min Chiou,et al.  Multivariate functional principal component analysis: A normalization approach , 2014 .

[24]  Jane-Ling Wang,et al.  Review of Functional Data Analysis , 2015, 1507.05135.

[25]  Rabi Bhattacharya,et al.  Omnibus CLTs for Fr\'echet means and nonparametric inference on non-Euclidean spaces , 2013, 1306.5806.

[26]  F. Yao,et al.  Functional Regression with Unknown Manifold Structures , 2017, 1704.03005.

[27]  Peter Schröder,et al.  Multiscale Representations for Manifold-Valued Data , 2005, Multiscale Model. Simul..

[28]  Herman Goossens,et al.  European Surveillance of Antimicrobial Consumption (ESAC): outpatient antibiotic use in Europe. , 2011, The Journal of antimicrobial chemotherapy.

[29]  H. Muller,et al.  Fréchet regression for random objects with Euclidean predictors , 2016, The Annals of Statistics.

[30]  Nicolas Courty,et al.  Motion Compression using Principal Geodesics Analysis , 2009, Comput. Graph. Forum.

[31]  D. Bosq Linear Processes in Function Spaces: Theory And Applications , 2000 .

[32]  Rushil Anirudh,et al.  Elastic Functional Coding of Riemannian Trajectories , 2016, IEEE Transactions on Pattern Analysis and Machine Intelligence.

[33]  Hongtu Zhu,et al.  Regression models on Riemannian symmetric spaces , 2017, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[34]  Nikos Papadopoulos,et al.  Age-specific and lifetime behavior patterns in Drosophila melanogaster and the Mediterranean fruit fly, Ceratitis capitata , 2006, Experimental Gerontology.

[35]  Jon A. Wellner,et al.  Weak Convergence and Empirical Processes: With Applications to Statistics , 1996 .

[36]  K. Mardia,et al.  Functional models of growth for landmark data , 2010 .

[37]  Rushil Anirudh,et al.  Elastic functional coding of human actions: From vector-fields to latent variables , 2015, 2015 IEEE Conference on Computer Vision and Pattern Recognition (CVPR).

[38]  A. Munk,et al.  INTRINSIC SHAPE ANALYSIS: GEODESIC PCA FOR RIEMANNIAN MANIFOLDS MODULO ISOMETRIC LIE GROUP ACTIONS , 2007 .

[39]  M. Pierrynowski,et al.  Functional Inference on Rotational Curves and Identification of Human Gait at the Knee Joint , 2016, 1611.03665.

[40]  John Aitchison,et al.  The Statistical Analysis of Compositional Data , 1986 .

[41]  Hans-Georg Müller,et al.  Functional Data Analysis , 2016 .

[42]  John A. D. Aston,et al.  Smooth Principal Component Analysis over two-dimensional manifolds with an application to Neuroimaging , 2016, 1601.03670.

[43]  Peter X.-K. Song,et al.  Simplex Mixed‐Effects Models for Longitudinal Proportional Data , 2008 .

[44]  R. Bhattacharya,et al.  Large sample theory of intrinsic and extrinsic sample means on manifolds--II , 2005, math/0507423.