Preprocessing noisy functional data using factor models

We consider functional data which are measured on a discrete set of observation points. Often such data are measured with noise, and then the target is to recover the underlying signal. Most commonly, practitioners use some smoothing approach, e.g.,\ kernel smoothing or spline fitting towards this goal. The drawback of such curve fitting techniques is that they act function by function, and don't take into account information from the entire sample. In this paper we argue that signal and noise can be naturally represented as the common and idiosyncratic component, respectively, of a factor model. Accordingly, we propose to an estimation scheme which is based on factor models. The purpose of this paper is to explain the reasoning behind our approach and to compare its performance on simulated and on real data to competing methods.

[1]  D. Bosq Linear Processes in Function Spaces: Theory And Applications , 2000 .

[2]  Piotr Kokoszka,et al.  Inference for Functional Data with Applications , 2012 .

[3]  H. Müller,et al.  Functional Data Analysis for Sparse Longitudinal Data , 2005 .

[4]  J. Bai,et al.  Determining the Number of Factors in Approximate Factor Models , 2000 .

[5]  In Choi,et al.  EFFICIENT ESTIMATION OF FACTOR MODELS , 2011, Econometric Theory.

[6]  G. S. Watson,et al.  Smooth regression analysis , 1964 .

[7]  Marco Lippi,et al.  The Generalized Dynamic Factor Model , 2002 .

[8]  Anja Vogler,et al.  An Introduction to Multivariate Statistical Analysis , 2004 .

[9]  S. Wood Generalized Additive Models: An Introduction with R , 2006 .

[10]  M. Hallin,et al.  The Generalized Dynamic-Factor Model: Identification and Estimation , 2000, Review of Economics and Statistics.

[11]  Marco Lippi,et al.  THE GENERALIZED DYNAMIC FACTOR MODEL: REPRESENTATION THEORY , 2001, Econometric Theory.

[12]  Marie Frei,et al.  Functional Data Analysis With R And Matlab , 2016 .

[13]  Jane-ling Wang Nonparametric Regression Analysis of Longitudinal Data , 2005 .

[14]  A. Onatski Determining the Number of Factors from Empirical Distribution of Eigenvalues , 2010, The Review of Economics and Statistics.

[15]  J. Bai,et al.  Inferential Theory for Factor Models of Large Dimensions , 2003 .

[16]  Matthew P. Wand,et al.  Kernel Smoothing , 1995 .

[17]  Marc Hallin,et al.  The Generalized Dynamic Factor Model. One-Sided Estimation and Forecasting , 2003 .

[18]  Jianqing Fan,et al.  Large covariance estimation by thresholding principal orthogonal complements , 2011, Journal of the Royal Statistical Society. Series B, Statistical methodology.

[19]  Kunpeng Li,et al.  STATISTICAL ANALYSIS OF FACTOR MODELS OF HIGH DIMENSION , 2012, 1205.6617.

[20]  Kunpeng Li,et al.  Maximum Likelihood Estimation and Inference for Approximate Factor Models of High Dimension , 2016, Review of Economics and Statistics.

[21]  J. Stock,et al.  Forecasting Using Principal Components From a Large Number of Predictors , 2002 .

[22]  A. B. Owen,et al.  Bi-cross-validation for factor analysis , 2015, 1503.03515.

[23]  Yuan Liao,et al.  Efficient Estimation of Approximate Factor Models via Regularized Maximum Likelihood , 2012, 1209.5911.

[24]  P. Kokoszka,et al.  Weakly dependent functional data , 2010, 1010.0792.

[25]  Fang Yao,et al.  Functional Variance Processes , 2006 .

[26]  Volker S. Schmid,et al.  Stochastic geometry, spatial statistics and random fields : models and algorithms , 2014 .

[27]  Hans-Georg Müller,et al.  Functional Data Analysis , 2016 .

[28]  J. Stock,et al.  Macroeconomic Forecasting Using Diffusion Indexes , 2002 .

[29]  E. Nadaraya On Estimating Regression , 1964 .

[30]  M. Hallin,et al.  Determining the Number of Factors in the General Dynamic Factor Model , 2007 .

[31]  M. Rothschild,et al.  Arbitrage, Factor Structure, and Mean-Variance Analysis on Large Asset Markets , 1982 .