Penalized solutions to functional regression problems

Recent technological advances in continuous biological monitoring and personal exposure assessment have led to the collection of subject-specific functional data. A primary goal in such studies is to assess the relationship between the functional predictors and the functional responses. The historical functional linear model (HFLM) can be used to model such dependencies of the response on the history of the predictor values. An estimation procedure for the regression coefficients that uses a variety of regularization techniques is proposed. An approximation of the regression surface relating the predictor to the outcome by a finite-dimensional basis expansion is used, followed by penalization of the coefficients of the neighboring basis functions by restricting the size of the coefficient differences to be small. Penalties based on the absolute values of the basis function coefficient differences (corresponding to the LASSO) and the squares of these differences (corresponding to the penalized spline methodology) are studied. The fits are compared using an extension of the Akaike Information Criterion that combines the error variance estimate, degrees of freedom of the fit and the norm of the bases function coefficients. The performance of the proposed methods is evaluated via simulations. The LASSO penalty applied to the linearly transformed coefficients yields sparser representations of the estimated regression surface, while the quadratic penalty provides solutions with the smallest L(2)-norm of the basis functions coefficients. Finally, the new estimation procedure is applied to the analysis of the effects of occupational particulate matter (PM) exposure on the heart rate variability (HRV) in a cohort of boilermaker workers. Results suggest that the strongest association between PM exposure and HRV in these workers occurs as a result of point exposures to the increased levels of particulate matter corresponding to smoking breaks.

[1]  J. Ramsay,et al.  The historical functional linear model , 2003 .

[2]  Paul H. C. Eilers,et al.  Flexible smoothing with B-splines and penalties , 1996 .

[3]  R. Tibshirani,et al.  Varying‐Coefficient Models , 1993 .

[4]  A. N. Tikhonov,et al.  Solutions of ill-posed problems , 1977 .

[5]  Jane-Ling Wang,et al.  Functional canonical analysis for square integrable stochastic processes , 2003 .

[6]  Thomas S. Shively,et al.  Variable Selection and Function Estimation in Additive Nonparametric Regression Using a Data-Based Prior , 1999 .

[7]  B. Silverman,et al.  Functional Data Analysis , 1997 .

[8]  A. Cuevas,et al.  Linear functional regression: The case of fixed design and functional response , 2002 .

[9]  David Ruppert,et al.  Variable Selection and Function Estimation in Additive Nonparametric Regression Using a Data-Based Prior: Comment , 1999 .

[10]  D. Braess Finite Elements: Theory, Fast Solvers, and Applications in Solid Mechanics , 1995 .

[11]  M P Wand,et al.  Generalized additive distributed lag models: quantifying mortality displacement. , 2000, Biostatistics.

[12]  Delbert J Eatough,et al.  Ambient particulate air pollution, heart rate variability, and blood markers of inflammation in a panel of elderly subjects. , 2004, Environmental health perspectives.

[13]  R. Tibshirani,et al.  On the “degrees of freedom” of the lasso , 2007, 0712.0881.

[14]  L. R. Scott,et al.  The Mathematical Theory of Finite Element Methods , 1994 .

[15]  Shirley Almon The Distributed Lag Between Capital Appropriations and Expenditures , 1965 .

[16]  Yves Grandvalet Least Absolute Shrinkage is Equivalent to Quadratic Penalization , 1998 .

[17]  James O. Ramsay,et al.  Applied Functional Data Analysis: Methods and Case Studies , 2002 .

[18]  Thomas J. Smith,et al.  Association of Heart Rate Variability With Occupational and Environmental Exposure to Particulate Air Pollution , 2001, Circulation.

[19]  D. Dockery,et al.  An association between air pollution and mortality in six U.S. cities. , 1993, The New England journal of medicine.