Functional mixture regression.

In functional linear models (FLMs), the relationship between the scalar response and the functional predictor process is usually assumed to be identical for all subjects. Motivated by both practical and methodological considerations, we relax this assumption and propose a new class of functional regression models that allow the regression structure to differ across groups of subjects. Projecting the predictor process onto its eigenspace reduces the new functional regression model to a framework similar to classical mixture regression models. We call the resulting approach functional mixture regression (FMR). FMR can be estimated readily with existing software for functional principal component analysis and mixture regression. The practical necessity and performance of FMR are illustrated through applications to a longevity analysis of female medflies and a human growth study. Theoretical results on the consistency of estimation and prediction for FMR, along with simulation experiments illustrating its empirical properties, are presented in the supplementary material available at Biostatistics online. These results indicate that the proposed approach can achieve substantial gains over traditional FLMs.
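The two-stage idea described in the abstract (project the curves onto their leading eigenfunctions, then fit a mixture regression on the resulting scores) can be sketched roughly as follows. This is a minimal illustration and not the authors' implementation: it assumes densely observed curves on a common equally spaced grid, Gaussian errors within each group, and a known number of groups. The function names `fpca_scores` and `fit_mixture_regression` are hypothetical, and the EM routine is a generic mixture-of-linear-regressions fit that stands in for whatever software one would use in practice.

```python
import numpy as np

def fpca_scores(X, n_components):
    """Principal component scores of curves X (n_subjects x n_gridpoints).

    Assumes a common, equally spaced grid; the scores are then correct up to
    a constant factor that depends on the grid spacing.
    """
    Xc = X - X.mean(axis=0)                       # center the curves pointwise
    U, s, _ = np.linalg.svd(Xc, full_matrices=False)
    return U[:, :n_components] * s[:n_components]

def fit_mixture_regression(Z, y, n_groups, n_iter=200, seed=0):
    """EM for a Gaussian mixture of linear regressions of y on the scores Z."""
    rng = np.random.default_rng(seed)
    n = len(y)
    D = np.column_stack([np.ones(n), Z])          # design matrix with intercept
    resp = rng.dirichlet(np.ones(n_groups), size=n)  # random initial memberships
    betas = np.empty((n_groups, D.shape[1]))
    sigma2 = np.empty(n_groups)
    for _ in range(n_iter):
        # M-step: weighted least squares and variance within each group
        weights = resp.mean(axis=0)
        for k in range(n_groups):
            w = resp[:, k]
            sw = np.sqrt(w)
            betas[k] = np.linalg.lstsq(D * sw[:, None], y * sw, rcond=None)[0]
            r = y - D @ betas[k]
            sigma2[k] = max((w * r**2).sum() / (w.sum() + 1e-12), 1e-8)
        # E-step: posterior group probabilities, computed on the log scale
        resid = y[:, None] - D @ betas.T
        log_post = (np.log(weights + 1e-300)
                    - 0.5 * (np.log(2 * np.pi * sigma2) + resid**2 / sigma2))
        log_post -= log_post.max(axis=1, keepdims=True)
        resp = np.exp(log_post)
        resp /= resp.sum(axis=1, keepdims=True)
    return weights, betas, sigma2, resp
```

A call such as `fit_mixture_regression(fpca_scores(X, 2), y, n_groups=2)` returns the mixing proportions, one coefficient vector per group, the group error variances, and the posterior membership probabilities. EM can converge to a local optimum, so several random starts (different `seed` values) are advisable, and the number of components and groups would need to be chosen by some model selection criterion.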
