Curve prediction and clustering with mixtures of Gaussian process functional regression models

Shi, Wang, Murray-Smith and Titterington (Biometrics 63:714–723, 2007) proposed a Gaussian process functional regression (GPFR) model to model functional response curves with a set of functional covariates. Two main problems are addressed by their method: modelling nonlinear and nonparametric regression relationship and modelling covariance structure and mean structure simultaneously. The method gives very good results for curve fitting and prediction but side-steps the problem of heterogeneity. In this paper we present a new method for modelling functional data with ‘spatially’ indexed data, i.e., the heterogeneity is dependent on factors such as region and individual patient’s information. For data collected from different sources, we assume that the data corresponding to each curve (or batch) follows a Gaussian process functional regression model as a lower-level model, and introduce an allocation model for the latent indicator variables as a higher-level model. This higher-level model is dependent on the information related to each batch. This method takes advantage of both GPFR and mixture models and therefore improves the accuracy of predictions. The mixture model has also been used for curve clustering, but focusing on the problem of clustering functional relationships between response curve and covariates, i.e. the clustering is based on the surface shape of the functional response against the set of functional covariates. The model is examined on simulated data and real data.

[1]  B. Silverman,et al.  Functional Data Analysis , 1997 .

[2]  P. Green Reversible jump Markov chain Monte Carlo computation and Bayesian model determination , 1995 .

[3]  Roderick Murray-Smith,et al.  Hierarchical Gaussian process mixtures for regression , 2005, Stat. Comput..

[4]  Adrian E. Raftery,et al.  MCLUST Version 3: An R Package for Normal Mixture Modeling and Model-Based Clustering , 2006 .

[5]  Zongwu Cai,et al.  Adaptive varying‐coefficient linear models , 2000 .

[6]  Adrian E. Raftery,et al.  MCLUST Version 3 for R: Normal Mixture Modeling and Model-Based Clustering † , 2007 .

[7]  Hans-Georg Ller,et al.  Functional Modelling and Classification of Longitudinal Data. , 2005 .

[8]  B. Silverman,et al.  Estimating the mean and covariance structure nonparametrically when the data are curves , 1991 .

[9]  G. Schwarz Estimating the Dimension of a Model , 1978 .

[10]  M. Stephens Bayesian analysis of mixture models with an unknown number of components- an alternative to reversible jump methods , 2000 .

[11]  Padhraic Smyth,et al.  Curve Clustering with Random Effects Regression Mixtures , 2003, AISTATS.

[12]  J. Faraway Regression analysis for a functional response , 1997 .

[13]  William M. Rand,et al.  Objective Criteria for the Evaluation of Clustering Methods , 1971 .

[14]  D. M. Titterington,et al.  Bayesian regression and classification using mixtures of Gaussian processes , 2003 .

[15]  O. Cappé,et al.  Reversible jump, birth‐and‐death and more general continuous time Markov chain Monte Carlo samplers , 2003 .

[16]  P. Green,et al.  Spatially Correlated Allocation Models for Count Data , 2000 .

[17]  P. Green,et al.  Modelling spatially correlated data via mixtures: a Bayesian approach , 2002 .

[18]  A. O'Hagan,et al.  Bayesian calibration of computer models , 2001 .

[19]  J Q Shi,et al.  Gaussian Process Functional Regression Modeling for Batch Data , 2007, Biometrics.

[20]  Geoffrey J. McLachlan,et al.  Finite Mixture Models , 2019, Annual Review of Statistics and Its Application.

[21]  D. Mackay,et al.  Introduction to Gaussian processes , 1998 .

[22]  A. F. Smith,et al.  Statistical analysis of finite mixture distributions , 1986 .

[23]  T. Louis Finding the Observed Information Matrix When Using the EM Algorithm , 1982 .

[24]  T. Bajd,et al.  Nonlinear modeling of FES-supported standing-up in paraplegia for selection of feedback sensors , 2005, IEEE Transactions on Neural Systems and Rehabilitation Engineering.

[25]  T Bajd,et al.  Functional electrical stimulation and arm supported sit-to-stand transfer after paraplegia: a study of kinetic parameters. , 1999, Artificial organs.

[26]  Catherine A. Sugar,et al.  Clustering for Sparsely Sampled Functional Data , 2003 .