Tree-based varying coefficient regression for longitudinal ordinal responses

A tree-based algorithm for longitudinal regression analysis that aims to learn whether and how the effects of predictor variables depend on moderating variables is presented. The algorithm is based on multivariate generalized linear mixed models and it builds piecewise constant coefficient functions. Moreover, it is scalable for many moderators of possibly mixed scales, integrates interactions between moderators and can handle nonlinearities. Although the scope of the algorithm is quite general, the focus is on its usage in an ordinal longitudinal regression setting. The potential of the algorithm is illustrated by using data derived from the British Household Panel Study, to show how the effect of unemployment on self-reported happiness varies across individual life circumstances.11R-codes and datasets are available online as supplementary files (see Appendix B).

[1]  Peter Norvig,et al.  Artificial Intelligence: A Modern Approach , 1995 .

[2]  O. Lipps,et al.  Does Unemployment Hurt Less if There is More of it Around? A Panel Analysis of Life Satisfaction in Germany and Switzerland , 2011, SSRN Electronic Journal.

[3]  Daowen Zhang Generalized Linear Mixed Models with Varying Coefficients for Longitudinal Data , 2004, Biometrics.

[4]  Christopher M. Bishop,et al.  Classification and regression , 1997 .

[5]  C. Field,et al.  Bootstrapping clustered data , 2007 .

[6]  P. McCullagh Regression Models for Ordinal Data , 1980 .

[7]  R. Tibshirani,et al.  Varying‐Coefficient Models , 1993 .

[8]  Soo-Heang Eo,et al.  Tree-Structured Mixed-Effects Regression Modeling for Longitudinal Data , 2014 .

[9]  L. Fahrmeir,et al.  Multivariate statistical modelling based on generalized linear models , 1994 .

[10]  D. Hedeker,et al.  A random-effects ordinal regression model for multilevel analysis. , 1994, Biometrics.

[11]  B. Singh,et al.  Inferential statistics on the dynamic system model with time-dependent failure-rate , 2013 .

[12]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[13]  Wesley O. Johnson,et al.  Interaction trees: exploring the differential effects of an intervention programme for breast cancer survivors , 2011 .

[14]  Achim Zeileis,et al.  Rasch Trees: A New Method for Detecting Differential Item Functioning in the Rasch Model , 2015, Psychometrika.

[15]  N. Hjort,et al.  Tests For Constancy Of Model Parameters Over Time , 2002 .

[16]  W. Loh,et al.  REGRESSION TREES WITH UNBIASED VARIABLE SELECTION AND INTERACTION DETECTION , 2002 .

[17]  K. Hornik,et al.  Model-Based Recursive Partitioning , 2008 .

[18]  K. Hornik,et al.  Unbiased Recursive Partitioning: A Conditional Inference Framework , 2006 .

[19]  A. Zeileis,et al.  Gaining insight with recursive partitioning of generalized linear models , 2013 .

[20]  Denis Larocque,et al.  Mixed effects regression trees for clustered data , 2008 .

[21]  Jeffrey S. Simonoff,et al.  RE-EM trees: a data mining approach for longitudinal and clustered data , 2011, Machine Learning.

[22]  G. Kauermann,et al.  Modeling Longitudinal Data with Ordinal Response by Varying Coefficients , 2000, Biometrics.

[23]  M LeBlanc,et al.  Binary partitioning for continuous longitudinal data: categorizing a prognostic variable , 2002, Statistics in medicine.

[24]  Leo Breiman,et al.  Random Forests , 2001, Machine Learning.

[25]  G. Tutz,et al.  Random effects in ordinal regression models , 1996 .

[26]  Ioannis Kosmidis,et al.  Improved estimation in cumulative link models , 2012, 1204.0105.

[27]  S. Rutkowski,et al.  A longitudinal study of the prevalence and characteristics of pain in the first 5 years following spinal cord injury , 2003, Pain.

[28]  K. Hornik,et al.  Generalized M‐fluctuation tests for parameter instability , 2007 .

[29]  D. Andrews Tests for Parameter Instability and Structural Change with Unknown Change Point , 1993 .

[30]  Wei-Yin Loh,et al.  Classification and regression trees , 2011, WIREs Data Mining Knowl. Discov..

[31]  Gerhard Tutz,et al.  Generalized linear random effects models with varying coefficients , 2003, Comput. Stat. Data Anal..

[32]  J. Nyblom Testing for the Constancy of Parameters over Time , 1989 .

[33]  Trevor Hastie,et al.  Boosted Varying-Coefficient Regression Models for Product Demand Prediction , 2014 .