Personalized Policy Learning Using Longitudinal Mobile Health Data

We address the personalized policy learning problem using longitudinal mobile health application usage data. Personalized policy represents a paradigm shift from developing a single policy that may prescribe personalized decisions by tailoring. Specifically, we aim to develop the best policy, one per user, based on estimating random effects under generalized linear mixed model. With many random effects, we consider new estimation method and penalized objective to circumvent high-dimension integrals for marginal likelihood approximation. We establish consistency and optimality of our method with endogenous app usage. We apply our method to develop personalized push ("prompt") schedules in 294 app users, with a goal to maximize the prompt response rate given past app usage and other contextual factors. We found the best push schedule given the same covariates varied among the users, thus calling for personalized policies. Using the estimated personalized policies would have achieved a mean prompt response rate of 23% in these users at 16 weeks or later: this is a remarkable improvement on the observed rate (11%), while the literature suggests 3%-15% user engagement at 3 months after download. The proposed method compares favorably to existing estimation methods including using the R function "glmer" in a simulation study.

[1]  M. Pepe,et al.  A cautionary note on inference for marginal regression models with longitudinal data and general correlated response data , 1994 .

[2]  Matt Sutton,et al.  The dynamics of physical and mental health in the older population , 2017, Journal of the economics of ageing.

[3]  Ambuj Tewari,et al.  From Ads to Interventions: Contextual Bandits in Mobile Health , 2017, Mobile Health - Sensors, Analytic Methods, and Applications.

[4]  Samuel Müller,et al.  Joint Selection in Mixed Models using Regularized PQL , 2017 .

[5]  Annie Qu,et al.  Personalize treatment for longitudinal data using unspecified random-effects model , 2017 .

[6]  Bibhas Chakraborty,et al.  Sequential multiple assignment randomized trial (SMART) with adaptive randomization for quality improvement in depression treatment program , 2015, Biometrics.

[7]  J. Smyth,et al.  Ecological momentary interventions: incorporating mobile technology into psychosocial and health behaviour treatments. , 2010, British journal of health psychology.

[8]  S. Murphy,et al.  Optimal dynamic treatment regimes , 2003 .

[9]  Susan A. Murphy,et al.  Linear mixed models under endogeneity: modeling sequential treatment effects with application to a mobile health study , 2019, 1902.10861.

[10]  N. Schork,et al.  The n-of-1 clinical trial: the ultimate strategy for individualizing medicine? , 2011, Personalized medicine.

[11]  Donglin Zeng,et al.  Estimating Individualized Treatment Rules Using Outcome Weighted Learning , 2012, Journal of the American Statistical Association.

[12]  Eric B. Laber,et al.  Interactive model building for Q-learning. , 2014, Biometrika.

[13]  C. Depp,et al.  Mobile Interventions for Severe Mental Illness: Design and Preliminary Data From Three Approaches , 2010, The Journal of nervous and mental disease.

[14]  Donglin Zeng,et al.  New Statistical Learning Methods for Estimating Optimal Dynamic Treatment Regimes , 2015, Journal of the American Statistical Association.

[15]  M. Yuan,et al.  Model selection and estimation in regression with grouped variables , 2006 .

[16]  Rebecca C Rossom,et al.  IntelliCare: An Eclectic, Skills-Based App Suite for the Treatment of Depression and Anxiety , 2017, Journal of medical Internet research.

[17]  T. Steihaug The Conjugate Gradient Method and Trust Regions in Large Scale Optimization , 1983 .

[18]  Susan A. Murphy,et al.  Linear mixed models with endogenous covariates: modeling sequential treatment effects with application to a mobile health study. , 2019, Statistical science : a review journal of the Institute of Mathematical Statistics.

[19]  Ambuj Tewari,et al.  An Actor-Critic Contextual Bandit Algorithm for Personalized Mobile Health Interventions , 2017, ArXiv.

[20]  S. Nash A survey of truncated-Newton methods , 2000 .

[21]  Eric B. Laber,et al.  A Robust Method for Estimating Optimal Treatment Regimes , 2012, Biometrics.

[22]  L. Piwek,et al.  The Rise of Consumer Health Wearables: Promises and Barriers , 2016, PLoS medicine.

[23]  S. Murphy,et al.  PERFORMANCE GUARANTEES FOR INDIVIDUALIZED TREATMENT RULES. , 2011, Annals of statistics.

[24]  David C. Mohr,et al.  Evaluation of a recommender app for apps for the treatment of depression and anxiety: an analysis of longitudinal user engagement , 2018, J. Am. Medical Informatics Assoc..

[25]  M R Kosorok,et al.  Penalized Q-Learning for Dynamic Treatment Regimens. , 2011, Statistica Sinica.

[26]  W. Nilsen,et al.  Health behavior models in the age of mobile interventions: are our theories up to the task? , 2011, Translational behavioral medicine.

[27]  D. Bates,et al.  Fitting Linear Mixed-Effects Models Using lme4 , 2014, 1406.5823.

[28]  R. Strawderman,et al.  Constructing dynamic treatment regimes over indefinite time horizons , 2018, Biometrika.

[29]  H. Christensen,et al.  Adherence in Internet Interventions for Anxiety and Depression: Systematic Review , 2009, Journal of medical Internet research.

[30]  Michael R. Kosorok,et al.  Estimating Dynamic Treatment Regimes in Mobile Health Using V-Learning , 2016, Journal of the American Statistical Association.

[31]  David C Mohr,et al.  Continuous evaluation of evolving behavioral intervention technologies. , 2013, American journal of preventive medicine.

[32]  B. Wansink,et al.  Factors Related to Sustained Use of a Free Mobile App for Dietary Self-Monitoring With Photography and Peer Feedback: Retrospective Cohort Study , 2014, Journal of medical Internet research.

[33]  Ambuj Tewari,et al.  An Actor-Critic Contextual Bandit Algorithm for Personalized Interventions using Mobile Devices , 2014 .

[34]  Johannes Schöning,et al.  Falling asleep with Angry Birds, Facebook and Kindle: a large scale study on mobile application usage , 2011, Mobile HCI.

[35]  S. Murphy,et al.  Assessing Time-Varying Causal Effect Moderation in Mobile Health , 2016, Journal of the American Statistical Association.

[36]  P. Diggle,et al.  Analysis of Longitudinal Data , 2003 .