Expect the gap: A recommender approach to estimate the absenteeism of self-monitoring mHealth app users

Adherence and phases of non-adherence in the usage of mHealth apps for self-monitoring generate time series characterized by gaps of varying duration. These reduce the knowledge that can be gained from the data because important observations are not captured. In this paper, an approach is presented that allows experts to estimate the possible duration of a user’s absence in order to build on it and take appropriate action.For this purpose, the users’ time series (with gaps and of different lengths) are decomposed into sequences that have no gaps anymore. Each sequence is associated with the duration of the subsequent gap. Using unsupervised binning, these gap values are sorted into a predefined number of categories. Sequences with similar gap durations are thus assigned the same category labels, grouped using unsupervised time series clustering and are assigned with cluster labels. Thus triplets are derived consisting of user ID, cluster label and gap category label. These triplets can then be used for collaborative filtering with matrix factorization. It is now possible to estimate the gap duration even if the same sequence has not yet been observed for the app user.The results show that the quality of binning depends on the appropriate choice of the number of categories, on the technique used, and on the maximum length of the gaps. In this example, a 5-star rating, the Fischer-Jenks algorithm and a maximum gap length of 30 days. We demonstrate how clustering can be used to gradually adapt the time series to the conditions of collaborative filtering and how complex matrix factorization models respond better to the complex structures of the data and lead to better results. In this work, K-Medoids and Agglomerative Clustering obtained applicable results and for matrix factorization SVD++.We believe that our approach provides a valuable back-end tool for experts to better assess the adherence of users to a selfmonitoring app.

[1]  Xile Zhao,et al.  Nonstationary Temporal Matrix Factorization for Multivariate Time Series Forecasting , 2022, ArXiv.

[2]  M. Spiliopoulou,et al.  When Can I Expect the mHealth User to Return? Prediction Meets Time Series with Gaps , 2022, AIME.

[3]  Roy T. Sabo,et al.  Testing for Phases of Dropout Attrition Using Change-Point Hazard Models , 2021 .

[4]  S. Wonderlich,et al.  Baseline and momentary predictors of ecological momentary assessment adherence in a sample of adults with binge-eating disorder. , 2021, Eating behaviors.

[5]  M. Spiliopoulou,et al.  Understanding adherence to the recording of ecological momentary assessments in the example of tinnitus monitoring , 2020, Scientific Reports.

[6]  Erick Giovani Sperandio Nascimento,et al.  Multivariate Real Time Series Data Using Six Unsupervised Machine Learning Algorithms , 2020, Anomaly Detection - Recent Advances, Issues and Challenges [Working Title].

[7]  Yan Guo,et al.  A Novel Approach Based on Matrix Factorization for Recovering Missing Time Series Sensor Data , 2020, IEEE Sensors Journal.

[8]  Nicolas Hug,et al.  Surprise: A Python library for recommender systems , 2020, J. Open Source Softw..

[9]  Alex H Krist,et al.  Identifying Attrition Phases in Survey Data: Applicability and Assessment Study , 2019, Journal of medical Internet research.

[10]  Myra Spiliopoulou,et al.  Editorial: Towards an Understanding of Tinnitus Heterogeneity , 2019, Front. Aging Neurosci..

[11]  P. Dijk,et al.  Towards an Understanding of Tinnitus Heterogeneity , 2019, Frontiers Research Topics.

[12]  Manfred Reichert,et al.  Measuring the Moment-to-Moment Variability of Tinnitus: The TrackYourTinnitus Smart Phone App , 2016, Front. Aging Neurosci..

[13]  Inderjit S. Dhillon,et al.  Temporal Regularized Matrix Factorization for High-dimensional Time Series Prediction , 2016, NIPS.

[14]  S. Woolf,et al.  Methods for Evaluating Respondent Attrition in Web-Based Surveys , 2016, Journal of medical Internet research.

[15]  S. Salway,et al.  Targeting the Use of Reminders and Notifications for Uptake by Populations (TURNUP): a systematic review and evidence synthesis , 2014 .

[16]  Yehuda Koren,et al.  Collaborative filtering with temporal dynamics , 2009, KDD.

[17]  Yehuda Koren,et al.  Factorization meets the neighborhood: a multifaceted collaborative filtering model , 2008, KDD.

[18]  P. Rousseeuw,et al.  Partitioning Around Medoids (Program PAM) , 2008 .

[19]  A. L. Dal-Fabbro,et al.  Adherence to long term therapies: evidence for action , 2005 .

[20]  G. Eysenbach The Law of Attrition , 2005, Journal of medical Internet research.

[21]  Daniel T. Larose,et al.  Discovering Knowledge in Data: An Introduction to Data Mining , 2005 .

[22]  Peter J. Rousseeuw,et al.  Finding Groups in Data: An Introduction to Cluster Analysis , 1990 .

[23]  G. Jenks The Data Model Concept in Statistical Mapping , 1967 .