Pattern‐based clustering of daily weigh‐in trajectories using dynamic time warping

“Smart”‐scales are a new tool for frequent monitoring of weight change as well as weigh‐in behavior. These scales give researchers the opportunity to discover patterns in the frequency that individuals weigh themselves over time, and how these patterns are associated with overall weight loss. Our motivating data come from an 18‐month behavioral weight loss study of 55 adults classified as overweight or obese who were instructed to weigh themselves daily. Adherence to daily weigh‐in routines produces a binary times series for each subject, indicating whether a participant weighed in on a given day. To characterize weigh‐in by time‐invariant patterns rather than overall adherence, we propose using hierarchical clustering with dynamic time warping (DTW). We perform an extensive simulation study to evaluate the performance of DTW compared to Euclidean and Jaccard distances to recover underlying patterns in adherence time series. In addition, we compare cluster performance using cluster validation indices (CVIs) under the single, average, complete, and Ward linkages and evaluate how internal and external CVIs compare for clustering binary time series. We apply conclusions from the simulation to cluster our real data and summarize observed weigh‐in patterns. Our analysis finds that the adherence trajectory pattern is significantly associated with weight loss.

[1]  E. Melanson,et al.  Comparison of weight loss induced by daily caloric restriction versus intermittent fasting (DRIFT) in individuals with obesity: study protocol for a 52-week randomized clinical trial , 2022, Trials.

[2]  J. Pépin,et al.  Continuous positive airway pressure adherence trajectories in sleep apnea: Clustering with summed discrete Fréchet and dynamic time warping dissimilarities , 2021, Statistics in medicine.

[3]  Xin Liu,et al.  An Unsupervised Learning Method with Convolutional Auto-Encoder for Vessel Trajectory Similarity Computation , 2021, Ocean Engineering.

[4]  M. Marshall,et al.  DYNAMIC TIME WARPING FOR CROPS MAPPING , 2020, The International Archives of the Photogrammetry, Remote Sensing and Spatial Information Sciences.

[5]  Jingxian Liu,et al.  Adaptively constrained dynamic time warping for time series classification and clustering , 2020, Inf. Sci..

[6]  R. Krukowski,et al.  Measuring Weight with Electronic Scales in Clinical and Research Settings During the Coronavirus Disease 2019 Pandemic , 2020, Obesity.

[7]  Volkmar Frinken,et al.  DTW-NN: A novel neural network for time series recognition using dynamic alignment between inputs and weights , 2020, Knowl. Based Syst..

[8]  Shweta Sharma,et al.  Comparative Study of Single Linkage, Complete Linkage, and Ward Method of Agglomerative Clustering , 2019, 2019 International Conference on Machine Learning, Big Data, Cloud and Parallel Computing (COMITCon).

[9]  G. Tison,et al.  Temporal patterns of self-weighing behavior and weight changes assessed by consumer purchased scales in the Health eHeart Study , 2019, Journal of Behavioral Medicine.

[10]  Pedro Antonio Gutiérrez,et al.  Time-Series Clustering Based on the Characterization of Segment Typologies , 2018, IEEE Transactions on Cybernetics.

[11]  Elke A. Rundensteiner,et al.  Generalized Dynamic Time Warping: Unleashing the Warping Power Hidden in Point-Wise Distances , 2018, 2018 IEEE 34th International Conference on Data Engineering (ICDE).

[12]  Chin-Teng Lin,et al.  A review of clustering techniques and developments , 2017, Neurocomputing.

[13]  R. Wing,et al.  Characterizing the Pattern of Weight Loss and Regain in Adults Enrolled in a 12-Week Internet-Based Weight Management Program , 2017, Obesity.

[14]  C. Pacanowski,et al.  Three-Year Follow-Up of Participants from a Self-Weighing Randomized Controlled Trial , 2017, Journal of obesity.

[15]  Hossein Hamooni,et al.  AWarp: Fast Warping Distance for Sparse Time Series , 2016, 2016 IEEE 16th International Conference on Data Mining (ICDM).

[16]  Lars Schmidt-Thieme,et al.  Learning DTW-Shapelets for Time-Series Classification , 2016, CODS.

[17]  Nitin Pise,et al.  Clustering Techniques and the Similarity Measures used in Clustering: A Survey , 2016 .

[18]  Sadaaki Miyamoto,et al.  Ward method of hierarchical clustering for non-Euclidean similarity measures , 2015, 2015 7th International Conference of Soft Computing and Pattern Recognition (SoCPaR).

[19]  A. Vinaya Babu,et al.  Quantitative Evaluation of Performance and Validity Indices for Clustering the Web Navigational Sessions , 2015, ArXiv.

[20]  René S. Kahn,et al.  A comparison of indices for identifying the number of clusters in hierarchical clustering: A study on cognition in schizophrenia patients , 2015 .

[21]  Dori M. Steinberg,et al.  Weighing every day matters: daily weighing improves weight loss and adoption of weight control behaviors. , 2015, Journal of the Academy of Nutrition and Dietetics.

[22]  D. Bond,et al.  Review of Innovations in Digital Health Technology to Promote Weight Control , 2014, Current Diabetes Reports.

[23]  Gustavo E. A. P. A. Batista,et al.  An Empirical Comparison of Dissimilarity Measures for Time Series Classification , 2013, 2013 Brazilian Conference on Intelligent Systems.

[24]  Deborah F. Tate,et al.  The efficacy of a daily self-weighing weight loss intervention using smart scales and email , 2013, Obesity.

[25]  Eamonn J. Keogh,et al.  Clustering Time Series Using Unsupervised-Shapelets , 2012, 2012 IEEE 12th International Conference on Data Mining.

[26]  Elizabeth Ann Maharaj,et al.  Wavelets-based clustering of multivariate time series , 2012, Fuzzy Sets Syst..

[27]  Hui Xiong,et al.  Understanding of Internal Clustering Validation Measures , 2010, 2010 IEEE International Conference on Data Mining.

[28]  Sarah C. Goslee,et al.  The ecodist Package for Dissimilarity-based Analysis of Ecological Data , 2007 .

[29]  Vit Niennattrakul,et al.  On Clustering Multimedia Time Series Data Using K-Means and Dynamic Time Warping , 2007, 2007 International Conference on Multimedia and Ubiquitous Engineering (MUE'07).

[30]  Edward R. Dougherty,et al.  Model-based evaluation of clustering validation measures , 2007, Pattern Recognit..

[31]  Nicolaas P. Pronk,et al.  Self-weighing in weight gain prevention and weight loss trials , 2005, Annals of behavioral medicine : a publication of the Society of Behavioral Medicine.

[32]  R. Wing,et al.  Long-term weight loss maintenance. , 2005, The American journal of clinical nutrition.

[33]  Philip Chan,et al.  Determining the number of clusters/segments in hierarchical clustering/segmentation algorithms , 2004, 16th IEEE International Conference on Tools with Artificial Intelligence.

[34]  James W. Anderson,et al.  Long-term weight-loss maintenance: a meta-analysis of US studies. , 2001, The American journal of clinical nutrition.

[35]  Eamonn J. Keogh,et al.  Scaling up dynamic time warping for datamining applications , 2000, KDD '00.

[36]  Donald J. Berndt,et al.  Using Dynamic Time Warping to Find Patterns in Time Series , 1994, KDD Workshop.

[37]  B.-H. Juang,et al.  On the hidden Markov model and dynamic time warping for speech recognition — A unified view , 1984, AT&T Bell Laboratories Technical Journal.

[38]  Alexis Sardá-Espinosa,et al.  Time-Series Clustering in R Using the dtwclust Package , 2019, R J..

[39]  Shai Ben-David,et al.  A Characterization of Linkage-Based Hierarchical Clustering , 2016, J. Mach. Learn. Res..

[40]  Maciej Luczak,et al.  Hierarchical clustering of time series data with parametric derivative dynamic time warping , 2016, Expert Syst. Appl..

[41]  R. Wing,et al.  Weight-loss maintenance for 10 years in the National Weight Control Registry. , 2014, American journal of preventive medicine.

[42]  R Core Team,et al.  R: A language and environment for statistical computing. , 2014 .

[43]  Olatz Arbelaitz,et al.  An extensive comparative study of cluster validity indices , 2013, Pattern Recognit..

[44]  Lior Rokach,et al.  Data Mining And Knowledge Discovery Handbook , 2005 .

[45]  L. Rokach,et al.  Clustering Methods , 2005, The Data Mining and Knowledge Discovery Handbook.