Identifying clinical course patterns in SMS data using cluster analysis

BackgroundRecently, there has been interest in using the short message service (SMS or text messaging), to gather frequent information on the clinical course of individual patients. One possible role for identifying clinical course patterns is to assist in exploring clinically important subgroups in the outcomes of research studies. Two previous studies have investigated detailed clinical course patterns in SMS data obtained from people seeking care for low back pain. One used a visual analysis approach and the other performed a cluster analysis of SMS data that had first been transformed by spline analysis. However, cluster analysis of SMS data in its original untransformed form may be simpler and offer other advantages. Therefore, the aim of this study was to determine whether cluster analysis could be used for identifying clinical course patterns distinct from the pattern of the whole group, by including all SMS time points in their original form. It was a ‘proof of concept’ study to explore the potential, clinical relevance, strengths and weakness of such an approach.MethodsThis was a secondary analysis of longitudinal SMS data collected in two randomised controlled trials conducted simultaneously from a single clinical population (n = 322). Fortnightly SMS data collected over a year on ‘days of problematic low back pain’ and on ‘days of sick leave’ were analysed using Two-Step (probabilistic) Cluster Analysis.ResultsClinical course patterns were identified that were clinically interpretable and different from those of the whole group. Similar patterns were obtained when the number of SMS time points was reduced to monthly. The advantages and disadvantages of this method were contrasted to that of first transforming SMS data by spline analysis.ConclusionsThis study showed that clinical course patterns can be identified by cluster analysis using all SMS time points as cluster variables. This method is simple, intuitive and does not require a high level of statistical skill. However, there are alternative ways of managing SMS data and many different methods of cluster analysis. More research is needed, especially head-to-head studies, to identify which technique is best to use under what circumstances.

[1]  Mohammad Ghodsi,et al.  Comparison of artificial neural network and logistic regression models for prediction of mortality in head trauma based on initial clinical data , 2005, BMC Medical Informatics Decis. Mak..

[2]  C. Maher,et al.  A Guide to Interpretation of Studies Investigating Subgroups of Responders to Physical Therapy Interventions , 2009, Physical Therapy.

[3]  William S Marras,et al.  Low Back Pain Recurrence in Occupational Environments , 2007, Spine.

[4]  I. Axén,et al.  Prevalence of pain-free weeks in chiropractic subjects with low back pain - a longitudinal study using data gathered with text messages , 2011, Chiropractic & manual therapies.

[5]  J. Hair Multivariate data analysis , 1972 .

[6]  Peter M. Smith,et al.  The recovery patterns of back pain among workers with compensated occupational back injuries , 2007, Occupational and Environmental Medicine.

[7]  Danilo P. Mandic,et al.  Recurrent Neural Networks for Prediction: Learning Algorithms, Architectures and Stability , 2001 .

[8]  K. Jordan,et al.  Trajectories of pain in adolescents: A prospective cohort study , 2011, PAIN®.

[9]  M. Kohler Wallace CS: Statistical and inductive inference by minimum message length , 2006 .

[10]  A. Hausheer,et al.  Comparison of Stratified Primary Care Management for Low Back Pain with Current Best Practice (STarTBack): A Randomised Controlled Trial , 2013, physioscience.

[11]  Ricky Mullis,et al.  A primary care back pain screening tool: identifying patient subgroups for initial treatment. , 2008, Arthritis and rheumatism.

[12]  N. Wedderkopp,et al.  Rest versus exercise as treatment for patients with low back pain and Modic changes. a randomized controlled clinical trial , 2012, BMC Medicine.

[13]  John D. Childs,et al.  A Clinical Prediction Rule To Identify Patients with Low Back Pain Most Likely To Benefit from Spinal Manipulation: A Validation Study , 2004, Annals of Internal Medicine.

[14]  Niels Wedderkopp,et al.  Comparison between data obtained through real-time data capture by SMS and a retrospective telephone interview , 2010, Chiropractic & osteopathy.

[15]  J. Concato,et al.  A simulation study of the number of events per variable in logistic regression analysis. , 1996, Journal of clinical epidemiology.

[16]  J. Concato,et al.  The Risk of Determining Risk with Multivariable Models , 1993, Annals of Internal Medicine.

[17]  Alice Kongsted,et al.  The Nordic back pain subpopulation program: course patterns established through weekly follow-ups in patients treated for low back pain , 2010, Chiropractic & osteopathy.

[18]  Mevlut Ture,et al.  Comparing performances of logistic regression, classification and regression tree, and neural networks for predicting coronary artery disease , 2008, Expert Syst. Appl..

[19]  C. Maher,et al.  Definitions of Recurrence of an Episode of Low Back Pain: A Systematic Review , 2009, Spine.

[20]  B. Horisberger,et al.  The course of chronic and recurrent low back pain in the general population , 2010, Pain.

[21]  Peter Grünwald,et al.  Invited review of the book Statistical and Inductive Inference by Minimum Message Length , 2006 .

[22]  K. Jordan,et al.  Characterizing the course of low back pain: a latent class analysis. , 2006, American journal of epidemiology.

[23]  Danilo P. Mandic,et al.  Recurrent Neural Networks for Prediction , 2001 .

[24]  Cluster Analysis Gets Complicated , 2022 .

[25]  C. Bulpitt Randomised Controlled Clinical Trials , 1983, Developments in Biostatistics and Epidemiology.

[26]  Lennart Bodin,et al.  Clustering patients on the basis of their individual course of low back pain over a six month period , 2011, BMC musculoskeletal disorders.

[27]  Brian Everitt,et al.  Cluster analysis , 1974 .

[28]  N. Foster,et al.  Subgrouping patients with low back pain in primary care: are we getting any better at it? , 2011, Manual therapy.

[29]  M. Sarstedt,et al.  A Concise Guide to Market Research , 2019, Springer Texts in Business and Economics.

[30]  P. Sopp Cluster analysis. , 1996, Veterinary immunology and immunopathology.

[31]  S. Bryan,et al.  Comparison of Stratified Primary Care Management for Low Back Pain with Current Best Practice (STarTBack): A Randomised Controlled Trial , 2013, physioscience.